Link Harvest Debug Results

Posted in: WordPress, Development

The debug versions of Link Harvest have yielded a mess of inconsistent data. It appears that the link harvest action can fail for any number of reasons.

Some of them:

  • PHP running out of memory.
  • Hitting the script execution time limit.
  • Downloading a media file (.mp3, .mov, etc.).
  • Overzealous firewalls and plugins like Bad Behavior.
  • A myriad of unknown reasons.

I’m going to address the media file issue and explicitly set a long script timeout in a 1.1 release and see if that makes much of a difference. I’m not sure when that will be available, but I wanted to post the results of the debug efforts and thank those that sent in their debug logs.

Popularity: 7% [?]

Posted March 8th, 2007 @ 7:22 PM

10 Replies

  1. Britgirl adds this Comment:

    You could add to that plugins like Ultimate Tag Warrior. I had to de-activate Link Harvest as it doesn’t appear to work with UTW. For example I could not add any tags when I had LH turned on. Once I turned it off, I could add tags to my posts without a problem. It currently remains deactivated.

    March 8th, 2007 at 9:05 pm

  2. Laundro adds this Comment:

    Thanks for all your efforts Alex. You write some of the best plugins available.

    March 9th, 2007 at 5:50 am

  3. fmf adds this Comment:

    Glad to read you can reach some point with debugs, even if not all the problems will be solved.
    Thanks for this good plugin.

    March 13th, 2007 at 9:10 am

  4. Cigar Jack adds this Comment:

    I was able to get mine to work by disabling other plugins just while I ran the harvest and also told it to ignore a few links.

    April 4th, 2007 at 5:42 pm

  5. Funky Dung adds this Comment:

    Any progress on this?

    April 11th, 2007 at 7:32 pm

  6. Alex adds this Comment:

    At some point I’ll have time to finish coding in the exceptions and release a new version. I think I checked some stuff in to the SVN repo, but I don’t recall if it’s in a good state or not.

    April 11th, 2007 at 11:38 pm

  7. Pi adds this Comment:

    Hi Alex, and thanks for your efforts. I’d suggest you that you find out where in the code the error is triggered. In other words, which of all the calls to harvest_error() is the culprit. In my tests, I found out that it’s in
    if (progress == “”)
    (that’s line 797)

    The other errors are never triggered.

    I also found out that the sum of posts+pages in my site is 14 (=6+8), but LH reports 9 when doing the harvest.

    April 17th, 2007 at 9:17 pm

  8. Alex adds this Comment:

    I have nearly a dozen different types of errors reported, some are due to broken server configs, some are due to timeouts, some are due to running out of memory, some are due to script execution limits…

    I really wish there was just one issue here.

    April 17th, 2007 at 9:22 pm

  9. Pi adds this Comment:

    Hi again. I discovered something which might be useful. I have my site at MochaHost, but also locally (with my own Apache+PHP server). When I try LH in my local site, it doesn’t run. When I try it in the remote site, it does run. Maybe it’s a different setting in PHP’s configuration?

    April 17th, 2007 at 9:30 pm

  10. Alex adds this Comment:

    Exactly.

    April 18th, 2007 at 6:42 am

Add a Comment

Please note: Use of a non-personal web site or blog in the field below and/or comments that are off-topic, personal attacks, or support requests will likely be removed at my discretion.

Note: This post is over a year old. You may want to check later in this blog to see if there is new information relevant to your comment.

Around the web » « Text Link Ads Sponsorship

About This Site

This is the personal web site of Alex King, an independent developer based in Denver, Colorado USA. More...


Crowd Favorite

Crowd Favorite is my software and web development business.

We build web applications, design and develop custom WordPress themes and plugins, and build custom sites using WordPress as a CMS.


I also have a tumblog that aggregates my online content from other services (Twitter, Flickr, del.icio.us. etc.).

I'm voting for Barack Obama

Ads