2012-03-19

Version 0.84 Released!.

Hey you guys, you should use the discussion board to talk about your projects more. In any case...

The crawler now has its own DNS caching mechanism independent of cURL's. Yioop now has a detection mechanism for when websites are becoming congested. One can also set a quota on the number of urls downloaded/hour from sites. A webcrawl statistics page can now be generated for a crawl. A bug in robots.txt handling as well as a bug in archive handling that were introduced in Version 0.82 have been fixed. The demo site now an example crawl of 100 million pages crawled with the previous version of the software.
Hey you guys, you should use the discussion board to talk about your projects more. In any case...<br><br>The crawler now has its own DNS caching mechanism independent of cURL's. Yioop now has a detection mechanism for when websites are becoming congested. One can also set a quota on the number of urls downloaded/hour from sites. A webcrawl statistics page can now be generated for a crawl. A bug in robots.txt handling as well as a bug in archive handling that were introduced in Version 0.82 have been fixed. The demo site now an example crawl of 100 million pages crawled with the previous version of the software.
X