2013-07-24

Version 0.96 of Yioop Released!.

This version includes a new hybrid inverted index/suffix tree indexing scheme which should make calculating search results from future crawls faster (doesn't affect old crawls). Yioop can now make use of HTTP ETag: and Expire: information when deciding whether to download a URL it has seen before. Yioop now also supports the creation of classifiers using active learning. These can be used to label and add scoring information to documents during a crawl. Version 0.96 also includes improvements to the RSS feed news_updater and a segmenter for Chinese.
This version includes a new hybrid inverted index/suffix tree indexing scheme which should make calculating search results from future crawls faster (doesn't affect old crawls). Yioop can now make use of HTTP ETag: and Expire: information when deciding whether to download a URL it has seen before. Yioop now also supports the creation of classifiers using active learning. These can be used to label and add scoring information to documents during a crawl. Version 0.96 also includes improvements to the RSS feed news_updater and a segmenter for Chinese.
X