I really do love Yioop, but it seems because it stores the crawl, and settings, data in the Work Directory, instead of using something like MySQL or the Even better Hadoop system to store the data it is extremely slow even on small crawls, and can't possibly stand up against other search solutions like Nutch or ElasticSearch. So I thought I might as well at least suggest that as a Goal for the next few updates maybe the project could move away from the Work Directory, and onto a more conventional and faster system like Hadoop. Which may allow the Yioop Software to gain some popularity.
Also as a Note in its current state even with an SSD it takes around 30 seconds to rank 10 results for a new search term.
I really do love Yioop, but it seems because it stores the crawl, and settings, data in the Work Directory, instead of using something like MySQL or the Even better Hadoop system to store the data it is extremely slow even on small crawls, and can't possibly stand up against other search solutions like Nutch or ElasticSearch. So I thought I might as well at least suggest that as a Goal for the next few updates maybe the project could move away from the Work Directory, and onto a more conventional and faster system like Hadoop. Which may allow the Yioop Software to gain some popularity.
Also as a Note in its current state even with an SSD it takes around 30 seconds to rank 10 results for a new search term.