-- Yioop 9 on Windows, PHP 7.4 and Apache 2.4 not crawling
Yioop is dependent on curl, so as long as you have a modern copy of curl library installed, it should be HTTP/2 capable. I haven't added HTTP/3 support yet. You are starting your crawl from a http:// seed site, so if that page redirects to https:// you might get a redirect message. Again, that wouldn't explain why you are getting no search results. On the other hand, when I look at your page options, it looks like you have created a classifier that you are using to classify which web pages should be indexed. Did you try turning that off? Also, if you only check htm, html, it will only index web pages with that extension. So the page https://foo.com/ will not be indexed, because it does not have either extension, but https://foo.com/index.html would be indexed. The screening is done before any attempt at downloading the page, so at that point the mime type is not known. You can check unknown if you think it is okay to crawl https://foo.com/ . Also, I would tend to check at a minimum .txt as well.
Chris
(
Edited: 2022-07-27)
Yioop is dependent on curl, so as long as you have a modern copy of curl library installed, it should be HTTP/2 capable. I haven't added HTTP/3 support yet. You are starting your crawl from a http:// seed site, so if that page redirects to https:// you might get a redirect message. Again, that wouldn't explain why you are getting no search results. On the other hand, when I look at your page options, it looks like you have created a classifier that you are using to classify which web pages should be indexed. Did you try turning that off? Also, if you only check htm, html, it will only index web pages with that extension. So the page https://foo.com/ will not be indexed, because it does not have either extension, but https://foo.com/index.html would be indexed. The screening is done before any attempt at downloading the page, so at that point the mime type is not known. You can check unknown if you think it is okay to crawl https://foo.com/ . Also, I would tend to check at a minimum .txt as well.
Chris