Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

I did run a Yacy web crawler (P2P websearch https://yacy.net) a while ago. As far I remember I just saw Yandex for a few times disallowed in the robots.txt when I had trouble with crawling a site. Mostly I just got an empty website for my Yacy crawler instead the "real" Website.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: