Detecting 'stealth' web-crawlers

小鲜肉 2020-11-28 00:15

What options are there to detect web-crawlers that do not want to be detected?

(I know that listing detection techniques will allow the smart stealth-crawle

11 Answers
  •  星月不相逢
    2020-11-28 01:03

    An easy solution is to create a link and make it invisible to human visitors:

    Don't click me!
    

    Of course, you should expect that some people who read the source code will follow that link just to see where it leads. You could present those users with a CAPTCHA.

    Legitimate crawlers would, of course, also follow the link. Rather than adding rel=nofollow to it, look for the signs of a legitimate crawler, such as its user agent.
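
The hidden-link idea above can be sketched on the server side roughly as follows. This is a minimal illustration, not the answerer's implementation: the trap path `/do-not-follow`, the `HoneypotDetector` class, and the list of crawler tokens are all hypothetical names chosen for the example. Note also that a user agent string can be spoofed, so treating it as proof of a legitimate crawler is an assumption of this sketch.

```python
from dataclasses import dataclass, field

# Hypothetical trap URL: reachable only through the hidden link,
# so ordinary visitors should never request it.
TRAP_PATH = "/do-not-follow"

# Illustrative substrings of crawlers that identify themselves.
# Assumption: not exhaustive, and user agents can be forged.
KNOWN_CRAWLER_TOKENS = ("googlebot", "bingbot", "duckduckbot")


@dataclass
class HoneypotDetector:
    """Remembers clients that followed the hidden link without identifying themselves."""

    flagged_ips: set = field(default_factory=set)

    def classify(self, path: str, user_agent: str, ip: str) -> str:
        """Return "allow" or "challenge" (for example, show a CAPTCHA)."""
        # A client that already hit the trap keeps getting challenged.
        if ip in self.flagged_ips:
            return "challenge"
        # Normal pages are served as usual.
        if path != TRAP_PATH:
            return "allow"
        # Self-declared crawlers are expected to follow every link.
        ua = user_agent.lower()
        if any(token in ua for token in KNOWN_CRAWLER_TOKENS):
            return "allow"
        # Anything else that followed the invisible link is suspicious.
        self.flagged_ips.add(ip)
        return "challenge"
```

Returning "challenge" rather than blocking outright matches the suggestion above: a curious human who found the link in the page source can still pass a CAPTCHA, while an undeclared crawler is slowed down.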
