Detecting 'stealth' web-crawlers

后端 未结 11 1637
小鲜肉
小鲜肉 2020-11-28 00:15

What options are there to detect web-crawlers that do not want to be detected?

(I know that listing detection techniques will allow the smart stealth-crawle

11条回答
  •  佛祖请我去吃肉
    2020-11-28 00:56

    It's not actually that easy to keep up with the good user agent strings. Browser versions come and go. Making a statistic about user agent strings by different behaviors can reveal interesting things.

    I don't know how far this could be automated, but at least it is one differentiating thing.

提交回复
热议问题