Googlebots Ignoring robots.txt? [closed]

那年仲夏 提交于 2019-12-22 04:35:22

问题


I have a site with the following robots.txt in the root:

User-agent: *
Disabled: /

User-agent: Googlebot 
Disabled: /

User-agent: Googlebot-Image
Disallow: /

And pages within this site are getting scanned by Googlebots all day long. Is there something wrong with my file or with Google?


回答1:


It should be Disallow:, not Disabled:.




回答2:


Maybe give the Google robots.txt checker a try




回答3:


Google have an analysis tool for checking robots.txt entries, read about it here

You might also want to check the IP addresses of the "rogue" robots to see if they really are owned by Google.




回答4:


Also I believe that the bot goes down the page and takes the first directive that applies to it. In your case, Googlebot and Googlebot-Image would never see their specific directives because they would respect the "User-Agent: *" first.

Disregard this answer. I found information that points to this not being the case. The bot should find the directive specific to it and respect it



来源:https://stackoverflow.com/questions/344697/googlebots-ignoring-robots-txt

易学教程内所有资源均来自网络或用户发布的内容,如有违反法律规定的内容欢迎反馈
该文章没有解决你所遇到的问题?点击提问,说说你的问题,让更多的人一起探讨吧!