How do I hide a site from search engines? [closed]

Posted by 空扰寡人 on 2019-12-23 07:06:20

Question


I am looking to hide a site so that it doesn't show up in any search engine. How would I go about this?


Answer 1:


Use a robots.txt file: http://www.google.com/support/webmasters/bin/answer.py?answer=156449




Answer 2:


Apart from password-protecting your site, you could add these lines to robots.txt:

User-agent: *
Disallow: /

This doesn't hide the site but rather instructs bots not to spider the content.
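One way to check that these two lines do what the answer says is to feed them to Python's standard-library robots.txt parser. This is only a sketch: the crawler names and the example.com URL are placeholders, and it shows how a well-behaved crawler would interpret the rules, not what any particular search engine actually does.

```python
from urllib.robotparser import RobotFileParser

# The rules from the answer above.
ROBOTS_TXT = """\
User-agent: *
Disallow: /
"""


def is_allowed(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if a compliant crawler may fetch `url` under these rules."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    # Placeholder crawler names and URL, for illustration only.
    for agent in ("Googlebot", "Bingbot", "SomeOtherBot"):
        allowed = is_allowed(ROBOTS_TXT, agent, "https://example.com/any/page.html")
        print(agent, "allowed" if allowed else "blocked")
```

Every compliant agent is blocked from every path, but nothing here stops a crawler that simply ignores the file.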




Answer 3:


You can somewhat reduce the chance of your site being listed by using a robots.txt file. Note that this depends on the "goodwill" of the crawler, though (some spambots will explicitly look at the locations that you disallow).

The only safe and reliable way of not having a site listed, sadly, is not putting it on the internet.

Simply not linking to your site will not work. Crawlers get their info from many sources, including browser referrers and domain registrars. So, in order to be "invisible", you would have to not visit your site and not register a domain (only access it via IP address).
And then, if you run your webserver based on IP address, you still have all the spambots probing random addresses. It will take a while, but they will find you.

Password-protecting your site should work, since it effectively makes the content inaccessible to crawlers. That said (and it is beyond my comprehension how this happens), there are literally thousands of ACM papers listed in Google which you cannot see without logging in to an account. Yet they are there.
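To illustrate why password protection keeps crawlers out, here is a minimal sketch of an HTTP Basic authentication check. The user name, password, and function names are hypothetical and exist only for this example; in practice you would normally rely on your web server's built-in authentication rather than hand-written code like this.

```python
import base64
import hmac

# Hypothetical credentials, for illustration only.
EXPECTED_USER = "editor"
EXPECTED_PASSWORD = "change-me"


def is_authorized(authorization_header):
    """Return True if the header carries the expected Basic credentials."""
    if not authorization_header:
        return False  # crawlers normally send no credentials at all
    scheme, _, encoded = authorization_header.partition(" ")
    if scheme.lower() != "basic" or not encoded:
        return False
    try:
        decoded = base64.b64decode(encoded.strip(), validate=True).decode("utf-8")
    except (ValueError, UnicodeDecodeError):
        return False  # malformed base64 or not valid UTF-8
    user, separator, password = decoded.partition(":")
    if not separator:
        return False
    # compare_digest avoids leaking information through timing differences.
    user_ok = hmac.compare_digest(user.encode(), EXPECTED_USER.encode())
    password_ok = hmac.compare_digest(password.encode(), EXPECTED_PASSWORD.encode())
    return user_ok and password_ok


def response_status(authorization_header):
    """Status code the server would send: 200 with content, or 401 without."""
    return 200 if is_authorized(authorization_header) else 401
```

A request without valid credentials only ever receives a 401 response, so a crawler has no page content to index.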




Answer 4:


Use a robots.txt file that disallows all search engines. Not all crawlers respect robots.txt, so check your server logs regularly and deny access from the IP ranges of suspected robots/crawlers:

http://httpd.apache.org/docs/2.2/howto/access.html
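The log-checking step could be sketched roughly as follows. The CIDR ranges are reserved documentation addresses used as placeholders, not real crawler ranges, and the sketch assumes that the client IP is the first whitespace-separated field of each log line (as in Apache's common log format). The actual blocking would still be configured in the web server as described in the linked documentation.

```python
import ipaddress

# Placeholder ranges (reserved documentation networks), not real crawler ranges.
DENIED_RANGES = [
    ipaddress.ip_network(cidr) for cidr in ("192.0.2.0/24", "198.51.100.0/24")
]


def is_denied(client_ip):
    """Return True if client_ip falls inside any denied range."""
    address = ipaddress.ip_address(client_ip)
    return any(address in network for network in DENIED_RANGES)


def denied_clients(log_lines):
    """Return distinct client IPs from the log that fall in a denied range.

    Assumes the client IP is the first whitespace-separated field of a line.
    """
    seen = []
    for line in log_lines:
        fields = line.split()
        if not fields:
            continue
        client_ip = fields[0]
        try:
            if is_denied(client_ip) and client_ip not in seen:
                seen.append(client_ip)
        except ValueError:
            continue  # first field was not an IP address; skip the line
    return seen
```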




Answer 5:


You use a robots.txt file. Place the file in the root of the site with this content:

User-agent: *
Disallow: /
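The "root of the site" matters because crawlers look for the file at /robots.txt on the host, not next to individual pages. As a small sketch (the function name and the example.com URL are made up for illustration), this derives the location a crawler would check for any given page:

```python
from urllib.parse import urlsplit, urlunsplit


def robots_txt_url(page_url):
    """Return the robots.txt URL that applies to the host serving page_url."""
    parts = urlsplit(page_url)
    # Keep scheme and host; replace path, query and fragment.
    return urlunsplit((parts.scheme, parts.netloc, "/robots.txt", "", ""))


if __name__ == "__main__":
    print(robots_txt_url("https://example.com/blog/post?id=1"))
```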



Answer 6:


Most proper search engines use bots or crawlers to visit websites and index them. You could use the robots.txt file method.




Answer 7:


Have a look at the Wikipedia article on nofollow.




Answer 8:


You need to read about the robots.txt file, which you are supposed to place in your site's webroot: http://www.robotstxt.org/robotstxt.html.



Source: https://stackoverflow.com/questions/6732653/how-do-i-hide-a-site-from-search-engines
