Methods for preventing search engines from indexing irrelevant content on a page

故里飘歌 2020-12-17 16:29

I'm looking for ways to prevent indexing of parts of a page. Specifically, comments on a page, since they weigh up entries a lot based on what users have written. This make

4 Answers
  • 2020-12-17 16:48

    JavaScript is an option, but engines are getting better at reading JavaScript. To be honest, I think you're reading too much into it. Engines love unique content: the more content you have on each page the better, and if the users are providing it, it's the holy grail.

    Just because a commenter made a reference to Star Wars on your toaster review doesn't mean you're not going to rank for the toaster model; it just means you might also rank for "star wars toaster".

    Another idea: show comments only to people who are logged in. CollegeHumor does the same, I believe: they show the number of comments a post has, but you have to log in to see them.
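
    The logged-in-only idea above can be sketched on the server side. This is a minimal illustration, not how any particular site does it: the function name `render_comments`, the CSS class names, and the `logged_in` flag are all assumptions made up for the example.

```python
from html import escape


def render_comments(comments: list[str], logged_in: bool) -> str:
    """Return the HTML for the comment area of a post (hypothetical helper)."""
    count = len(comments)
    if not logged_in:
        # Anonymous visitors, and crawlers that do not log in, only get the count.
        return f'<p class="comment-count">Comments: {count}. Log in to read them.</p>'
    # Logged-in users get the full, HTML-escaped comment list.
    items = "".join(f"<li>{escape(text)}</li>" for text in comments)
    return f'<ul class="comments">{items}</ul>'
```

    With this shape, the comment text never appears in the markup served to anonymous requests, so there is nothing for a crawler to index beyond the count.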

  • 2020-12-17 16:49

    I would go with your JavaScript option. It has two advantages:

    1) bots don't see it
    2) it would speed up your page load time (load the comments asynchronously and unobtrusively, e.g. via jQuery); page load times have a much underrated positive effect on your search rankings
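
    One way to picture the asynchronous approach is to look at what the server sends. In the sketch below, the initial HTML carries only the article and an empty placeholder, and the comments come from a separate JSON response. The function names, the `data-src` attribute, and the `/comments/<id>.json` URL pattern are assumptions for illustration; the client-side script that fetches and inserts the comments is not shown. Crawlers that execute JavaScript may still see the comments, so this is not a guarantee.

```python
import json
from html import escape


def render_article_page(post_id: int, title: str, body_html: str) -> str:
    """Build the initial HTML: article only, plus an empty comment placeholder."""
    return (
        f"<article><h1>{escape(title)}</h1>{body_html}</article>\n"
        # A client-side script (not shown) would read data-src and fetch the comments.
        f'<div id="comments" data-src="/comments/{post_id}.json"></div>'
    )


def comments_payload(comments: list[dict]) -> str:
    """Serialize comments for the separate, asynchronously requested endpoint."""
    return json.dumps({"comments": comments})
```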

  • 2020-12-17 16:49

    This is the first I have heard that search engines provide a method for informing them that part of a page is irrelevant.

    Google has a feature for webmasters to declare which pages of their site a search engine should use to find content when crawling.

    1. http://www.google.com/webmasters/
    2. http://www.sitemaps.org/protocol.php
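
    For reference, a file following the sitemaps.org protocol linked above is a small XML document listing page URLs. The sketch below builds one with the standard library; the function name `build_sitemap` and the example URL are assumptions. Note that a sitemap works at the level of whole pages, so it does not by itself exclude a comment section within a page.

```python
import xml.etree.ElementTree as ET

# Namespace defined by the sitemaps.org protocol.
SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"


def build_sitemap(urls: list[str]) -> str:
    """Return a minimal sitemap XML string with one <url><loc> entry per URL."""
    urlset = ET.Element("urlset", xmlns=SITEMAP_NS)
    for url in urls:
        entry = ET.SubElement(urlset, "url")
        ET.SubElement(entry, "loc").text = url
    return ET.tostring(urlset, encoding="unicode")
```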

    You might be able to relatively de-emphasize some things on the page by specifying the most relevant keywords using META tag(s) in the HEAD section of your HTML pages. I think that is more in line with the engineering philosophy used to architect search engines in the first place.
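
    As a rough sketch of the META approach, the helper below generates a keywords tag for the HEAD section. The function name and the sample keywords are made up for the example. Google has publicly said it does not use the keywords meta tag in web ranking, so the effect there may be small or nil.

```python
from html import escape


def keywords_meta(keywords: list[str]) -> str:
    """Build a <meta name="keywords"> tag from a list of keyword strings."""
    # Trim whitespace and drop empty entries before joining.
    content = ", ".join(k.strip() for k in keywords if k.strip())
    return f'<meta name="keywords" content="{escape(content, quote=True)}">'
```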

    Look at Google's Search Engine Optimization tips. They spell out clearly what they will and will not let you do to influence how they index your site.

  • 2020-12-17 17:05

    googleoff and googleon are for the Google Search Appliance, which is a search engine Google sells to companies that need to search through their own internal documents. They are not effective on the live Google site.

    I think number 1 is the best solution, actually. Search engines don't like it when you give them different material than you give your users, so number 2 could get you kicked out of the search listings altogether.
