Robots.txt restriction of category URLs

别说谁变了你拦得住时间么, posted on 2019-12-02 13:33:21

This is not possible in the original robots.txt specification, which treats each Disallow value as a literal path prefix and defines no wildcards.

But some parsers may support wildcards in Disallow anyway, for example, Google:

Googlebot (but not all search engines) respects some pattern matching.

So for Google’s bots, you could use the following line:

Disallow: /*/video

This should block any URL whose path starts with "/", followed by any sequence of characters, followed by "/video", for example:

  • /foo/video
  • /foo/videos
  • /foo/video.html
  • /foo/video/bar
  • /foo/bar/videos
  • /foo/bar/foo/bar/videos
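
The matching described above can be approximated with a small Python sketch. It is only an illustration, not Google's actual implementation: the function name `google_style_match` is hypothetical, and it assumes that `*` matches any sequence of characters, that a trailing `$` anchors the end of the path, and that a rule otherwise matches as a prefix.

```python
import re


def google_style_match(pattern: str, path: str) -> bool:
    """Approximate wildcard matching for a Disallow value (hypothetical helper).

    Assumptions: "*" matches any run of characters, a trailing "$" anchors
    the end of the path, and everything else is matched literally as a prefix.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape the literal pieces and join them with ".*" for each "*".
    regex = ".*".join(re.escape(part) for part in pattern.split("*"))
    if anchored:
        regex += "$"
    # re.match anchors at the start of the path only, giving prefix semantics.
    return re.match(regex, path) is not None


paths = [
    "/foo/video",
    "/foo/videos",
    "/foo/video.html",
    "/foo/video/bar",
    "/foo/bar/videos",
    "/foo/bar/foo/bar/videos",
    "/video",
]
for path in paths:
    status = "blocked" if google_style_match("/*/video", path) else "allowed"
    print(f"{path}: {status}")
```

Under these assumptions, every path in the list above is reported as blocked, while `/video` is not, because the pattern requires a path segment before `/video`.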

Parsers that do not support wildcards would interpret the line literally, treating "*" as an ordinary character, i.e., they would block only URLs such as the following:

  • /*/video
  • /*/videos
  • /*/video/foo
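
The literal interpretation amounts to a plain prefix comparison. The following Python sketch shows the idea; the function name `literal_prefix_match` is hypothetical, and real parsers may additionally normalize percent-encoding before comparing.

```python
def literal_prefix_match(rule_path: str, path: str) -> bool:
    """Treat the Disallow value as a literal path prefix (hypothetical helper).

    No character is special here, so "*" only matches an actual asterisk.
    """
    return path.startswith(rule_path)


rule = "/*/video"
for path in ["/*/video", "/*/videos", "/*/video/foo", "/foo/video"]:
    status = "blocked" if literal_prefix_match(rule, path) else "allowed"
    print(f"{path}: {status}")
```

With such a parser, `/foo/video` stays crawlable, so the rule has almost no practical effect outside of bots that understand wildcards.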