otheus
e35693475c
Update robots.txt to exclude group_members and project_members, which can expose sensitive user information to the web. Please see https://developers.google.com/search/reference/robots_txt for the correct wildcard format.
2018-11-29 22:06:42 +00:00
Moritz Schlarb
2be7ddb704
Update robots.txt ( #51167 )
2018-09-07 07:27:29 +00:00
Achilleas Pipinellis
71f707bfe8
Add the /help page in robots.txt
...
The /help page has docs which we don't want to be crawled
as we prefer the docs website instead.
Related https://gitlab.com/gitlab-org/gitlab-ce/issues/44433
2018-03-22 10:55:02 +01:00
eric sabelhaus
3ac6054bf9
correct User_agent placement in robots.txt
2017-01-18 07:21:16 -05:00
Matt Harrison
f1df7b1bc2
update robots.txt disallow
...
Allows projects in groups starting with "s" while still disallowing the
snippets short urls.
2016-09-23 10:57:44 -04:00
Connor Shea
d71edf0ded
Disallow search engines from indexing uploads from a GitLab project.
...
This can sometimes include sensitive information from private projects and confidential issues. It shouldn't be indexed. Resolves #15551 .
2016-05-16 15:04:14 -05:00
Ben Bodenmiller
abf18abb52
allow crawling of commit page but not patch/diffs
...
Commit page has valuable information that search engines should be allowed to crawl however the .patch and .diff pages have no new information that is not on commit page
2015-10-04 23:39:24 -07:00
Ben Bodenmiller
595a93ee2c
disallow irrelevant pages by default in robots
...
Update default robots.txt rules to disallow irrelevant pages that search
engines should not care about. This will still allow important pages
like the files, commit details, merge requests, issues, comments, etc.
to be crawled.
2015-08-17 23:11:16 -07:00
gitlabhq
9ba1224867
init commit
2011-10-09 00:36:38 +03:00