Robots exclusion standard
To give robots instructions about which pages of a Web site they can access, site owners put a text file called robots.txt in the main directory of their Web site, e.g. http://www.example.com/robots.txt. This text file tells robots which parts of the site they can and cannot access. However, robots can ignore robots.txt files, especially malicious (bad) robots. If the robots.txt file does not exist, Web robots assume that they can see all parts of the site.
Examples of robots.txt files[change | edit source]
References[change | edit source]
- "Robot Exclusion Standard". HelpForWebBeginners.com. http://www.helpforwebbeginners.com/webmasters/robot-exclusion-standard.html. Retrieved 2012-02-13.
- "About /robots.txt". Robotstxt.org. http://www.robotstxt.org/robotstxt.html. Retrieved 2012-02-13.