Robots.txt Generator. Also known as robots exclusion protocol, Robots.txt is a text file stored in a site’s root directory that tells a search engine Crawler which site pages and sub-folders should not be included in the search engine Index. However, there is no guarantee that a Crawler will comply with this request. Robots.txt is an alternative to a Meta Robots Tag or password protection.
The robots exclusion standard, also known as the robots exclusion protocol or simply robots.txt, is a standard used by websites to communicate with web crawlers and other web robots. The standard specifies how to inform the web robot about which areas of the website should not be processed or scanned. Robots are often used by search engines to categorize websites. Not all robots cooperate with the standard; email harvesters, spambots, malware and robots that scan for security vulnerabilities may even start with the portions of the website where they have been told to stay out. The standard can be used in conjunction with Sitemaps, a robot inclusion standard for websites.