# The method used to exclude robots from a server is to create a file on the server # which specifies an access policy for robots. This file must be accessible via # HTTP on the local URL "/robots.txt". # The format and semantics of the "/robots.txt" file are as follows: # The file consists of one or more records separated by one or more blank lines # (terminated by CR,CR/NL, or NL). Each record contains lines of the form # ":". The field name is case insensitive. # Comments can be included in file using UNIX bourne shell conventions: the '#' # character is used to indicate that preceding space (if any) and the remainder of # the line up to the line termination is discarded. Lines containing only a comment # are discarded completely, and therefore do not indicate a record boundary. # The record starts with one or more User-agent lines, followed by one or more Disallow # lines, as detailed below. Unrecognised headers are ignored. # User-agent # The value of this field is the name of the robot the record is describing access policy # for. If more than one User-agent field is present the record describes an identical # access policy for more than one robot. At least one field needs to be present per record. # The robot should be liberal in interpreting this field. A case insensitive substring # match of the name without version information is recommended. # If the value is '*', the record describes the default access policy for any robot that # has not matched any of the other records. It is not allowed to have multiple such records # in the "/robots.txt" file. # Disallow # The value of this field specifies a partial URL that is not to be visited. This can be a # full path, or a partial path; any URL that starts with this value will not be retrieved. # For example, Disallow: /help disallows both /help.html and /help/index.html, whereas # Disallow: /help/ would disallow /help/index.html but allow /help.html. # Any empty value, indicates that all URLs can be retrieved. At least one Disallow field # needs to be present in a record. # The presence of an empty "/robots.txt" file has no explicit associated semantics, it will # be treated as if it was not present, i.e. all robots will consider themselves welcome. # http://www.robotstxt.org/wc/norobots.html for further information # Don't crawl the following pages: # Shopping Cart Pages # Account Pages # Automatic Renewal Pages # Order Status Pages # QuickBooks BUY page User-agent: * Disallow: /commerce/checkout/ Disallow: /commerce/account/secure/ Disallow: /commerce/autorenew/secure/ Disallow: /commerce/orderstatus/ Disallow: /commerce/catalog/buy.jhtml Disallow: /qb/reviews/