# file: robots.txt. # www.qx100.com # 按照robots.txt的标准写法,规定一些不允许爬虫爬的页面或目录。 # robots.txt 的写法参照 # Format is: # User-agent: # Disallow: | # ----------------------------------------------------------------------------- User-agent: YoudaoBot Disallow: / User-agent: Baiduspider Allow:/ User-agent: Baidu spider Allow:/ User-agent:Googlebot Allow:/ User-agent:Google bot Allow:/ User-agent: Googlespider Allow:/ User-agent: Google spider Allow:/ User-agent: MSNBot Allow:/ User-agent: ia_archiver Allow:/ User-agent:Yahoospider Disallow: /sj/ User-agent:Yahoo spider Disallow: /sj/ User-agent:sosospider Disallow: /sj/ User-agent:sosos pider Disallow: /sj/ User-agent:wespe.despider Disallow: /sj/ User-agent:wespe.de spider Disallow: /sj/