# # Robots.txt tells spiders and robots to stay away from certain # parts of our web space. For info, see # # http://info.webcrawler.com/mak/projects/robots/norobots.html # # Yuck. Used to have Disallow: ~ to keep robots out of # personal user web space. However, I want robots to # index ~dwilliss/programming/ The draft of the new # robots.txt format allows for Allow: lines, but that # isn't implemented yet. # Disallow: ~admin Disallow: ~analog Disallow: ~Architext Disallow: ~architexture Disallow: ~awstats Disallow: ~awstats1 Disallow: /etc Disallow: ~excite Disallow: ~excitews Disallow: ~ftp Disallow: ~lanceserver Disallow: ~logs Disallow: ~Mail Disallow: ~mysql Disallow: ~netstat Disallow: ~press Disallow: ~www Disallow: ~valserver # Disallow: /usr # Disallow: /dev # Disallow: /pub # Don't index pub. It changes Disallow: /www/stats/ # Don't index our server statistics Disallow: /www/dinosaur/in_prog/ # Don't look at stuff that is not ready Disallow: /work # Stuff that is not ready Disallow: ~atlasdocs # c:/ "local" data for online atlases