Crawling vs. Indexing – robots.txt Debate


This is really a long and tiresome debate… I have argued this point with Zen Cart forum posters for a long time, to no avail, and even linked to Sebastian for verification. But, alas, the debate drags on.

Much to my absolute delight, Matt Cutts recently (10/6/2009) came right out and said what we have been saying all along: blocking a page with your robots.txt DOES NOT prevent indexing, only crawling. Google can still index a blocked URL from links pointing to it, typically showing just the bare URL with no snippet, because it never gets to read the page itself. The correct way to prevent a page from being indexed is with a meta robots noindex tag… That’s all… and here’s Matt to verify these truths.
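
For reference, here is a minimal sketch of the two directives in question. The path /private.html is just a hypothetical example; substitute your own page.

    # robots.txt – blocks crawling only; the URL can still be indexed from external links
    User-agent: *
    Disallow: /private.html

    <!-- In the page's <head> – tells search engines not to index it -->
    <!-- Note: the page must NOT also be blocked in robots.txt, or the crawler never sees this tag -->
    <meta name="robots" content="noindex">

The two mechanisms work at different stages: robots.txt is read before the crawler fetches the page, while the noindex tag is only discovered when the page is actually crawled. That is exactly why combining them backfires.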

One final thing… Your sitemap is not a command for Google or any other search engine to index your pages; it is simply an XML document that helps the search engine spiders locate your content for crawling. Google and other search engines are not required to index your pages and, alas, you cannot force them to either.
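
For completeness, a minimal sketch of what such a sitemap file looks like (the URL and date are hypothetical examples). It only points spiders at your pages; it does not guarantee indexing.

    <?xml version="1.0" encoding="UTF-8"?>
    <urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
      <url>
        <loc>http://www.example.com/some-page.html</loc>
        <lastmod>2009-10-06</lastmod>
      </url>
    </urlset>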

