Tuesday, December 29, 2009

How to know when googlebot last crawled my page

The word crawling means how often googlebot visit your webpage in order to Index your web page. The program that is responsible for fetching web page information is called googlebot. Googlebot also called as robot, bot or spider. Googlebot uses an algorithmic process to determine which sites to crawl, how often, and how many pages to fetch from each site.

The most important your page is, the most often googlebot visit your webpage. Now it is question how I am able to know when my page was last crawled by googlebot. There are several methods to do it.

Using Google Webmaster Tools
The first you need to setup google webmaster tools. In the post http://arjudba.blogspot.com/2009/07/how-to-setup-google-webmaster-tool-for.html it is discussed how to setup google webmaster tools.

After you setup webmaster tools login to your webmaster tools, click Site Configuration tab and then click Crawler access. On the right side under under Test robots.txt you can find before how many hours/minutes/days your site was last visited by crawler. Here is a screenshot for this blog from the google webmaster links.
google webmaster tools crawler

Using Google Cached Links
Google takes a snapshot of each page as it crawls the web and caches these as a back-up in case the original page is unavailable.

In order to check when your page is cached just
1)Open www.google.com

2)Type your site address and click Google Search. Following image shows an example in case of my blog.
arju blog in google

3)You will see the search results like below.
search result for arju blog

4)If you click on the "Cached" link, you will see the web page as it looked when we indexed it. The cached content is the content Google uses to judge whether this page is a relevant match for your query. Following image shows the caching result from my blog.
cache page arju blog in google

When the cached page is displayed, it will have a header at the top which serves as a reminder that this is not necessarily the most recent version of the page. Terms that match your query are highlighted on the cached version to make it easier for you to see why your page is relevant. From the cache version you can easily check the date 29 Dec 2009 02:18:42 GMT. So homepage last cached in 29 Dec 2009 02:18:42 GMT.

Related Documents
http://arjudba.blogspot.com/2009/08/how-to-increase-your-technorati.html

No comments:

Post a Comment