Beta HostReview | HostReview


Google to start tracking content "behind" web forms?

According to a post on Google's Webmaster Central Blog, the company is experimenting with ways to crawl content that is accessible only through HTML forms. This would add to Google's index some of the content of the "Deep Web," which lies beyond the crawling capabilities of current spiders. According to estimates, tens of thousands of terabytes of data are located in the Deep Web.