You are here
Around the World in 2 Billion Pages
Joy Davidson | 16 March 2007
In December 2006, Internet Archive [external] received a grant from the Mellon Foundation [external] for their ongoing development of the Heritrix web crawler [external]. Using this grant, they will embark on a 2 billion page web crawl beginning in July; the largest web crawl they have ever attempted. They are currently seeking URL submissions for this historic crawl from libraries and archives as well as other cultural and memory institutions; and especially want international web content from a large variety of countries, geographic regions and language bases.