Crawl Release
December 2024 Crawl Archive Now Available
The crawl archive for December 2024 is now available. The data was crawled between December 1st and December 15th, and contains 2.64 billion web pages (or 394 TiB of uncompressed content). Page captures are from 47.5 million hosts or 38.3 million registered domains and include 1.05 billion new URLs, not visited in any of our prior crawls.
Sebastian Nagel
Sebastian is a Distinguished Engineer with Common Crawl.