Spaces:
Running
Running
metadata
title: README
emoji: π
colorFrom: indigo
colorTo: purple
sdk: static
pinned: true
short_description: Explore Common Crawl's metadata and experimental datasets
Common Crawl
Welcome to the Common Crawl Foundation's Hugging Face page!
We aim to provide metadata and experimental versions of our latest data products here.
Useful Links
- Common Crawl's official website
- Our existing statistics webpages (GitHub repo)
- AWS infrastructure status page
Datasets
Explore our datasets hosted on Hugging Face:
- Common Crawl Citations
- Common Crawl Citations, Annotated
- Common Crawl Statistics
- EOT 2024 Host-Level Logs (only available to EOT collaborators)
We look forward to supporting the research and development community with these resources.