Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
pietrolesci 's Collections
UnimixLM
Interesting Pre-Training Datasets
The Pile Companion
Generalisation-Profiles
Machine Translation Datasets
Text Classification Datasets
Dialogue State Tracking Datasets
NLI Eval Datasets
AnchorAL
Memorisation-Profiles
Tokenisation-Bias

Text Classification Datasets

updated Nov 12, 2024

A curated collection of common datasets for text classification

Upvote
1

  • pietrolesci/amazoncat-13k

    Viewer • Updated Apr 9, 2025 • 5.99M • 317 • 1

  • pietrolesci/civilcomments-wilds

    Viewer • Updated Jul 2, 2024 • 893k • 34 • 2

  • pietrolesci/dbpedia_14_indexed

    Viewer • Updated May 11, 2023 • 630k • 18

  • pietrolesci/DBPedia_Classes_indexed

    Viewer • Updated May 11, 2023 • 338k • 27

  • pietrolesci/pubmed-20k-rct

    Viewer • Updated May 12, 2023 • 236k • 178

  • pietrolesci/eurlex-57k

    Viewer • Updated Sep 11, 2023 • 235k • 52

  • pietrolesci/pubmed-200k-rct

    Viewer • Updated Sep 11, 2023 • 9.08M • 116

  • pietrolesci/imdb

    Viewer • Updated Sep 11, 2023 • 200k • 71 • 2

  • pietrolesci/agnews

    Viewer • Updated Apr 9, 2025 • 510k • 104

  • pietrolesci/wikitoxic

    Viewer • Updated Apr 9, 2025 • 894k • 109 • 1

  • pietrolesci/hyperpartisan_news_detection

    Viewer • Updated Sep 25, 2023 • 1.5M • 113 • 2

  • pietrolesci/yahoo_answers_topics

    Viewer • Updated Sep 25, 2023 • 2.92M • 45
Upvote
1
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs