Open Datasets
updated
Updated
•
307
•
86
fka/awesome-chatgpt-prompts
Viewer
•
Updated
•
1.03k
•
17.3k
•
9.55k
Viewer
•
Updated
•
470M
•
39.8k
•
333
Viewer
•
Updated
•
2.2M
•
5.27k
•
386
Matthijs/cmu-arctic-xvectors
Viewer
•
Updated
•
7.93k
•
17.7k
•
62
parler-tts/libritts-r-filtered-speaker-descriptions
Viewer
•
Updated
•
359k
•
152
•
7
Viewer
•
Updated
•
860k
•
11k
•
528
alpindale/two-million-bluesky-posts
Viewer
•
Updated
•
2.11M
•
466
•
200
arimalabs/2.3-million-bluesky-posts
Viewer
•
Updated
•
2.37M
•
29
•
5
Viewer
•
Updated
•
70k
•
65.1k
•
223
Viewer
•
Updated
•
1.34M
•
3.02k
•
30
Viewer
•
Updated
•
1.12M
•
1.49k
•
4
parler-tts/libritts_r_filtered
Viewer
•
Updated
•
359k
•
2.05k
•
21
opendiffusionai/cc12m-cleaned
Viewer
•
Updated
•
8.53M
•
434
•
10
Viewer
•
Updated
•
31.4k
•
358
•
22
Preview
•
Updated
•
354
•
7
Viewer
•
Updated
•
61.6M
•
78.2k
•
1.12k
parler-tts/mls-eng-speaker-descriptions
Viewer
•
Updated
•
10.8M
•
114
•
10
Viewer
•
Updated
•
111M
•
1.09k
•
98
Updated
•
41
•
2
Viewer
•
Updated
•
602k
•
7.8k
•
144
Viewer
•
Updated
•
4.48B
•
90.4k
•
716
Viewer
•
Updated
•
1.55k
•
17
•
4
Updated
•
6.92k
•
138
Viewer
•
Updated
•
59.1k
•
1k
•
12
keremberke/license-plate-object-detection
Viewer
•
Updated
•
8.83k
•
758
•
33
Updated
•
34
•
8
Viewer
•
Updated
•
98.6k
•
2.69k
•
100
nebius/SWE-agent-trajectories
Viewer
•
Updated
•
80k
•
466
•
67
Viewer
•
Updated
•
3.4k
•
2.77k
•
56
cfahlgren1/react-code-instructions
Viewer
•
Updated
•
74.4k
•
226
•
155
DAMO-NLP-SG/multimodal_textbook
Updated
•
738
•
156
NovaSky-AI/Sky-T1_data_17k
Viewer
•
Updated
•
16.4k
•
201
•
187
Viewer
•
Updated
•
5.45B
•
5.67k
•
441
Viewer
•
Updated
•
546M
•
15.9k
•
907
hoskinson-center/proof-pile
Viewer
•
Updated
•
363k
•
1.52k
•
63
HuggingFaceFW/fineweb-edu
Viewer
•
Updated
•
3.5B
•
307k
•
903
EleutherAI/the_pile_deduplicated
Viewer
•
Updated
•
134M
•
8.15k
•
107
MohamedRashad/multilingual-tts
Viewer
•
Updated
•
25.5k
•
103
•
45
Viewer
•
Updated
•
16.4k
•
31
•
4
facebook/multilingual_librispeech
Viewer
•
Updated
•
1.49M
•
10.6k
•
167
Viewer
•
Updated
•
1.25M
•
12.3k
•
85
Viewer
•
Updated
•
2.77M
•
4.57k
•
113
Fumika/Wikinews-multilingual
Viewer
•
Updated
•
15.2k
•
60
•
7
ayymen/Weblate-Translations
Viewer
•
Updated
•
11.7M
•
1.63k
•
16
Updated
•
127k
•
153
Helsinki-NLP/opus_wikipedia
Viewer
•
Updated
•
1.75M
•
109
•
10
Viewer
•
Updated
•
3.59M
•
30
•
1
MLCommons/unsupervised_peoples_speech
Updated
•
23.6k
•
69
HKUSTAudio/Llasa_opensource_speech_data_160k_hours_tokenized
Updated
•
68
•
30
Viewer
•
Updated
•
10k
•
3.31k
•
527
Viewer
•
Updated
•
68.1k
•
104k
•
21
allenai/RLVR-GSM-MATH-IF-Mixed-Constraints
Viewer
•
Updated
•
29.9k
•
1.32k
•
30
allenai/olmo-2-0325-32b-preference-mix
Updated
•
107
•
15
allenai/tulu-3-sft-olmo-2-mixture-0225
Viewer
•
Updated
•
866k
•
999
•
22
Viewer
•
Updated
•
170M
•
39.4k
•
90
Viewer
•
Updated
•
621M
•
22.5k
•
86
Viewer
•
Updated
•
932
•
14.7k
•
589
Congliu/Chinese-DeepSeek-R1-Distill-data-110k
Viewer
•
Updated
•
110k
•
394
•
720
Viewer
•
Updated
•
102k
•
249
•
47
Viewer
•
Updated
•
450k
•
13.1k
•
694
Viewer
•
Updated
•
167M
•
2.29k
•
61