Bridging the Data Provenance Gap Across Text, Speech and Video Paper ⢠2412.17847 ⢠Published Dec 19, 2024 ⢠9
Consent in Crisis: The Rapid Decline of the AI Data Commons Paper ⢠2407.14933 ⢠Published Jul 20, 2024 ⢠12
BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions Paper ⢠2406.15877 ⢠Published Jun 22, 2024 ⢠46
Aurora-M: The First Open Source Multilingual Language Model Red-teamed according to the U.S. Executive Order Paper ⢠2404.00399 ⢠Published Mar 30, 2024 ⢠42
MindEye2: Shared-Subject Models Enable fMRI-To-Image With 1 Hour of Data Paper ⢠2403.11207 ⢠Published Mar 17, 2024 ⢠15
Can Language Models Employ the Socratic Method? Experiments with Code Debugging Paper ⢠2310.03210 ⢠Published Oct 4, 2023
Aya Dataset: An Open-Access Collection for Multilingual Instruction Tuning Paper ⢠2402.06619 ⢠Published Feb 9, 2024 ⢠55
CheXagent: Towards a Foundation Model for Chest X-Ray Interpretation Paper ⢠2401.12208 ⢠Published Jan 22, 2024 ⢠22
BigBIO: A Framework for Data-Centric Biomedical Natural Language Processing Paper ⢠2206.15076 ⢠Published Jun 30, 2022 ⢠4
BLOOM: A 176B-Parameter Open-Access Multilingual Language Model Paper ⢠2211.05100 ⢠Published Nov 9, 2022 ⢠29
The BigScience ROOTS Corpus: A 1.6TB Composite Multilingual Dataset Paper ⢠2303.03915 ⢠Published Mar 7, 2023 ⢠7
Reconstructing the Mind's Eye: fMRI-to-Image with Contrastive Learning and Diffusion Priors Paper ⢠2305.18274 ⢠Published May 29, 2023 ⢠4
BLOOM: A 176B-Parameter Open-Access Multilingual Language Model Paper ⢠2211.05100 ⢠Published Nov 9, 2022 ⢠29
BLOOM: A 176B-Parameter Open-Access Multilingual Language Model Paper ⢠2211.05100 ⢠Published Nov 9, 2022 ⢠29
hugginglearners/amazon-reviews-sentiment-analysis Viewer ⢠Updated Aug 18, 2022 ⢠4.92k ⢠304 ⢠1