Article Announcing the Synthetic Online Conversations Dataset (SOC) By marcodsn • 21 days ago • 11
MolmoAct Data Mixture Collection All datasets for the MolmoAct (Multimodal Open Language Model for Action) release. • 4 items • Updated 7 days ago • 12
Article Introducing AI Sheets: a tool to work with datasets using open AI models! By dvilasuero and 5 others • 25 days ago • 76
Article Welcome GPT OSS, the new open-source model family from OpenAI! By reach-vb and 11 others • 28 days ago • 481
Article What Open-Source Developers Need to Know about the EU AI Act's Rules for GPAI Models By yjernite and 5 others • 29 days ago • 27
Article Introducing Command A Vision: Multimodal AI built for Business By CohereLabs and 3 others • Jul 31 • 63
Article Introducing Trackio: A Lightweight Experiment Tracking Library from Hugging Face By abidlabs and 4 others • Jul 29 • 164
Dayhoff Atlas Collection The models and datasets that comprise the Dayhoff Atlas • 10 items • Updated Jul 28 • 8
Article Say hello to `hf`: a faster, friendlier Hugging Face CLI ✨ By Wauplin and 2 others • Jul 25 • 80
Article TimeScope: How Long Can Your Video Large Multimodal Model Go? By orrzohar and 3 others • Jul 23 • 39
Article Reachy Mini - The Open-Source Robot for Today's and Tomorrow's AI Builders By thomwolf and 1 other • Jul 9 • 663
SmolLM3 evaluation datasets Collection Datasets to decontaminate the post-training mixtures against. Use the subset and column values described per entry • 13 items • Updated Jul 8 • 5
SmolLM3 pretraining datasets Collection Datasets used in SmolLM3 pretraining • 15 items • Updated 20 days ago • 28