SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training Paper • 2501.17161 • Published 10 days ago • 100
WILDCHAT-50M: A Deep Dive Into the Role of Synthetic Data in Post-Training Paper • 2501.18511 • Published 8 days ago • 17
Tulu 3 Models Collection All models released with Tulu 3 -- state of the art open post-training recipes. • 10 items • Updated 8 days ago • 88
Language Detection Collection StaticVectors models to detect language. Exports of FastText that run in NumPy without needing FastText • 2 items • Updated 12 days ago • 3
CodeXEmbed: A Generalist Embedding Model Family for Multiligual and Multi-task Code Retrieval Paper • 2411.12644 • Published Nov 19, 2024 • 3
Article Python Is All You Need? Introducing Dria-Agent-α By andthattoo and 1 other • 28 days ago • 22
KaLM-Embedding: Superior Training Data Brings A Stronger Embedding Model Paper • 2501.01028 • Published Jan 2 • 13
Article Synthetic Data Generation with FastData and Hugging Face By asoria • about 1 month ago • 14
Article Fine-tune a SmolLM on domain-specific synthetic data from a LLM By davidberenstein1957 • Jan 3 • 32
Dolphin 3.0 Collection Dolphin 3.0 is the next generation of the Dolphin series of instruct-tuned models. Designed to be the ultimate general purpose local model. • 9 items • Updated about 8 hours ago • 65