Multilingual Machine Translation with Open Large Language Models at Practical Scale: An Empirical Study Paper • 2502.02481 • Published 18 days ago • 3
GemmaX2 Collection GemmaX2 language models, including pretrained and instruction-tuned models of 2 sizes, including 2B, 9B. • 7 items • Updated 15 days ago • 8
Revisiting the Test-Time Scaling of o1-like Models: Do they Truly Possess Test-Time Scaling Capabilities? Paper • 2502.12215 • Published 5 days ago • 13
Step-Video-T2V Technical Report: The Practice, Challenges, and Future of Video Foundation Model Paper • 2502.10248 • Published 8 days ago • 49
view article Article Open Preference Dataset for Text-to-Image Generation by the 🤗 Community Dec 9, 2024 • 54
Soundwave: Less is More for Speech-Text Alignment in LLMs Paper • 2502.12900 • Published 4 days ago • 73
view article Article Introducing Three New Serverless Inference Providers: Hyperbolic, Nebius AI Studio, and Novita 🔥 5 days ago • 86
view article Article PaliGemma 2 Mix - New Instruction Vision Language Models by Google 4 days ago • 50
Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling Paper • 2502.06703 • Published 12 days ago • 133
One Example Shown, Many Concepts Known! Counterexample-Driven Conceptual Reasoning in Mathematical LLMs Paper • 2502.10454 • Published 10 days ago • 6
HermesFlow: Seamlessly Closing the Gap in Multimodal Understanding and Generation Paper • 2502.12148 • Published 5 days ago • 16
EmbodiedBench: Comprehensive Benchmarking Multi-modal Large Language Models for Vision-Driven Embodied Agents Paper • 2502.09560 • Published 9 days ago • 32
view article Article Fine-Tuning Your First Large Language Model (LLM) with PyTorch and Hugging Face By dvgodoy • 11 days ago • 6
Tools for learning AI Collection This is a collection of tools on the hub that teachers and students can use to learn AI! • 9 items • Updated 5 days ago • 61
SelfCite: Self-Supervised Alignment for Context Attribution in Large Language Models Paper • 2502.09604 • Published 9 days ago • 30
DPO-Shift: Shifting the Distribution of Direct Preference Optimization Paper • 2502.07599 • Published 11 days ago • 14