deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B Text Generation • Updated 3 days ago • 675k • • 771
Qwen2.5-VL Collection Vision-language model series based on Qwen2.5 • 3 items • Updated 16 days ago • 338
FreedomIntelligence/medical_o1_verifier_3B Text Classification • Updated Dec 30, 2024 • 1.77k • 11
Running 505 505 Scaling test-time compute 📈 Enhance math problem solving by scaling test-time compute
MSI-Agent: Incorporating Multi-Scale Insight into Embodied Agents for Superior Planning and Decision-Making Paper • 2409.16686 • Published Sep 25, 2024 • 10
UCFE: A User-Centric Financial Expertise Benchmark for Large Language Models Paper • 2410.14059 • Published Oct 17, 2024 • 58