DiffSplat: Repurposing Image Diffusion Models for Scalable Gaussian Splat Generation Paper โข 2501.16764 โข Published 3 days ago โข 14
SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training Paper โข 2501.17161 โข Published 2 days ago โข 48
Qwen2.5-VL Collection Vision-language model series based on Qwen2.5 โข 3 items โข Updated 4 days ago โข 287
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper โข 2501.12948 โข Published 8 days ago โข 273
Mutual Reasoning Makes Smaller LLMs Stronger Problem-Solvers Paper โข 2408.06195 โข Published Aug 12, 2024 โข 70
view article Article Train 400x faster Static Embedding Models with Sentence Transformers 16 days ago โข 129
The Lessons of Developing Process Reward Models in Mathematical Reasoning Paper โข 2501.07301 โข Published 18 days ago โข 89
O1 Replication Journey -- Part 3: Inference-time Scaling for Medical Reasoning Paper โข 2501.06458 โข Published 20 days ago โข 29
HuatuoGPT-o1, Towards Medical Complex Reasoning with LLMs Paper โข 2412.18925 โข Published Dec 25, 2024 โข 97