CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction Paper • 2502.07316 • Published about 1 month ago • 47
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper • 2501.12948 • Published Jan 22 • 346
Test-time Computing: from System-1 Thinking to System-2 Thinking Paper • 2501.02497 • Published Jan 5 • 42
view article Article Training and Finetuning Embedding Models with Sentence Transformers v3 May 28, 2024 • 193
Look Once to Hear: Target Speech Hearing with Noisy Examples Paper • 2405.06289 • Published May 10, 2024 • 3
MoRA: High-Rank Updating for Parameter-Efficient Fine-Tuning Paper • 2405.12130 • Published May 20, 2024 • 50