BuiDoan
's Collections
Great paper
updated
Paper
•
2410.05258
•
Published
•
180
PaliGemma 2: A Family of Versatile VLMs for Transfer
Paper
•
2412.03555
•
Published
•
134
VisionZip: Longer is Better but Not Necessary in Vision Language Models
Paper
•
2412.04467
•
Published
•
119
o1-Coder: an o1 Replication for Coding
Paper
•
2412.00154
•
Published
•
45
SNOOPI: Supercharged One-step Diffusion Distillation with Proper
Guidance
Paper
•
2412.02687
•
Published
•
114
TAPTRv3: Spatial and Temporal Context Foster Robust Tracking of Any
Point in Long Video
Paper
•
2411.18671
•
Published
•
20
Fully Open Source Moxin-7B Technical Report
Paper
•
2412.06845
•
Published
•
11
Small Language Models: Survey, Measurements, and Insights
Paper
•
2409.15790
•
Published
•
1
Paper
•
2407.10671
•
Published
•
167
Paper
•
2412.08905
•
Published
•
121
Apollo: An Exploration of Video Understanding in Large Multimodal Models
Paper
•
2412.10360
•
Published
•
147
Byte Latent Transformer: Patches Scale Better Than Tokens
Paper
•
2412.09871
•
Published
•
109
Paper
•
2412.15115
•
Published
•
374
Search-o1: Agentic Search-Enhanced Large Reasoning Models
Paper
•
2501.05366
•
Published
•
102
rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep
Thinking
Paper
•
2501.04519
•
Published
•
284
MiniMax-01: Scaling Foundation Models with Lightning Attention
Paper
•
2501.08313
•
Published
•
298
Towards Large Reasoning Models: A Survey of Reinforced Reasoning with
Large Language Models
Paper
•
2501.09686
•
Published
•
41
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via
Reinforcement Learning
Paper
•
2501.12948
•
Published
•
418
SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model
Post-training
Paper
•
2501.17161
•
Published
•
123
Baichuan-Omni-1.5 Technical Report
Paper
•
2501.15368
•
Published
•
64
OmniHuman-1: Rethinking the Scaling-Up of One-Stage Conditioned Human
Animation Models
Paper
•
2502.01061
•
Published
•
222
The Differences Between Direct Alignment Algorithms are a Blur
Paper
•
2502.01237
•
Published
•
115
Hermes 3 Technical Report
Paper
•
2408.11857
•
Published
•
56
From Hours to Minutes: Lossless Acceleration of Ultra Long Sequence
Generation up to 100K Tokens
Paper
•
2502.18890
•
Published
•
30
SemViQA: A Semantic Question Answering System for Vietnamese Information
Fact-Checking
Paper
•
2503.00955
•
Published
•
28
InternVL3: Exploring Advanced Training and Test-Time Recipes for
Open-Source Multimodal Models
Paper
•
2504.10479
•
Published
•
281
Tina: Tiny Reasoning Models via LoRA
Paper
•
2504.15777
•
Published
•
55
Absolute Zero: Reinforced Self-play Reasoning with Zero Data
Paper
•
2505.03335
•
Published
•
184