Sparse Logit Sampling: Accelerating Knowledge Distillation in LLMs
Paper • 2503.16870 • Published • 5
Gemma 3 Technical Report
Paper • 2503.19786 • Published • 39
Qwen2.5-Omni Technical Report
Paper • 2503.20215 • Published • 112
Think Twice: Enhancing LLM Reasoning by Scaling Multi-round Test-time Thinking
Paper • 2503.19855 • Published • 24
Souvik Mandal
Souvik3333
AI & ML interests
None yet
Recent Activity
Updated a collection 2 days ago: Todo
Organizations
Collections
1
Datasets
None public yet