SAMBIT CHAKRABORTY

sambitchakhf03

AI & ML interests

None yet

Recent Activity

upvoted a paper 10 days ago

InfiniteHiP: Extending Language Model Context Up to 3 Million Tokens on a Single GPU

upvoted a paper 10 days ago

Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach

upvoted a paper 14 days ago

Demystifying Long Chain-of-Thought Reasoning in LLMs

View all activity

Organizations

sambitchakhf03's activity

upvoted 2 papers 10 days ago

InfiniteHiP: Extending Language Model Context Up to 3 Million Tokens on a Single GPU

Paper • 2502.08910 • Published 11 days ago • 140

Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach

Paper • 2502.05171 • Published 17 days ago • 115

upvoted a paper 14 days ago

Demystifying Long Chain-of-Thought Reasoning in LLMs

Paper • 2502.03373 • Published 19 days ago • 51

upvoted a paper 15 days ago

BOLT: Bootstrap Long Chain-of-Thought in Language Models without Distillation

Paper • 2502.03860 • Published 18 days ago • 23

upvoted a paper 17 days ago

Token Assorted: Mixing Latent and Text Tokens for Improved Language Model Reasoning

Paper • 2502.03275 • Published 19 days ago • 13

upvoted a paper 23 days ago

Thoughts Are All Over the Place: On the Underthinking of o1-Like LLMs

Paper • 2501.18585 • Published 25 days ago • 55

upvoted 5 papers about 1 month ago

Step-KTO: Optimizing Mathematical Reasoning through Stepwise Binary Feedback

Paper • 2501.10799 • Published Jan 18 • 15

Evolving Deeper LLM Thinking

Paper • 2501.09891 • Published Jan 17 • 106

Towards Large Reasoning Models: A Survey of Reinforced Reasoning with Large Language Models

Paper • 2501.09686 • Published Jan 16 • 36

Transformer^2: Self-adaptive LLMs

Paper • 2501.06252 • Published Jan 9 • 53

LlamaV-o1: Rethinking Step-by-step Visual Reasoning in LLMs

Paper • 2501.06186 • Published Jan 10 • 61

upvoted an article about 2 months ago

Article

Accelerating Language Model Inference with Mixture of Attentions

and 1 other •

Jan 7

• 24

upvoted 3 papers about 2 months ago

REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models

Paper • 2501.03262 • Published Jan 4 • 90

BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning

Paper • 2501.03226 • Published Jan 6 • 40

Token-Budget-Aware LLM Reasoning

Paper • 2412.18547 • Published Dec 24, 2024 • 46

upvoted 2 papers 2 months ago

Phi-4 Technical Report

Paper • 2412.08905 • Published Dec 12, 2024 • 107

Training Large Language Models to Reason in a Continuous Latent Space

Paper • 2412.06769 • Published Dec 9, 2024 • 78

upvoted a paper 5 months ago

EzAudio: Enhancing Text-to-Audio Generation with Efficient Diffusion Transformer

Paper • 2409.10819 • Published Sep 17, 2024 • 19

upvoted a collection 6 months ago

Seamless Communication

Collection

A significant step towards removing language barriers through expressive, fast and high-quality AI translation. • 16 items • Updated Jan 16, 2024 • 153

upvoted a paper 6 months ago

Transfusion: Predict the Next Token and Diffuse Images with One Multi-Modal Model

Paper • 2408.11039 • Published Aug 20, 2024 • 59