Vadim Kurochkin

Vadim21221

AI & ML interests

None yet

Recent Activity

upvoted a paper 19 days ago

LLaDA2.0: Scaling Up Diffusion Language Models to 100B

upvoted a paper about 2 months ago

Unveiling Intrinsic Dimension of Texts: from Academic Abstract to Creative Story

upvoted a paper about 2 months ago

Souper-Model: How Simple Arithmetic Unlocks State-of-the-Art LLM Performance

View all activity

Organizations

None yet

upvoted a paper 19 days ago

LLaDA2.0: Scaling Up Diffusion Language Models to 100B

Paper • 2512.15745 • Published Dec 10, 2025 • 78

upvoted 2 papers about 2 months ago

Unveiling Intrinsic Dimension of Texts: from Academic Abstract to Creative Story

Paper • 2511.15210 • Published Nov 19, 2025 • 89

Souper-Model: How Simple Arithmetic Unlocks State-of-the-Art LLM Performance

Paper • 2511.13254 • Published Nov 17, 2025 • 136

upvoted a paper 5 months ago

Enhancing Vision-Language Model Training with Reinforcement Learning in Synthetic Worlds for Real-World Success

Paper • 2508.04280 • Published Aug 6, 2025 • 35

upvoted an article 7 months ago

Article

No GPU left behind: Unlocking Efficiency with Co-located vLLM in TRL

Jun 3, 2025

•

upvoted a paper 7 months ago

STARFlow: Scaling Latent Normalizing Flows for High-resolution Image Synthesis

Paper • 2506.06276 • Published Jun 6, 2025 • 26

upvoted a collection 7 months ago

Qwen3

Collection

84 items • Updated 10 days ago • 1.55k

upvoted a paper 7 months ago

Qwen3 Technical Report

Paper • 2505.09388 • Published May 14, 2025 • 321

upvoted a paper 8 months ago

Train Sparse Autoencoders Efficiently by Utilizing Features Correlation

Paper • 2505.22255 • Published May 28, 2025 • 24

upvoted 2 papers 11 months ago

Analyze Feature Flow to Enhance Interpretation and Steering in Language Models

Paper • 2502.03032 • Published Feb 5, 2025 • 60

The Differences Between Direct Alignment Algorithms are a Blur

Paper • 2502.01237 • Published Feb 3, 2025 • 113

upvoted a paper over 1 year ago

DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models

Paper • 2402.03300 • Published Feb 5, 2024 • 138

Vadim Kurochkin

AI & ML interests

Recent Activity

Organizations

Vadim21221's activity

No GPU left behind: Unlocking Efficiency with Co-located vLLM in TRL