anbinx

AI & ML interests

None yet

Recent Activity

upvoted a paper 22 days ago

Relational Visual Similarity

upvoted an article about 2 months ago

🌳 QAT: The Art of Growing a Bonsai Model

upvoted a paper about 2 months ago

Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B

View all activity

Organizations

None yet

upvoted a paper 22 days ago

Relational Visual Similarity

Paper • 2512.07833 • Published 22 days ago • 24

upvoted an article about 2 months ago

Article

🌳 QAT: The Art of Growing a Bonsai Model

Nov 9

•

upvoted a paper about 2 months ago

Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B

Paper • 2511.06221 • Published Nov 9 • 131

upvoted a paper 4 months ago

Deep Think with Confidence

Paper • 2508.15260 • Published Aug 21 • 90

upvoted 2 papers 6 months ago

Scaling Laws for Optimal Data Mixtures

Paper • 2507.09404 • Published Jul 12 • 36

Does Math Reasoning Improve General LLM Capabilities? Understanding Transferability of LLM Reasoning

Paper • 2507.00432 • Published Jul 1 • 78

upvoted 2 papers 8 months ago

WorldPM: Scaling Human Preference Modeling

Paper • 2505.10527 • Published May 15 • 34

An Empirical Study of Qwen3 Quantization

Paper • 2505.02214 • Published May 4 • 25

upvoted an article 8 months ago

Article

I trained a Language Model to schedule events with GRPO!

Apr 29

•

upvoted 2 papers 10 months ago

Chain of Draft: Thinking Faster by Writing Less

Paper • 2502.18600 • Published Feb 25 • 50

Qwen2.5-VL Technical Report

Paper • 2502.13923 • Published Feb 19 • 211

upvoted 3 papers about 1 year ago

Training Large Language Models to Reason in a Continuous Latent Space

Paper • 2412.06769 • Published Dec 9, 2024 • 92

Self-Consistency Preference Optimization

Paper • 2411.04109 • Published Nov 6, 2024 • 19

Breaking the Memory Barrier: Near Infinite Batch Size Scaling for Contrastive Loss

Paper • 2410.17243 • Published Oct 22, 2024 • 92

upvoted a paper over 1 year ago

Instruction Following without Instruction Tuning

Paper • 2409.14254 • Published Sep 21, 2024 • 29

anbinx

AI & ML interests

Recent Activity

Organizations

anbinx's activity

🌳 QAT: The Art of Growing a Bonsai Model

I trained a Language Model to schedule events with GRPO!