anbinx's picture

15 28

anbinx

anbinx

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 23 days ago

Relational Visual Similarity

upvoted an article about 2 months ago

🌳 QAT: The Art of Growing a Bonsai Model

upvoted a paper about 2 months ago

Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B

View all activity

Organizations

None yet

upvoted a paper 23 days ago

Relational Visual Similarity

Paper • 2512.07833 • Published 23 days ago • 24

upvoted an article about 2 months ago

Article

🌳 QAT: The Art of Growing a Bonsai Model

Nov 9

•

15

upvoted a paper about 2 months ago

Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B

Paper • 2511.06221 • Published Nov 9 • 131

liked a Space about 2 months ago

The Smol Training Playbook

The secrets to building world-class LLMs

updated a collection 4 months ago

大模型idea

18 items • Updated Sep 15 • 1

liked a dataset 4 months ago

HuggingFaceM4/FineVision

Viewer • Updated Oct 21 • 24.2M • 106k • 463

upvoted a paper 4 months ago

Deep Think with Confidence

Paper • 2508.15260 • Published Aug 21 • 90

updated a collection 4 months ago

大模型idea

18 items • Updated Sep 15 • 1

upvoted a paper 6 months ago

Scaling Laws for Optimal Data Mixtures

Paper • 2507.09404 • Published Jul 12 • 36

updated a collection 6 months ago

大模型idea

18 items • Updated Sep 15 • 1

upvoted a paper 6 months ago

Does Math Reasoning Improve General LLM Capabilities? Understanding Transferability of LLM Reasoning

Paper • 2507.00432 • Published Jul 1 • 79

updated a collection 6 months ago

大模型idea

18 items • Updated Sep 15 • 1

liked a model 7 months ago

deepseek-ai/DeepSeek-R1-0528-Qwen3-8B

Text Generation • 8B • Updated May 29 • 473k • • 1.01k

upvoted a paper 8 months ago

WorldPM: Scaling Human Preference Modeling

Paper • 2505.10527 • Published May 15 • 34

updated a collection 8 months ago

大模型idea

18 items • Updated Sep 15 • 1

upvoted a paper 8 months ago

An Empirical Study of Qwen3 Quantization

Paper • 2505.02214 • Published May 4 • 25

updated a collection 8 months ago

大模型idea

18 items • Updated Sep 15 • 1

upvoted an article 8 months ago

Article

I trained a Language Model to schedule events with GRPO!

Apr 29

•

91