Mingyang Song's picture

3 7 18

Mingyang Song

Nickyang

·

nick7nlp

AI & ML interests

LRMs, Long-Context LLMs, LLM Judges, Many-Shot ICL

Recent Activity

upvoted a collection 2 days ago

liked a model 2 days ago

tencent/Hunyuan-MT-Chimera-7B-fp8

liked a model 2 days ago

tencent/Hunyuan-MT-7B-fp8

View all activity

Organizations

None yet

upvoted a collection 2 days ago

Hunyuan-MT

4 items • Updated 2 days ago • 22

liked 4 models 2 days ago

tencent/Hunyuan-MT-Chimera-7B-fp8

Translation • 8B • Updated about 14 hours ago • 61 • 12

tencent/Hunyuan-MT-7B-fp8

Translation • 8B • Updated about 14 hours ago • 132 • 18

tencent/Hunyuan-MT-Chimera-7B

Translation • 8B • Updated about 14 hours ago • 132 • 40

tencent/Hunyuan-MT-7B

Translation • 8B • Updated about 14 hours ago • 487 • 380

updated 2 models 3 months ago

Nickyang/ConciseR-Zero-7B

Text Generation • Updated Jun 6 • 22 • 1

Nickyang/ConciseR-Zero-7B-Preview

Text Generation • Updated Jun 6 • 8 • 1

upvoted a paper 3 months ago

Can Many-Shot In-Context Learning Help Long-Context LLM Judges? See More, Judge Better!

Paper • 2406.11629 • Published Jun 17, 2024 • 1

liked a model 3 months ago

deepseek-ai/DeepSeek-R1-0528-Qwen3-8B

Text Generation • 8B • Updated May 29 • 352k • • 950

updated 2 collections 3 months ago

FastCuRL

The collection for the Paper "Curriculum Reinforcement Learning with Stage-wise Context Scaling for Efficient Training R1-like Reasoning Models" • 6 items • Updated May 29 • 2

ConciseR

The collection for the Paper "Walk Before You Run! Concise LLM Reasoning via Reinforcement Learning" • 5 items • Updated Jun 4 • 1

updated a dataset 3 months ago

Nickyang/ConciseR-Data

Viewer • Updated May 28 • 68.2k • 20 • 1

liked 3 models 3 months ago

Qwen/Qwen2.5-Math-7B

Text Generation • 8B • Updated Sep 23, 2024 • 90.7k • • 98

Nickyang/ConciseR-Zero-7B-Preview

Text Generation • Updated Jun 6 • 8 • 1

Nickyang/ConciseR-Zero-7B

Text Generation • Updated Jun 6 • 22 • 1

updated a collection 3 months ago

ConciseR

The collection for the Paper "Walk Before You Run! Concise LLM Reasoning via Reinforcement Learning" • 5 items • Updated Jun 4 • 1

authored 2 papers 3 months ago

SSR-Zero: Simple Self-Rewarding Reinforcement Learning for Machine Translation

Paper • 2505.16637 • Published May 22

Walk Before You Run! Concise LLM Reasoning via Reinforcement Learning

Paper • 2505.21178 • Published May 27 • 6

liked a dataset 3 months ago

Nickyang/ConciseR-Data

Viewer • Updated May 28 • 68.2k • 20 • 1