GSY

XiaoY1

AI & ML interests

None yet

Recent Activity

upvoted a paper 2 days ago

Scaling Laws for Code: Every Programming Language Matters

Paper • 2512.13472 • Published 11 days ago • 8

upvoted a paper 7 months ago

Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning

Paper • 2506.01939 • Published Jun 2 • 187

upvoted a paper 9 months ago

A Comprehensive Survey on Long Context Language Modeling

Paper • 2503.17407 • Published Mar 20 • 49

liked a model 10 months ago

Qwen/Qwen2.5-Coder-7B-Instruct

Text Generation • 8B • Updated Jan 12 • 556k • 579

upvoted 3 papers 10 months ago

YuE: Scaling Open Foundation Models for Long-Form Music Generation

Paper • 2503.08638 • Published Mar 11 • 71

HybridNorm: Towards Stable and Efficient Transformer Training via Hybrid Normalization

Paper • 2503.04598 • Published Mar 6 • 21

Scale-Distribution Decoupling: Enabling Stable and Effective Training of Large Language Models

Paper • 2502.15499 • Published Feb 21 • 15

liked a dataset 10 months ago

m-a-p/SuperGPQA

Viewer • Updated Apr 30 • 26.5k • 20.9k • 77

upvoted a paper 10 months ago

SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines

Paper • 2502.14739 • Published Feb 20 • 106

liked a dataset 11 months ago

CSJianYang/CodeArena

Viewer • Updated Dec 18, 2024 • 397 • 1.64k • 15

upvoted a paper about 1 year ago

Evaluating and Aligning CodeLLMs on Human Preference

Paper • 2412.05210 • Published Dec 6, 2024 • 50

updated 6 models over 1 year ago

upvoted a paper over 1 year ago

Towards a Unified View of Preference Learning for Large Language Models: A Survey

Paper • 2409.02795 • Published Sep 4, 2024 • 72

updated 2 models over 1 year ago

XiaoY1/Qwen2-7B-Instruct-DPO-code-beta0.5

Updated Sep 9, 2024 • 15

XiaoY1/Qwen2-7B-Instruct-DPO-math-beta0.5

Updated Sep 9, 2024 • 14
