ymh233

AI & ML interests

None yet

Recent Activity

upvoted a paper 7 days ago

Beyond Pass@1: Self-Play with Variational Problem Synthesis Sustains RLVR

Paper • 2508.14029 • Published 14 days ago • 116

upvoted a paper 12 days ago

MM-BrowseComp: A Comprehensive Benchmark for Multimodal Browsing Agents

Paper • 2508.13186 • Published 19 days ago • 17

upvoted a paper about 2 months ago

SWE-Perf: Can Language Models Optimize Code Performance on Real-World Repositories?

Paper • 2507.12415 • Published Jul 16 • 41

upvoted 2 papers 3 months ago

QuickVideo: Real-Time Long Video Understanding with System Algorithm Co-Design

Paper • 2505.16175 • Published May 22 • 42

Pixel Reasoner: Incentivizing Pixel-Space Reasoning with Curiosity-Driven Reinforcement Learning

Paper • 2505.15966 • Published May 21 • 53

upvoted a paper 4 months ago

AttentionInfluence: Adopting Attention Head Influence for Weak-to-Strong Pretraining Data Selection

Paper • 2505.07293 • Published May 12 • 27

upvoted 2 papers 5 months ago

COIG-P: A High-Quality and Large-Scale Chinese Preference Dataset for Alignment with Human Values

Paper • 2504.05535 • Published Apr 7 • 45

SimpleRL-Zoo: Investigating and Taming Zero Reinforcement Learning for Open Base Models in the Wild

Paper • 2503.18892 • Published Mar 24 • 32

upvoted a paper 6 months ago

Process-based Self-Rewarding Language Models

Paper • 2503.03746 • Published Mar 5 • 40

upvoted a paper 7 months ago

Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models

Paper • 2501.13629 • Published Jan 23 • 49

liked a dataset 11 months ago

Jianwen2003/DA-Code

Viewer • Updated Oct 8, 2024 • 500 • 91 • 4

liked 6 datasets over 1 year ago

liked a model over 1 year ago

miqudev/miqu-1-70b

69B • Updated Feb 4, 2024 • 301 • 987

New activity in Vivacem/MMIQC over 1 year ago

The number of data sets is inconsistent with the paper

#2 opened over 1 year ago by ymh233

liked a dataset over 1 year ago

camel-ai/math

Viewer • Updated Jun 22, 2023 • 50k • 425 • 110
