2 4 1

Yingqian Min

EliverQ

AI & ML interests

None yet

Recent Activity

upvoted a paper 4 days ago

An Empirical Study on Eliciting and Improving R1-like Reasoning Models

upvoted a paper 4 days ago

R1-Searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning

authored a paper 4 days ago

R1-Searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning

View all activity

Organizations

EliverQ's activity

upvoted 2 papers 4 days ago

An Empirical Study on Eliciting and Improving R1-like Reasoning Models

Paper • 2503.04548 • Published 8 days ago • 8

R1-Searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning

Paper • 2503.05592 • Published 6 days ago • 24

authored a paper 4 days ago

R1-Searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning

Paper • 2503.05592 • Published 6 days ago • 24

commented a paper 4 days ago

An Empirical Study on Eliciting and Improving R1-like Reasoning Models

Paper • 2503.04548 • Published 8 days ago • 8 •

authored a paper 4 days ago

An Empirical Study on Eliciting and Improving R1-like Reasoning Models

Paper • 2503.04548 • Published 8 days ago • 8

updated a model about 2 months ago

RUC-AIBOX/STILL-3-1.5B-preview

Text Generation • Updated Jan 26 • 156 • 6

updated a dataset about 2 months ago

RUC-AIBOX/STILL-3-Preview-RL-Data

Viewer • Updated Jan 26 • 29.9k • 1.86k • 10

New activity in RUC-AIBOX/STILL-3-Preview-RL-Data about 2 months ago

Update README.md

#1 opened about 2 months ago by

ToheartZhang

published a dataset about 2 months ago

RUC-AIBOX/STILL-3-Preview-RL-Data

Viewer • Updated Jan 26 • 29.9k • 1.86k • 10

updated a model 2 months ago

RUC-AIBOX/Virgo-72B

Image-Text-to-Text • Updated Jan 10 • 42 • 6

updated a dataset 2 months ago

RUC-AIBOX/long_form_thought_data_5k

Viewer • Updated Dec 30, 2024 • 4.92k • 284 • 26

upvoted a paper 3 months ago

YuLan-Mini: An Open Data-efficient Language Model

Paper • 2412.17743 • Published Dec 23, 2024 • 65

liked a dataset 3 months ago

RUC-AIBOX/long_form_thought_data_5k

Viewer • Updated Dec 30, 2024 • 4.92k • 284 • 26

upvoted a paper 3 months ago

Imitate, Explore, and Self-Improve: A Reproduction Report on Slow-thinking Reasoning Systems

Paper • 2412.09413 • Published Dec 12, 2024 • 1

authored a paper 3 months ago

A Survey of Large Language Models

Paper • 2303.18223 • Published Mar 31, 2023 • 13

updated 3 models 3 months ago