12 21 31

Yizhi Li

yizhilll

https://yizhilll.github.io

AI & ML interests

None yet

Recent Activity

updated a dataset 7 days ago

O1-OPEN/OpenO1-SFT-Ultra

updated a dataset 13 days ago

m-a-p/OpenO1_SFT_ultra_BoN_positvie_reward_v3_N-sample

published a dataset 14 days ago

m-a-p/OpenO1_SFT_ultra_BoN_positvie_reward_v3_N-sample

View all activity

Organizations

yizhilll's activity

updated a dataset 7 days ago

O1-OPEN/OpenO1-SFT-Ultra

Viewer • Updated 7 days ago • 18.7M • 478 • 53

updated a dataset 13 days ago

m-a-p/OpenO1_SFT_ultra_BoN_positvie_reward_v3_N-sample

Viewer • Updated 13 days ago • 43M • 37

published a dataset 14 days ago

m-a-p/OpenO1_SFT_ultra_BoN_positvie_reward_v3_N-sample

Viewer • Updated 13 days ago • 43M • 37

upvoted a paper 14 days ago

Can Large Language Models Detect Errors in Long Chain-of-Thought Reasoning?

Paper • 2502.19361 • Published 15 days ago • 26

upvoted 2 papers 17 days ago

CodeCriticBench: A Holistic Code Critique Benchmark for Large Language Models

Paper • 2502.16614 • Published 19 days ago • 24

Audio-FLAN: A Preliminary Release

Paper • 2502.16584 • Published 19 days ago • 34

authored a paper 21 days ago

SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines

Paper • 2502.14739 • Published 21 days ago • 97

commented a paper 21 days ago

SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines

Paper • 2502.14739 • Published 21 days ago • 97 •

upvoted a paper 21 days ago

SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines

Paper • 2502.14739 • Published 21 days ago • 97

liked a model about 1 month ago

Qwen/Qwen2.5-VL-3B-Instruct

Image-Text-to-Text • Updated 27 days ago • 994k • 262

liked a dataset about 1 month ago

AI-MO/NuminaMath-1.5

Viewer • Updated Feb 10 • 896k • 4.4k • 119

updated 2 datasets about 1 month ago

m-a-p/OmniInstruct_v1

Viewer • Updated Jan 31 • 96.1k • 326 • 1

m-a-p/OmniBench

Viewer • Updated Jan 31 • 1.14k • 121 • 4

updated 3 datasets about 2 months ago