Ziniu Li

znli

AI & ML interests

None yet

Recent Activity

upvoted a paper 17 days ago

ReMax: A Simple, Effective, and Efficient Reinforcement Learning Method for Aligning Large Language Models

commented on a paper about 1 month ago

SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

liked a dataset about 2 months ago

allenai/olmo-mix-1124

Organizations

None yet

znli's activity

commented on a paper about 1 month ago

SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

Paper • 2501.17161 • Published Jan 28 • 108

New activity in Qwen/Qwen2.5-Math-RM-72B 5 months ago

Quantized Version

#8 opened 5 months ago by

commented on a paper 6 months ago

Training Language Models to Self-Correct via Reinforcement Learning

Paper • 2409.12917 • Published Sep 19, 2024 • 138