Zheng

Libertaz

Libertaz Zheng

AI & ML interests

None yet

Recent Activity

upvoted an article 12 days ago

A Guide to Reinforcement Learning Post-Training for LLMs: PPO, DPO, GRPO, and Beyond

new activity over 1 year ago

llava-hf/llava-1.5-7b-hf: image processing is different from the GitHub version

liked a dataset over 1 year ago

PKU-Alignment/PKU-SafeRLHF-30K

Organizations

None yet

models 0

None public yet

datasets 0

None public yet