Zheng
Libertaz
AI & ML interests
None yet
Recent Activity
upvoted an article 12 days ago
A Guide to Reinforcement Learning Post-Training for LLMs: PPO, DPO, GRPO, and Beyond new activity
over 1 year ago
llava-hf/llava-1.5-7b-hf:image processing is different from the github version liked
a dataset over 1 year ago
PKU-Alignment/PKU-SafeRLHF-30K Organizations
None yet