AI & ML interests

Reinforcement Learning, Large Language Models, Value Alignment

Recent Activity

XuyaoWang  updated a dataset 4 days ago
PKU-Alignment/s1-m_beta
XuyaoWang  published a dataset 6 days ago
PKU-Alignment/s1-m_beta
XuyaoWang  updated a model 6 days ago
PKU-Alignment/s1-m_7b_beta
View all activity