Taiwei Shi

MaksimSTW

https://taiweis.com

AI & ML interests

reinforcement learning, alignment, human-AI collaboration, and computational social science

Recent Activity

authored a paper 6 days ago

Video-Based Reward Modeling for Computer-Use Agents

upvoted a paper 17 days ago

Video-Based Reward Modeling for Computer-Use Agents

authored a paper 26 days ago

DP-RFT: Learning to Generate Synthetic Text via Differentially Private Reinforcement Fine-Tuning

Organizations

MaksimSTW's models

None public yet