Join the conversation

Join the community of Machine Learners and AI enthusiasts.

Sign Up
sergiopaniegoΒ 
posted an update Jul 29
Post
2661
We just released TRL v0.20 with major multimodal upgrades!

πŸ‘οΈ VLM support for GRPO (highly requested by the community!)
🎞️ New GSPO trainer (from @Qwen , released last week, VLM-ready)
πŸ™ New MPO trainer (multimodal by design, as in the paper)

πŸ“ Full release notes here: https://github.com/huggingface/trl/releases/tag/v0.20.0
In this post