mkurman
posted an update 10 days ago
I've been working on something cool: a GRPO implementation with an LLM evaluator that can also, optionally, perform SFT on the feedback data. Check it out 😊

Any 🌟 are more than welcome 🤗

https://github.com/mkurman/grpo-llm-evaluator