@mkurman on Hugging Face: "I've been working on something cool: a GRPO with an LLM evaluator that can…"

mkurman posted an update 10 days ago

I've been working on something cool: a GRPO implementation with an LLM evaluator that can optionally also perform SFT on the feedback data. Check it out 😊

Any 🌟 are more than welcome 🤗

https://github.com/mkurman/grpo-llm-evaluator
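
For readers new to the idea, here is a rough sketch of how an evaluator's scores could feed GRPO-style group-relative advantages. It is not taken from the repository: `toy_evaluator` is a hypothetical stand-in for a real LLM judge, the function names are made up for illustration, and the use of the population standard deviation is an assumption.

```python
from statistics import mean, pstdev
from typing import Callable, List


def group_relative_advantages(rewards: List[float], eps: float = 1e-8) -> List[float]:
    """Normalize rewards within one group of completions sampled for the same prompt."""
    mu = mean(rewards)
    sigma = pstdev(rewards)  # assumption: population std; some implementations differ
    return [(r - mu) / (sigma + eps) for r in rewards]


def toy_evaluator(prompt: str, completion: str) -> float:
    """Hypothetical placeholder for an LLM judge: longer answers score higher, capped at 1.0."""
    return min(len(completion) / 10.0, 1.0)


def score_group(
    prompt: str,
    completions: List[str],
    evaluator: Callable[[str, str], float],
) -> List[float]:
    """Score every completion with the evaluator, then convert scores to advantages."""
    rewards = [evaluator(prompt, c) for c in completions]
    return group_relative_advantages(rewards)


if __name__ == "__main__":
    advantages = score_group("2 + 2 = ?", ["4", "The answer is 4."], toy_evaluator)
    print(advantages)
```

Completions that the evaluator rates above the group average get a positive advantage and those below get a negative one; if every completion in a group receives the same score, all advantages are zero.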

In this post

mkurman Mariusz Kurman