Yanggang Wang

esheep
·

AI & ML interests

None yet

Recent Activity

Organizations

None yet

esheep's activity

upvoted an article 2 days ago
commented on Open R1: Update #2 2 days ago
view reply

How exactly is the Qwen/Qwen2.5-Math-RM-72B model used? Is it solely for ranking multiple answers? Can it also serve as a tool to validate whether the answers are correct?