Probing Preference Representations: A Multi-Dimensional Evaluation and Analysis Method for Reward Models
wangchenglong
wangclnlp
·
AI & ML interests
None yet
Recent Activity
updated
a collection
2 days ago
Probing-RM
upvoted
a
paper
2 months ago
Parallel-R1: Towards Parallel Thinking via Reinforcement Learning
upvoted
a
paper
2 months ago
GRAM-R^2: Self-Training Generative Foundation Reward Models for Reward
Reasoning