Ray2333 commited on
Commit
50aa8e7
·
verified ·
1 Parent(s): 123a793

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -18,7 +18,7 @@ This reward model is finetuned from [gemma-2b-it](https://huggingface.co/google/
18
  ## Evaluation
19
  We evaluate GRM 2B on the [reward model benchmark](https://huggingface.co/spaces/allenai/reward-bench), which achieves the **SOTA 2B Bradley–Terry model** Performance.
20
 
21
-
22
 
23
  | Model | Average | Chat | Chat Hard | Safety | Reasoning |
24
  |:-------------------------:|:-------------:|:---------:|:---------:|:--------:|:-----------:|
 
18
  ## Evaluation
19
  We evaluate GRM 2B on the [reward model benchmark](https://huggingface.co/spaces/allenai/reward-bench), which achieves the **SOTA 2B Bradley–Terry model** Performance.
20
 
21
+ **Note: Please download the `model.py` file from this repository to ensure the structure is loaded correctly and verify that the `v_head` is properly initialized.**
22
 
23
  | Model | Average | Chat | Chat Hard | Safety | Reasoning |
24
  |:-------------------------:|:-------------:|:---------:|:---------:|:--------:|:-----------:|