Update README.md
README.md
CHANGED
@@ -52,8 +52,8 @@ dataset for this model. This results in a DPO dataset composed by triplets < ”
 | | OR Bench 80K (refusal) ↓ | OR Bench Hard (refusal) ↓ |
 |------------------------------|:---------------------:|:---------------:|
-| Qwen-2.5-
-| Qwen-2.5-
+| Qwen-2.5-72B-Instruct | 0.015 | 0.102 |
+| Qwen-2.5-72B-Instruct-Egida-DPO | 0.016 | 0.170 |
 
 Note that this refusal ratio is computed as keyword matching with a curated list of keywords. For more information, check the paper.
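The note in this hunk says the refusal ratio is computed by keyword matching. A minimal sketch of that idea follows, assuming a case-insensitive substring match. The keyword list, `is_refusal`, and `refusal_ratio` are hypothetical names chosen for illustration; the curated list actually used for the reported numbers is described in the paper and is not reproduced here.

```python
# Hypothetical keyword list for illustration only; the curated list behind
# the reported numbers is not part of this diff.
REFUSAL_KEYWORDS = ("i'm sorry", "i cannot", "i can't", "as an ai")


def is_refusal(response, keywords=REFUSAL_KEYWORDS):
    """Return True if the response contains any refusal keyword."""
    # Lowercase and normalize curly apostrophes so "I’m" matches "i'm".
    text = response.lower().replace("\u2019", "'")
    return any(keyword in text for keyword in keywords)


def refusal_ratio(responses, keywords=REFUSAL_KEYWORDS):
    """Fraction of responses flagged as refusals (0.0 for an empty input)."""
    responses = list(responses)
    if not responses:
        return 0.0
    flagged = sum(is_refusal(r, keywords) for r in responses)
    return flagged / len(responses)


if __name__ == "__main__":
    sample = [
        "I'm sorry, but I can't help with that.",
        "Sure, here is a summary of the article.",
    ]
    print(refusal_ratio(sample))
```

Substring matching of this kind can miss refusals phrased outside the list and can flag benign responses that happen to contain a keyword, so the resulting ratio is best read as an approximation.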