Update README.md
README.md
CHANGED
@@ -1,3 +1,19 @@
 ---
+base_model: mlabonne/NeuralMarcoro14-7B
 license: apache-2.0
+tags:
+- mlabonne/NeuralMarcoro14-7B
+- dpo
+- 7B
+- winograd
+- mmlu_abstract_algebra
+- mistral
+datasets:
+- hromi/winograd_dpo_basic
 ---
+
+![](https://wizzion.com/sojka.jpg)
+
+# UDKai_Garrulus
+
+A less contaminated version of [udkai/Garrulus](https://huggingface.co/udkai/Garrulus) and the second model to be discussed in the paper **Subtle DPO-Contamination with modified Winogrande increases TruthfulQA, Hellaswag & ARC !**