---
base_model:
- DRXD1000/Phoenix
- VAGOsolutions/SauerkrautLM-7b-v1-mistral
- malteos/hermeo-7b
license: apache-2.0
language:
- de
- en
---

# Wiedervereinigung-7b-dpo

This is a DPO-aligned merge of several German models that scores 7.1 on the mt-bench-de average.
It combines some of the best German 7B models in a dare_ties merge.
Since the source models are all based on Mistral, three of them on the brilliant German LeoLM/leo-mistral-hessianai-7b, they are reunited in this merged model.
Hence the name; no nationalist ideas involved. To improve result quality, the merge was DPO-trained on a German translation of intel-orca-dpo using our German fork of [LLaMA-Factory](https://github.com/mayflower/LLaMA-Factory-de).

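For context, DPO training consumes preference pairs rather than plain instruction data. A record in the Intel orca-dpo style might look like the sketch below; the field names follow the orca-dpo-pairs layout, and the German values are invented examples, not taken from the actual translated dataset.

```python
import json

# Illustrative preference-pair record (fields assumed from the orca-dpo
# layout: "system", "question", "chosen", "rejected"); values are invented.
pair = {
    "system": "Du bist ein hilfreicher Assistent.",
    "question": "Was ist die Hauptstadt von Deutschland?",
    "chosen": "Die Hauptstadt von Deutschland ist Berlin.",
    "rejected": "Deutschland hat keine Hauptstadt.",
}

# DPO tunes the policy to prefer "chosen" over "rejected" for the same prompt.
record = json.dumps(pair, ensure_ascii=False)
print(record)
```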
## mt-bench-de

Is the merged model good? Well, of course. But it is even better with the help of some DPO tuning.

```json
{
  "first_turn": 7.3,
  "second_turn": 6.925,
  "categories": {
    "writing": 8.425,
    "roleplay": 8.6,
    "reasoning": 5.4,
    "math": 4.35,
    "coding": 4.3,
    "extraction": 7.975,
    "stem": 8.5,
    "humanities": 9.35
  },
  "average": 7.1125
}
```

Wiedervereinigung-7b itself is a [LazyMergekit](https://colab.research.google.com/drive/1obulZ1ROXHjYLn6PPZJwRR6GzgQogxxb?usp=sharing) merge of:
* [DiscoResearch/DiscoLM_German_7b_v1](https://huggingface.co/DiscoResearch/DiscoLM_German_7b_v1)
* [DRXD1000/Phoenix](https://huggingface.co/DRXD1000/Phoenix)
* [VAGOsolutions/SauerkrautLM-7b-v1-mistral](https://huggingface.co/VAGOsolutions/SauerkrautLM-7b-v1-mistral)
* [malteos/hermeo-7b](https://huggingface.co/malteos/hermeo-7b)

```yaml
# … (merge configuration truncated in this excerpt)
dtype: bfloat16
```
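For intuition on the dare_ties method named above: DARE randomly drops most of each fine-tune's weight delta against the base model and rescales the survivors so the expected delta is unchanged, before the TIES-style sign-consensus merge. A toy sketch of the DARE step (list-of-floats stand-in for a real weight tensor, names are ours, not mergekit's API):

```python
import random

def dare_sparsify(delta, density, seed=0):
    # DARE: keep each delta weight with probability `density` and rescale
    # survivors by 1/density, so the expected contribution stays the same.
    rng = random.Random(seed)
    return [d / density if rng.random() < density else 0.0 for d in delta]

# Roughly half the entries survive, doubled, so the mean is preserved.
out = dare_sparsify([1.0] * 1000, density=0.5)
print(sum(out) / len(out))
```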
## 💻 Usage