Update README.md
README.md CHANGED
@@ -22,15 +22,18 @@ language:
 
-This is a dpo aligned merge of
-It is a merge of the best german 7B models with 7b parameters as a dare_ties merge.
+This is a DPO-aligned merge of our favourite German models, scoring 7.11 on the mt-bench-de average.
 Since the original models are based on Mistral - three of them on the brilliant German LeoLM/leo-mistral-hessianai-7b - they are reunited in this merged model.
-Therefore the name, no nationalist ideas involved. To improve result quality they are dpo-trained with a german translation of intel-orca-dpo
-using our german fork of [LLaMA-Factory](https://github.com/mayflower/LLaMA-Factory-de).
+Therefore the name - no nationalist ideas involved :-). To improve result quality they are DPO-trained with a German translation of intel-orca-dpo; for this we added support for 10 German datasets to [LLaMA-Factory](https://github.com/hiyouga/LLaMA-Factory).
+
+Kudos to the authors of the original models at [DiscoResearch](https://huggingface.co/DiscoResearch) and [VAGOsolutions](https://huggingface.co/VAGOsolutions), and to [Malte Ostendorff](https://huggingface.co/malteos)
+and [Matthias Uhlig](https://huggingface.co/DRXD1000). We are your fan club.
+
+This model was brought to you - and the NVIDIA bill was paid - by [Mayflower GmbH](https://mayflower.de/).
+
+## Benchmark results: mt-bench-de
+
+Is the merged model alone already good? Well, of course. But it is even better with the help of some DPO tuning.
 
 ```json
 {
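To make the training step described in this change concrete, here is a minimal sketch of DPO on an intel-orca-dpo-style preference set. It uses trl's `DPOTrainer` rather than the LLaMA-Factory pipeline the authors actually used, the English `Intel/orca_dpo_pairs` dataset stands in for the German translation mentioned above, and the base model id and all hyperparameters are illustrative placeholders, not the values used for this model.

```python
# Minimal DPO sketch (NOT the LLaMA-Factory recipe used for this model).
# Assumes trl ~0.7, where DPOTrainer takes beta/tokenizer directly; the
# English Intel/orca_dpo_pairs stands in for the German translation, and
# all hyperparameters are placeholders.
from datasets import load_dataset
from transformers import AutoModelForCausalLM, AutoTokenizer, TrainingArguments
from trl import DPOTrainer

base = "LeoLM/leo-mistral-hessianai-7b"  # the base most of the merged models share
tokenizer = AutoTokenizer.from_pretrained(base)
model = AutoModelForCausalLM.from_pretrained(base)  # a 7B model needs a large GPU

# Preference pairs; DPOTrainer expects prompt/chosen/rejected columns.
ds = load_dataset("Intel/orca_dpo_pairs", split="train")
ds = ds.rename_columns({"question": "prompt"}).remove_columns(["system"])

args = TrainingArguments(
    output_dir="wiedervereinigung-7b-dpo",
    per_device_train_batch_size=1,
    gradient_accumulation_steps=16,
    learning_rate=5e-6,
    num_train_epochs=1,
    bf16=True,
    logging_steps=10,
)

trainer = DPOTrainer(
    model,
    ref_model=None,        # trl builds a frozen reference copy when None
    args=args,
    beta=0.1,              # strength of the KL penalty against the reference
    train_dataset=ds,
    tokenizer=tokenizer,
    max_length=1024,
    max_prompt_length=512,
)
trainer.train()
```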
@@ -57,7 +60,6 @@ Wiedervereinigung-7b itself is a [LazyMergekit](https://colab.research.google.c
 * [VAGOsolutions/SauerkrautLM-7b-v1-mistral](https://huggingface.co/VAGOsolutions/SauerkrautLM-7b-v1-mistral)
 * [malteos/hermeo-7b](https://huggingface.co/malteos/hermeo-7b)
 
-All the actual heavylifting has been done by the creators of these models.
 
 ## 🧩 Configuration
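For readers unfamiliar with dare_ties merges, the sketch below shows the general shape of a LazyMergekit-style mergekit run. It is not the configuration used for Wiedervereinigung-7b: the density and weight values are placeholders, and only the models visible in this diff hunk are listed; the real config lives in the Configuration section of the README.

```python
# Rough sketch of a dare_ties merge: write a mergekit YAML config, then run
# the mergekit-yaml CLI on it. Densities/weights are illustrative
# placeholders, and the model list is incomplete - see the README's
# Configuration section for the actual setup.
import subprocess

config = """\
models:
  - model: LeoLM/leo-mistral-hessianai-7b
    # base model receives no parameters
  - model: VAGOsolutions/SauerkrautLM-7b-v1-mistral
    parameters:
      density: 0.6  # fraction of delta weights kept; DARE drops the rest
      weight: 0.3   # this model's share in the weighted merge
  - model: malteos/hermeo-7b
    parameters:
      density: 0.6
      weight: 0.3
merge_method: dare_ties
base_model: LeoLM/leo-mistral-hessianai-7b
dtype: bfloat16
"""

with open("config.yaml", "w") as f:
    f.write(config)

# Requires `pip install mergekit`; writes the merged model to ./merged.
subprocess.run(["mergekit-yaml", "config.yaml", "merged", "--copy-tokenizer"], check=True)
```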
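Finally, a short usage sketch, assuming the standard transformers chat workflow; the repo id below is a guess from the model name in this README and should be checked against the actual model page.

```python
# Quick usage sketch with plain transformers. The repo id is an assumption
# based on the model name in this README; it also assumes the tokenizer
# ships a chat template.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "mayflowergmbh/Wiedervereinigung-7b-dpo"  # assumed repo id
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id, torch_dtype=torch.bfloat16, device_map="auto"
)

# "What does reunification mean?"
messages = [{"role": "user", "content": "Was bedeutet Wiedervereinigung?"}]
prompt = tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)

out = model.generate(**inputs, max_new_tokens=256, do_sample=True, temperature=0.7)
print(tokenizer.decode(out[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True))
```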