cstr/llama3-discolm-orca



cstr committed on Apr 22, 2024

Commit 34325e8 (verified) · 1 Parent(s): 8e3d75e

Upload folder using huggingface_hub

Files changed (1)

README.md +15 -18


@@ -3,38 +3,35 @@ tags:
 - merge
 - mergekit
 - lazymergekit
-- mlabonne/OrpoLlama-3-8B
+- meta-llama/Meta-Llama-3-8B
 - DiscoResearch/Llama3_DiscoLM_German_8b_v0.1_experimental
 base_model:
-- mlabonne/OrpoLlama-3-8B
+- meta-llama/Meta-Llama-3-8B
 - DiscoResearch/Llama3_DiscoLM_German_8b_v0.1_experimental
 ---
 # llama3-discolm-orpo-t2
 llama3-discolm-orpo-t2 is a merge of the following models using [LazyMergekit](https://colab.research.google.com/drive/1obulZ1ROXHjYLn6PPZJwRR6GzgQogxxb?usp=sharing):
-* [mlabonne/OrpoLlama-3-8B](https://huggingface.co/mlabonne/OrpoLlama-3-8B)
+* [meta-llama/Meta-Llama-3-8B](https://huggingface.co/meta-llama/Meta-Llama-3-8B)
 * [DiscoResearch/Llama3_DiscoLM_German_8b_v0.1_experimental](https://huggingface.co/DiscoResearch/Llama3_DiscoLM_German_8b_v0.1_experimental)
 ## 🧩 Configuration
 ```yaml
-slices:
-  - sources:
-      - model: mlabonne/OrpoLlama-3-8B
-        layer_range: [0, 32]
-      - model: DiscoResearch/Llama3_DiscoLM_German_8b_v0.1_experimental
-        layer_range: [0, 32]
-merge_method: slerp
-base_model: mlabonne/OrpoLlama-3-8B
-parameters:
-  t:
-    - filter: self_attn
-      value: [1, 0.7, 0.5, 0.3, 0.1]
-    - filter: mlp
-      value: [0, 0.3, 0.5, 0.7, 0.9]
-    - value: 0.5
+models:
+  - layer_range: [0, 40]
+    model: meta-llama/Meta-Llama-3-8B
+    parameters:
+      weight: 0.2
+  - layer_range: [0, 40]
+    model: DiscoResearch/Llama3_DiscoLM_German_8b_v0.1_experimental
+    parameters:
+      weight: 0.8
+merge_method: task_arithmetic
+base_model: meta-llama/Meta-Llama-3-8B
 dtype: bfloat16
+random_seed: 0
 ```
 ## 💻 Usage
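
This commit switches `merge_method` from `slerp` to `task_arithmetic`, with weights 0.2 and 0.8. As a rough illustration of what task arithmetic generally computes, the sketch below adds weighted "task vectors" (fine-tuned weights minus base weights) back onto the base. It is not mergekit's implementation: the `task_arithmetic` helper, the flat lists standing in for tensors, and the numeric values are all hypothetical, and it assumes the weights are applied as given, without normalization.

```python
from typing import List, Tuple

Tensor = List[float]  # flat stand-in for one weight tensor (assumption)


def task_arithmetic(base: Tensor, models: List[Tuple[Tensor, float]]) -> Tensor:
    """Return base + sum_i weight_i * (model_i - base), element by element."""
    merged = list(base)
    for params, weight in models:
        if len(params) != len(base):
            raise ValueError("all tensors must have the same shape")
        for i, (p, b) in enumerate(zip(params, base)):
            # (p - b) is this model's task vector for element i
            merged[i] += weight * (p - b)
    return merged


# Made-up values; real checkpoints hold billions of parameters.
base = [1.0, 2.0, -1.0]    # stands in for meta-llama/Meta-Llama-3-8B
disco = [2.0, 0.0, -1.0]   # stands in for the DiscoLM German model

# Weights taken from the new config: 0.2 for the base entry, 0.8 for DiscoLM.
merged = task_arithmetic(base, [(base, 0.2), (disco, 0.8)])

# The same merge with the base entry left out of the model list.
merged_without_base_entry = task_arithmetic(base, [(disco, 0.8)])
```

Under this formulation the base model's own task vector is zero, so its 0.2 weight does not change the result, and the merge reduces to moving 80% of the way from the base weights toward the DiscoLM weights. Whether mergekit treats the base entry the same way is an assumption here, not something the diff states.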