EryriLabs commited on
Commit
b5b90c8
·
verified ·
1 Parent(s): 893f716

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +39 -39
README.md CHANGED
@@ -1,39 +1,39 @@
1
- ---
2
- base_model:
3
- - deepseek-ai/DeepSeek-R1-Distill-Qwen-7B
4
- - vtriple/Qwen-2.5-7B-Threatflux
5
- library_name: transformers
6
- tags:
7
- - mergekit
8
- - merge
9
-
10
- ---
11
- # out
12
-
13
- This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
14
-
15
- ## Merge Details
16
- ### Merge Method
17
-
18
- This model was merged using the SLERP merge method.
19
-
20
- ### Models Merged
21
-
22
- The following models were included in the merge:
23
- * [deepseek-ai/DeepSeek-R1-Distill-Qwen-7B](https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Qwen-7B)
24
- * [vtriple/Qwen-2.5-7B-Threatflux](https://huggingface.co/vtriple/Qwen-2.5-7B-Threatflux)
25
-
26
- ### Configuration
27
-
28
- The following YAML configuration was used to produce this model:
29
-
30
- ```yaml
31
- models:
32
- - model: deepseek-ai/DeepSeek-R1-Distill-Qwen-7B
33
- - model: vtriple/Qwen-2.5-7B-Threatflux
34
- merge_method: slerp
35
- base_model: deepseek-ai/DeepSeek-R1-Distill-Qwen-7B
36
- dtype: bfloat16
37
- parameters:
38
- t: [0, 0.5, 0.25]
39
- ```
 
1
+ ---
2
+ base_model:
3
+ - deepseek-ai/DeepSeek-R1-Distill-Qwen-7B
4
+ - vtriple/Qwen-2.5-7B-Threatflux
5
+ library_name: transformers
6
+ tags:
7
+ - mergekit
8
+ - merge
9
+ license: llama3.1
10
+ ---
11
+ # DeepSeek-R1-Distill-Llama-Thinking-Farmer-8B
12
+
13
+ This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
14
+
15
+ ## Merge Details
16
+ ### Merge Method
17
+
18
+ This model was merged using the SLERP merge method.
19
+
20
+ ### Models Merged
21
+
22
+ The following models were included in the merge:
23
+ * [deepseek-ai/DeepSeek-R1-Distill-Qwen-7B](https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Qwen-7B)
24
+ * [vtriple/Qwen-2.5-7B-Threatflux](https://huggingface.co/vtriple/Qwen-2.5-7B-Threatflux)
25
+
26
+ ### Configuration
27
+
28
+ The following YAML configuration was used to produce this model:
29
+
30
+ ```yaml
31
+ models:
32
+ - model: deepseek-ai/DeepSeek-R1-Distill-Qwen-7B
33
+ - model: vtriple/Qwen-2.5-7B-Threatflux
34
+ merge_method: slerp
35
+ base_model: deepseek-ai/DeepSeek-R1-Distill-Qwen-7B
36
+ dtype: bfloat16
37
+ parameters:
38
+ t: [0, 0.5, 0.25]
39
+ ```