Commit 6fff39e by lbourdois · verified · 1 Parent(s): 997cda7

Improve language tag

Hi! Since the model is multilingual, this PR adds languages other than English to the `language` tag to improve referencing. Note that the README announces 29 languages but explicitly lists only 13, so I was only able to add those 13.
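
For anyone who wants to make the same kind of change on another model card, here is a minimal sketch of inserting a `language:` list into a README's YAML front matter. The helper name `add_language_tags` is hypothetical and this is not necessarily how this PR was produced; it only assumes the front matter is delimited by `---` lines, as in the README changed here.

```python
from typing import List


def add_language_tags(readme: str, codes: List[str]) -> str:
    """Insert a `language:` list just before the closing `---` of the front matter.

    Hypothetical helper for illustration; it does plain line handling and does
    not parse YAML, so it assumes no `language:` key is present yet.
    """
    lines = readme.split("\n")
    if not lines or lines[0].strip() != "---":
        raise ValueError("README has no YAML front matter")

    # Locate the closing delimiter of the front matter.
    for end in range(1, len(lines)):
        if lines[end].strip() == "---":
            break
    else:
        raise ValueError("front matter is not closed")

    block = ["language:"] + [f"- {code}" for code in codes]
    return "\n".join(lines[:end] + block + lines[end:])


# Example input mirroring the shape of the card in this PR (shortened).
example = "---\ntags:\n- merge\n---\n# merge\n"
updated = add_language_tags(example, ["zho", "eng"])
```

The three-letter codes (`zho`, `eng`, ...) are passed through unchanged; checking them against a list of valid language codes is left out of this sketch.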

Files changed (1)
  1. README.md +55 -42
README.md CHANGED
@@ -1,42 +1,55 @@
- ---
- base_model:
- - Qwen/Qwen2.5-0.5B-Instruct
- library_name: transformers
- tags:
- - mergekit
- - merge
-
- ---
- # merge
-
- This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
-
- ## Merge Details
- ### Merge Method
-
- This model was merged using the [task arithmetic](https://arxiv.org/abs/2212.04089) merge method using [Qwen/Qwen2.5-0.5B-Instruct](https://huggingface.co/Qwen/Qwen2.5-0.5B-Instruct) as a base.
-
- ### Models Merged
-
- The following models were included in the merge:
-
-
- ### Configuration
-
- The following YAML configuration was used to produce this model:
-
- ```yaml
- base_model: Qwen/Qwen2.5-0.5B-Instruct
- dtype: bfloat16
- merge_method: task_arithmetic
- parameters:
-   normalize: false
- slices:
- - sources:
-   - layer_range: [0, 16]
-     model: Qwen/Qwen2.5-0.5B-Instruct
-   - layer_range: [0, 16]
-     model: Qwen/Qwen2.5-0.5B-Instruct
-     parameters:
-       weight: 1.0
- ```
+ ---
+ base_model:
+ - Qwen/Qwen2.5-0.5B-Instruct
+ library_name: transformers
+ tags:
+ - mergekit
+ - merge
+ language:
+ - zho
+ - eng
+ - fra
+ - spa
+ - por
+ - deu
+ - ita
+ - rus
+ - jpn
+ - kor
+ - vie
+ - tha
+ - ara
+ ---
+ # merge
+
+ This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
+
+ ## Merge Details
+ ### Merge Method
+
+ This model was merged using the [task arithmetic](https://arxiv.org/abs/2212.04089) merge method using [Qwen/Qwen2.5-0.5B-Instruct](https://huggingface.co/Qwen/Qwen2.5-0.5B-Instruct) as a base.
+
+ ### Models Merged
+
+ The following models were included in the merge:
+
+
+ ### Configuration
+
+ The following YAML configuration was used to produce this model:
+
+ ```yaml
+ base_model: Qwen/Qwen2.5-0.5B-Instruct
+ dtype: bfloat16
+ merge_method: task_arithmetic
+ parameters:
+   normalize: false
+ slices:
+ - sources:
+   - layer_range: [0, 16]
+     model: Qwen/Qwen2.5-0.5B-Instruct
+   - layer_range: [0, 16]
+     model: Qwen/Qwen2.5-0.5B-Instruct
+     parameters:
+       weight: 1.0
+ ```
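
For readers unfamiliar with the merge method named in the README above, the following is a rough sketch of task arithmetic as commonly described: each fine-tuned model contributes its difference from the base (its "task vector"), scaled by a weight, and the sum is added back to the base. The `Weights` type and the `task_arithmetic` function are hypothetical stand-ins using plain Python lists; this is not mergekit's actual implementation, and it assumes no normalization, matching `normalize: false` in the configuration.

```python
from typing import Dict, List, Sequence, Tuple

# Hypothetical stand-in for a model state dict: parameter name -> flat values.
Weights = Dict[str, List[float]]


def task_arithmetic(base: Weights, models: Sequence[Tuple[Weights, float]]) -> Weights:
    """Return base + sum_i weight_i * (model_i - base), parameter by parameter."""
    merged: Weights = {}
    for name, base_values in base.items():
        out = list(base_values)
        for model, weight in models:
            for i, (b, v) in enumerate(zip(base_values, model[name])):
                out[i] += weight * (v - b)  # add the scaled task vector
        merged[name] = out
    return merged


# Toy values, not real model weights.
base = {"layer.0": [1.0, 2.0]}
tuned = {"layer.0": [1.5, 1.0]}

merged = task_arithmetic(base, [(tuned, 1.0)])
unchanged = task_arithmetic(base, [(base, 1.0)])
```

Under this formula, a source model identical to the base has a zero task vector, so `unchanged` equals `base`.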