Downtown-Case committed on
Commit ea1b9be · verified · 1 Parent(s): ab488ec

Update README.md

Files changed (1)
  1. README.md +5 -5
README.md CHANGED
@@ -1,10 +1,12 @@
  ---
- base_model: []
+ base_model:
+ - ByteDance-Seed/Seed-OSS-36B-Instruct
+ - ByteDance-Seed/Seed-OSS-36B-Base
  library_name: transformers
  tags:
  - mergekit
  - merge
-
+ license: apache-2.0
  ---

  This is a merge of Bytedance Seed-OSS-36B Base and Instruct, using the karcher-means method in [mergekit](https://github.com/cg123/mergekit), with the idea being to get Bytedance Instruct to 'feel' and write more like a raw continuation model.
@@ -33,8 +35,6 @@ The following YAML configuration was used to produce this model:

  ```yaml
  models:
- # - model: /home/alpha/Models/Raw/Qwen_Qwen2.5-14B
- # No parameters necessary for base model
  - model: /home/alpha/Models/Raw/ByteDance-Seed_Seed-OSS-36B-Base
  - model: /home/alpha/Models/Raw/ByteDance-Seed_Seed-OSS-36B-Instruct
  merge_method: karcher
@@ -45,4 +45,4 @@ parameters:
  int8_mask: true
  dtype: bfloat16

- ```
+ ```
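
The README text in this diff names the karcher merge method without describing it. As a rough illustration only, the sketch below computes a Karcher (Fréchet) mean of unit vectors on a sphere, which is the general geometric idea behind the name. It is not mergekit's implementation: the function name `karcher_mean_sphere`, its parameters, and the use of plain Python lists in place of model weight tensors are all assumptions made for this example.

```python
import math


def _dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def _normalize(v):
    n = math.sqrt(_dot(v, v))
    return [x / n for x in v]


def karcher_mean_sphere(points, weights=None, iters=50, tol=1e-12):
    """Hypothetical helper: weighted Karcher mean of vectors on the unit sphere.

    Assumes the weights sum to 1 and the points are not antipodal
    (their weighted Euclidean mean must be non-zero).
    """
    pts = [_normalize(p) for p in points]
    if weights is None:
        weights = [1.0 / len(pts)] * len(pts)
    dim = len(pts[0])
    # Initial guess: normalized weighted Euclidean mean.
    mu = _normalize([sum(w * p[i] for w, p in zip(weights, pts)) for i in range(dim)])
    for _ in range(iters):
        # Average the log-map (tangent) vectors of all points at mu.
        tangent = [0.0] * dim
        for w, p in zip(weights, pts):
            c = max(-1.0, min(1.0, _dot(mu, p)))
            theta = math.acos(c)  # geodesic distance from mu to p
            ortho = [pi - c * mi for pi, mi in zip(p, mu)]
            n = math.sqrt(_dot(ortho, ortho))
            if n < 1e-15:
                continue  # p coincides with mu, so it contributes nothing
            for i in range(dim):
                tangent[i] += w * theta * ortho[i] / n
        step = math.sqrt(_dot(tangent, tangent))
        if step < tol:
            break  # mean tangent vector vanished: mu is the Karcher mean
        # Exponential map: move along the geodesic in the tangent direction.
        mu = _normalize(
            [math.cos(step) * m + math.sin(step) * t / step for m, t in zip(mu, tangent)]
        )
    return mu


# Two orthogonal unit vectors with equal weights: the mean is the geodesic midpoint.
midpoint = karcher_mean_sphere([[1.0, 0.0], [0.0, 1.0]])
```

With equal weights and two points, this reduces to the midpoint of the arc between them. Unequal weights move the result along the arc toward the more heavily weighted point.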