Update README.md
README.md
CHANGED
@@ -4,33 +4,33 @@ tags:
 - merge
 - mergekit
 - lazymergekit
--
+- OpenPipe/mistral-ft-optimized-1227
 - machinists/Mistral-7B-SQL
 ---
 
 # haLLAwa2
 
 haLLAwa2 is a merge of the following models using [mergekit](https://github.com/cg123/mergekit):
-
-* [machinists/Mistral-7B-SQL](https://huggingface.co/machinists/Mistral-7B-SQL)
+
 
 ## 🧩 Configuration
 
 \```yaml
 slices:
   - sources:
-      - model:
+      - model: OpenPipe/mistral-ft-optimized-1227
         layer_range: [0, 32]
       - model: machinists/Mistral-7B-SQL
         layer_range: [0, 32]
+
 merge_method: slerp
-base_model:
+base_model: OpenPipe/mistral-ft-optimized-1227
 parameters:
   t:
     - filter: self_attn
       value: [0, 0.5, 0.3, 0.7, 1]
     - filter: mlp
       value: [1, 0.5, 0.7, 0.3, 0]
-    - value: 0.5
+    - value: 0.5 # fallback for rest of tensors
 dtype: bfloat16
 \```
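
For readers unfamiliar with `merge_method: slerp`, the following is a rough, pure-Python sketch of the idea behind the configuration above. It is not mergekit's implementation. The helper names `slerp` and `layer_t` are hypothetical. The sketch assumes that a `value` list such as `[0, 0.5, 0.3, 0.7, 1]` is spread across the 32 layers by piecewise-linear interpolation, and that `t = 0` corresponds to `base_model` while `t = 1` corresponds to the other model.

```python
import math


def slerp(t, v0, v1, eps=1e-8):
    """Spherical linear interpolation between two flat weight vectors.

    Illustrative only: real merges operate on full tensors, not Python lists.
    """
    norm0 = math.sqrt(sum(x * x for x in v0))
    norm1 = math.sqrt(sum(x * x for x in v1))
    linear = [(1 - t) * a + t * b for a, b in zip(v0, v1)]
    if norm0 < eps or norm1 < eps:
        # A zero vector has no direction, so fall back to plain interpolation.
        return linear
    cos_theta = sum(a * b for a, b in zip(v0, v1)) / (norm0 * norm1)
    cos_theta = max(-1.0, min(1.0, cos_theta))  # guard against rounding error
    theta = math.acos(cos_theta)
    sin_theta = math.sin(theta)
    if abs(sin_theta) < eps:
        # Nearly parallel vectors: slerp degenerates to linear interpolation.
        return linear
    w0 = math.sin((1 - t) * theta) / sin_theta
    w1 = math.sin(t * theta) / sin_theta
    return [w0 * a + w1 * b for a, b in zip(v0, v1)]


def layer_t(anchors, layer_idx, num_layers):
    """Assumed schedule: interpolate the anchor list linearly across layers."""
    if num_layers == 1 or len(anchors) == 1:
        return float(anchors[0])
    pos = layer_idx / (num_layers - 1) * (len(anchors) - 1)
    lo = int(math.floor(pos))
    hi = min(lo + 1, len(anchors) - 1)
    frac = pos - lo
    return (1 - frac) * anchors[lo] + frac * anchors[hi]


if __name__ == "__main__":
    self_attn = [0, 0.5, 0.3, 0.7, 1]  # values taken from the config above
    for layer in (0, 8, 16, 24, 31):
        print(layer, round(layer_t(self_attn, layer, 32), 3))
```

Under these assumptions, the `self_attn` weights start close to the base model in the early layers and end close to `machinists/Mistral-7B-SQL` in the last ones, the `mlp` weights follow the reverse schedule, and every other tensor uses the constant `0.5`.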