---
base_model:
- MaziyarPanahi/calme-2.4-rys-78B
- Sakalti/ultiima-78B
library_name: transformers
tags:
- mergekit
- merge
license: other
license_name: qwen
license_link: https://huggingface.co/Qwen/Qwen2.5-72B-Instruct/blob/main/LICENSE
---
# merge

Part of the ultiima series. Built with Qwen.

This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).

## Merge Details
### Merge Method

This model was merged using the [Linear DELLA](https://arxiv.org/abs/2406.11617) merge method, with [Sakalti/ultiima-78B](https://huggingface.co/Sakalti/ultiima-78B) as the base model.
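
The `della_linear` method prunes each model's delta from the base by magnitude before combining: in the configuration below, `epsilon` controls how strongly drop probabilities vary with delta magnitude, `lambda` scales the merged deltas, and each model contributes with its own `weight` and `density`. As a rough illustration only, not mergekit's implementation, the toy sketch below shows the DELLA idea on a single tensor; all function names here are hypothetical.

```python
import torch

def della_prune(delta: torch.Tensor, density: float, epsilon: float) -> torch.Tensor:
    """Keep roughly `density` of entries, favoring large magnitudes, then rescale."""
    ranks = delta.abs().flatten().argsort().argsort().float()  # rank 0 = smallest
    n = ranks.numel()
    # Keep-probabilities centered on `density`; `epsilon` spreads them by rank,
    # so large-magnitude deltas are more likely to survive the sampling.
    probs = density + epsilon * (2.0 * ranks / max(n - 1, 1) - 1.0)
    probs = probs.clamp(0.0, 1.0).reshape(delta.shape)
    mask = torch.bernoulli(probs)
    return delta * mask / probs.clamp_min(1e-8)  # rescale to preserve expectation

def della_linear(base, experts, weights, density=1.0, epsilon=0.015, lam=1.6,
                 normalize=True):
    """Linearly combine magnitude-pruned deltas on top of the base tensor."""
    deltas = [della_prune(e - base, density, epsilon) for e in experts]
    merged = sum(w * d for w, d in zip(weights, deltas))
    if normalize:                 # mirrors `normalize: true` in the config
        merged = merged / sum(weights)
    return base + lam * merged    # `lambda` scales the merged task vector
```

With `density: 1`, as used here, almost nothing is dropped, so the merge behaves close to a lambda-scaled weighted average of the two checkpoints.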

### Models Merged

The following models were included in the merge:
* [MaziyarPanahi/calme-2.4-rys-78B](https://huggingface.co/MaziyarPanahi/calme-2.4-rys-78B)

### Configuration

The following YAML configuration was used to produce this model:

```yaml
merge_method: della_linear
base_model: Sakalti/ultiima-78B
dtype: float16
parameters:
  epsilon: 0.015            # Fine-grain scaling for precision.
  lambda: 1.6               # Strong emphasis on top-performing models.
  normalize: true           # Stable parameter integration across models.
adaptive_merge_parameters:
  task_weights:
    tinyArc: 1.75           # Logical reasoning.
    tinyHellaswag: 1.65     # Contextual predictions.
    tinyMMLU: 1.8           # Domain knowledge.
    tinyTruthfulQA: 2.0     # Prioritize truthful reasoning.
    tinyTruthfulQA_mc1: 1.85
    tinyWinogrande: 1.9     # Advanced reasoning and predictions.
    IFEval: 2.1             # Instruction-following and multitasking.
    BBH: 1.9                # Complex reasoning.
    MATH: 2.3               # Mathematical reasoning.
    GPQA: 2.2               # Graduate-level factual QA.
    MUSR: 2.0               # Multi-step reasoning.
    MMLU-PRO: 2.2           # Domain multitask performance.
  smoothing_factor: 0.1     # Smooth blending across benchmarks.
models:
  - model: MaziyarPanahi/calme-2.4-rys-78B
    parameters:
      weight: 1
      density: 1
  - model: Sakalti/ultiima-78B
    parameters:
      weight: 1
      density: 1
```
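
## Usage

This card does not state the merged model's repository id, so the snippet below is only a minimal inference sketch with Hugging Face Transformers (the declared `library_name`); `model_id` is a placeholder to replace with the actual repo. `device_map="auto"` is assumed because a 78B model typically needs multiple GPUs or offloading, and a chat template is assumed to ship with the tokenizer, as it does for Qwen2.5-based models.

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "namespace/this-merged-model"  # placeholder: substitute the real repo id
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype="auto",   # the merge was produced in float16
    device_map="auto",    # shard the 78B weights across available devices
)

messages = [{"role": "user", "content": "Briefly explain model merging."}]
input_ids = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)
output = model.generate(input_ids, max_new_tokens=256)
print(tokenizer.decode(output[0][input_ids.shape[-1]:], skip_special_tokens=True))
```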