LLENN-v0.69420-Qwen2.5-72b

image/png

Model stock merge for fun. Probably final model mix.
This merge is an answer to people's requests. I really don't wanna do more merges without myself considering to use it.

Models Merged

The following models were included in the merge:

Configuration

The following YAML configuration was used to produce this model:

models:
  - model: EVA-UNIT-01/EVA-Qwen2.5-72B-v0.0
  - model: ZeusLabs/Chronos-Platinum-72B
  - model: anthracite-org/magnum-v4-72b
  - model: abacusai/Dracarys2-72B-Instruct
  - model: rombodawg/Rombos-LLM-V2.5-Qwen-72b
  - model: m8than/banana-2-b-72b

merge_method: model_stock
base_model: Qwen/Qwen2.5-72B
parameters:
  normalize: true
dtype: bfloat16

Prompt Format

ChatML works for the most part.

Sampler Settings

Personally I use the following:

Temp: 1.2
Min P: 0.07
Rep Pen: 1.1

Others have suggested the following:

Temp: 1.1
Top P: 0.98
Min P: 0.05
Downloads last month
34
Safetensors
Model size
72.7B params
Tensor type
BF16
·
Inference Providers NEW
This model is not currently available via any of the supported third-party Inference Providers, and the model is not deployed on the HF Inference API.

Model tree for KaraKaraWitch/LLENN-v0.69420-Qwen2.5-72b