lbgan
/

grpo_3n_r64-b2-ga8-lr3e-06-b10.9-b20.99-wd0.1-wr0.1-ng2-mgn0.1

text-generation-inference

Model card Files Files and versions

grpo_3n_r64-b2-ga8-lr3e-06-b10.9-b20.99-wd0.1-wr0.1-ng2-mgn0.1

Ctrl+K

Ctrl+K

1 contributor

History: 6 commits

lbgan's picture

Upload model trained with Unsloth

2571864 verified about 1 month ago

.gitattributes

1.57 kB

Upload model trained with Unsloth about 1 month ago
README.md

602 Bytes

Upload README.md with huggingface_hub about 1 month ago
adapter_config.json

1.59 kB

Upload model trained with Unsloth about 1 month ago
adapter_model.safetensors

338 MB
xet

Upload model trained with Unsloth about 1 month ago
chat_template.jinja

1.63 kB

Upload model trained with Unsloth about 1 month ago
preprocessor_config.json

1.09 kB

Upload model trained with Unsloth about 1 month ago
processor_config.json

98 Bytes

Upload model trained with Unsloth about 1 month ago
special_tokens_map.json

777 Bytes

Upload model trained with Unsloth about 1 month ago
tokenizer.json

33.4 MB
xet

Upload model trained with Unsloth about 1 month ago
tokenizer.model

4.7 MB
xet

Upload model trained with Unsloth about 1 month ago
tokenizer_config.json

1.2 MB

Upload model trained with Unsloth about 1 month ago