Uploaded finetuned model

Developed by: CompassioninMachineLearning
License: apache-2.0
Finetuned from model : CompassioninMachineLearning/Basellama_plus3kv3_plus20kfinetune2epochs

This llama model was trained 2x faster with Unsloth and Huggingface's TRL library.

Safetensors

Model size

8B params

Tensor type

BF16

Model tree for CompassioninMachineLearning/Basellama_plus3kv3_plus20kfinetune2epochs_plus5kGRPO

Base model

Finetuned

Finetuned

Finetuned

Finetuned

(1)

this model