llama-3.1-8B-grpo / model-00002-of-00004.safetensors

Commit History

Training in progress, step 450
455a725
verified

AlistairPullen commited on

Training in progress, step 150
ea97296
verified

AlistairPullen commited on

Training in progress, step 100
0b2913a
verified

AlistairPullen commited on

Training in progress, step 50
206dd4f
verified

AlistairPullen commited on