Qwen2.5-3B-Instruct-grpo-MATHDATA-E1 / model-00001-of-00002.safetensors

Commit History

Training in progress, epoch 0
0dec13b
verified

chenggong1995 commited on