Qwen2.5-3B-Instruct-grpo-E6-D100-L4096-lr5e7 / model.safetensors.index.json

Commit History

Training in progress, epoch 0
0ecc16a
verified

chenggong1995 commited on