prosecalign
/

clm7b0129-kendall-onof-ofif-corr-max-2-simpo-max1500-decay-sft-beta1.5-gamma0.5-lr5e-6

Generated from Trainer

Inference Endpoints

Model card Files Files and versions Community

clm7b0129-kendall-onof-ofif-corr-max-2-simpo-max1500-decay-sft-beta1.5-gamma0.5-lr5e-6 / checkpoint-300 /rng_state_7.pth

Commit History

Training in progress, step 300, checkpoint

4b5d90e
verified

ziansu commited on 1 day ago