ShenaoZ
/

0.001_3iters_bs128_declr_nodpo_useresponse_iter_1

Text Generation

alignment-handbook

Generated from Trainer

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

0.001_3iters_bs128_declr_nodpo_useresponse_iter_1 / tokenizer.json

ShenaoZ's picture

Training in progress, step 100

a57fa5b verified 11 months ago

history contribute delete

1.8 MB

File too large to display, you can check the raw version instead.