ShenaoZhang
/

0.0001_3iters_bs256_nodpo_full6w_userresponse_iter_1

Text Generation

alignment-handbook

Generated from Trainer

text-generation-inference

Model card Files Files and versions Community

0.0001_3iters_bs256_nodpo_full6w_userresponse_iter_1 / tokenizer.json

ShenaoZhang's picture

Model save

ab5b79a verified 11 months ago

history contribute delete

1.8 MB

File too large to display, you can check the raw version instead.