DeepSeek-R1-Distill-HumanLikeDPO-FineTuned-16bit / pytorch_model-00001-of-00004.bin

Commit History

Trained with Unsloth
c312753
verified

krishanwalia30 commited on