genloop/DeepSeek-R1-Distill-Llama-8B-HSN-more-cot-ft-2000-with-grpo-2000-lora

text-generation-inference

1 contributor

History: 4 commits

Latest commit: eshangujar, "Trained with Unsloth", 0b549b4 (verified), 6 months ago