Uploaded model
- Developed by: williamlcn
- License: apache-2.0
- Finetuned from model : williamlcn/34337_sft_ds_notcot
This qwen2 model was trained 2x faster with Unsloth and Huggingface's TRL library.
- Downloads last month
- 6
Inference Providers
NEW
This model is not currently available via any of the supported Inference Providers.
Model tree for williamlcn/34337_simpo_ds_notcot
Base model
deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B
Finetuned
unsloth/DeepSeek-R1-Distill-Qwen-1.5B
Finetuned
williamlcn/34337_sft_ds_notcot