chinmaydk99/Qwen2.5-0.5B-Open-R1-Distill

chinmaydk99 committed 23 days ago

Commit e34bbfc (verified)

1 Parent(s): 7f5db27

Update README.md

Files changed (1)

README.md +1 -1

@@ -12,7 +12,7 @@ licence: license
 # Model Card for Qwen2.5-0.5B-Open-R1-Distill
 This model is a fine-tuned version of [Qwen/Qwen2.5-0.5B-Instruct](https://huggingface.co/Qwen/Qwen2.5-0.5B-Instruct).
-It has been trained using [TRL](https://github.com/huggingface/trl).
+It has been trained using [TRL](https://github.com/huggingface/trl). You can use it if you're looking for a model which as a cold start has been pretrained on some CoT data.
 ## Quick start