classtag/20250215081122


classtag committed on Feb 15

Commit dbb8748 · verified · 1 Parent(s): 79a98a6

End of training

Files changed (1)

README.md +4 -3

@@ -1,7 +1,8 @@
 ---
 base_model: Qwen/Qwen2-0.5B-Instruct
+datasets: AI-MO/NuminaMath-TIR
 library_name: transformers
-model_name: '20250215081122'
+model_name: Qwen2-0.5B-GRPO-NuminaMath
 tags:
 - generated_from_trainer
 - trl
@@ -9,9 +10,9 @@ tags:
 licence: license
 ---
-# Model Card for 20250215081122
-This model is a fine-tuned version of [Qwen/Qwen2-0.5B-Instruct](https://huggingface.co/Qwen/Qwen2-0.5B-Instruct).
+# Model Card for Qwen2-0.5B-GRPO-NuminaMath
+This model is a fine-tuned version of [Qwen/Qwen2-0.5B-Instruct](https://huggingface.co/Qwen/Qwen2-0.5B-Instruct) on the [AI-MO/NuminaMath-TIR](https://huggingface.co/datasets/AI-MO/NuminaMath-TIR) dataset.
 It has been trained using [TRL](https://github.com/huggingface/trl).
 ## Quick start