---
license: apache-2.0
language:
- en
- zh
base_model:
- prithivMLmods/SmolLM2_135M_Grpo_Gsm8k
pipeline_tag: text-generation
library_name: transformers
tags:
- Grpo
- text-generation-inference
- Llama
- trl
---
![d9-mAgyravvwWXZGi3sK5.png](https://cdn-uploads.huggingface.co/production/uploads/65bb837dbfb878f46c77de4c/jTUNV5nFY_tyhYQM-zeXl.png)