--- license: apache-2.0 language: - en - zh base_model: - HuggingFaceTB/SmolLM2-360M-Instruct pipeline_tag: text-generation library_name: transformers tags: - Grpo - text-generation-inference - Llama - trl --- ![d9-mAgyravvwWXZGi3sK5.png](https://cdn-uploads.huggingface.co/production/uploads/65bb837dbfb878f46c77de4c/jTUNV5nFY_tyhYQM-zeXl.png)