Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Rustamshry
/
HeisenbergQ-0.5B-RL
like
1
Text Generation
PEFT
Safetensors
Transformers
jilp00/YouToks-Instruct-Quantum-Physics-II
English
trl
physics
unsloth
grpo
conversational
License:
mit
Model card
Files
Files and versions
xet
Community
Use this model
Rustamshry
commited on
9 days ago
Commit
b82bf86
·
verified
·
1 Parent(s):
0cea297
Update README.md
Browse files
Files changed (1)
hide
show
README.md
+1
-1
README.md
CHANGED
Viewed
@@ -13,7 +13,7 @@ tags:
13
- unsloth
14
- transformers
15
- grpo
16
-
---
17
18
# Model Card for HeisenbergQ-0.5B
19
13
- unsloth
14
- transformers
15
- grpo
16
+
---
17
18
# Model Card for HeisenbergQ-0.5B
19