davidoj01/unsloth-phi-4-Instruct-LORA-Open-R1-Code-GRPO-b2-as8

Text Generation

Generated from Trainer

text-generation-inference

1.52 kB

1 contributor

History: 1 commit

initial commit

d33ee17 verified 8 months ago

.gitattributes    1.52 kB    initial commit    8 months ago