lightshifted/llama-3.1-8B-GRPO-rag-rewards

Text Generation

text-generation-inference

Commit History

Trained with Unsloth

619572c
verified

lightshifted committed 26 days ago

Trained with Unsloth

d19329e
verified

lightshifted committed about 1 month ago

Trained with Unsloth

a4afbd3
verified

lightshifted committed on Mar 3

Upload tokenizer

399850a
verified

lightshifted committed on Mar 3

Upload README.md with huggingface_hub

318a641
verified

lightshifted committed on Mar 3

initial commit

bc7d8c8
verified

lightshifted committed on Mar 3