Robust-Decoding/gemma2-2b-it-hh-grpo-harmless-step100
Tags: Text Generation, Transformers, Safetensors, gemma2, text-generation-inference, Inference Endpoints, arxiv:1910.09700
Branch: main
1 contributor
History: 2 commits
Latest commit: Upload Gemma2ForCausalLM (36d9e68, verified) by WillBankes, 18 days ago
File                              Size       Tags  Last commit               Updated
.gitattributes                    1.52 kB    Safe  initial commit            18 days ago
README.md                         5.17 kB    Safe  Upload Gemma2ForCausalLM  18 days ago
config.json                       908 Bytes  -     Upload Gemma2ForCausalLM  18 days ago
generation_config.json            187 Bytes  Safe  Upload Gemma2ForCausalLM  18 days ago
model-00001-of-00002.safetensors  4.99 GB    LFS   Upload Gemma2ForCausalLM  18 days ago
model-00002-of-00002.safetensors  241 MB     LFS   Upload Gemma2ForCausalLM  18 days ago
model.safetensors.index.json      24.2 kB    Safe  Upload Gemma2ForCausalLM  18 days ago
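
The weights in this listing are split across two safetensors shards, with model.safetensors.index.json recording which shard holds each tensor. The sketch below shows how such an index can be read and grouped by shard. It is a minimal illustration, not the repository's actual index: the tensor names and the total_size value are hypothetical placeholders, and only the two shard filenames come from the listing above. The "metadata" and "weight_map" keys follow the usual layout of sharded safetensors checkpoints.

```python
import json
from collections import defaultdict

# Hypothetical, heavily shortened index. The real 24.2 kB file lists every
# tensor in the model; the tensor names and total_size here are placeholders.
INDEX_TEXT = """
{
  "metadata": {"total_size": 123},
  "weight_map": {
    "model.embed_tokens.weight": "model-00001-of-00002.safetensors",
    "model.layers.0.self_attn.q_proj.weight": "model-00001-of-00002.safetensors",
    "model.norm.weight": "model-00002-of-00002.safetensors"
  }
}
"""


def tensors_by_shard(index):
    """Group tensor names by the shard file that stores them."""
    shards = defaultdict(list)
    for tensor_name, shard_file in index["weight_map"].items():
        shards[shard_file].append(tensor_name)
    return dict(shards)


index = json.loads(INDEX_TEXT)
shards = tensors_by_shard(index)

# Print each shard with the number of tensors it holds.
for shard_file in sorted(shards):
    print(shard_file, len(shards[shard_file]))
```

Loaders that support sharded checkpoints consult this mapping so that each tensor is read from the correct shard file; to inspect the real mapping, the same function can be applied to the downloaded model.safetensors.index.json.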