Ray2333/gpt2-large-harmless-reward_model
Text Classification, Transformers, Safetensors, Anthropic/hh-rlhf, gpt2, text-generation-inference, Inference Endpoints
License: mit
gpt2-large-harmless-reward_model/README.md
Ray2333: Update README.md (d59cbec, verified, about 1 year ago)
70 Bytes
metadata
---
license: mit
datasets:
- Anthropic/hh-rlhf
metrics:
- accuracy
---