Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
mnoukhov
/
pythia1b-rm-tldr6.9b
like
0
Text Classification
Transformers
Safetensors
gpt_neox
trl
reward-trainer
Generated from Trainer
text-generation-inference
Inference Endpoints
License:
apache-2.0
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
main
pythia1b-rm-tldr6.9b
Commit History
Model save
4208682
verified
mnoukhov
commited on
Jul 3, 2024
Training in progress, step 1164, checkpoint
aa5ea6e
verified
mnoukhov
commited on
Jul 3, 2024
Training in progress, step 1164
b5c84f5
verified
mnoukhov
commited on
Jul 3, 2024
Training in progress, step 873
9b9258f
verified
mnoukhov
commited on
Jul 3, 2024
Training in progress, step 582, checkpoint
d7a9bf4
verified
mnoukhov
commited on
Jul 3, 2024
Training in progress, step 582
188e468
verified
mnoukhov
commited on
Jul 3, 2024
Training in progress, step 291, checkpoint
b453c8d
verified
mnoukhov
commited on
Jul 3, 2024
Training in progress, step 291
bc5525e
verified
mnoukhov
commited on
Jul 3, 2024
initial commit
03ed7b8
verified
mnoukhov
commited on
Jul 3, 2024