YAML Metadata
Warning:
empty or missing yaml metadata in repo card
(https://huggingface.co/docs/hub/model-cards#model-card-metadata)
Quantization made by Richard Erkhov.
eleuther-pythia70m-hh-dpo - AWQ
- Model creator: https://huggingface.co/lomahony/
- Original model: https://huggingface.co/lomahony/eleuther-pythia70m-hh-dpo/
Original model description:
language: - en tags: - pytorch - causal-lm - pythia license: apache-2.0 datasets: - Anthropic/hh-rlhf
Pythia-70m supervised finetuned with Anthropic-hh-rlhf dataset for 1 epoch (sft-model), before DPO (paper) with same dataset for 1 epoch.
Benchmark evaluations included in repo done using lm-evaluation-harness.
See Pythia-70m for original model details (paper).
- Downloads last month
- 5
Inference Providers
NEW
This model is not currently available via any of the supported Inference Providers.
The model cannot be deployed to the HF Inference API:
The model has no library tag.