ProofGPT-v0.1

Model Description

ProofGPT-v0.1 is a 1.3B-parameter language model based on the GPT-NeoX architecture and trained on the proof-pile (v1.1). We initialized training from the pythia-1.3b weights, a precursor to the pythia-1.4b model with roughly equivalent performance.
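
Since the model uses a standard causal-LM architecture, it should be loadable with the Hugging Face transformers library. The snippet below is a minimal usage sketch assuming the usual AutoModelForCausalLM interface; the prompt text is purely illustrative.

```python
# Minimal usage sketch, assuming the standard transformers causal-LM API.
from transformers import AutoTokenizer, AutoModelForCausalLM

tokenizer = AutoTokenizer.from_pretrained("hoskinson-center/proofGPT-v0.1")
model = AutoModelForCausalLM.from_pretrained("hoskinson-center/proofGPT-v0.1")

# Illustrative prompt; proof-pile is mathematical text, so a theorem-style
# prompt is a natural fit.
prompt = "Theorem. The sum of two even integers is even.\nProof."
inputs = tokenizer(prompt, return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=64, do_sample=False)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```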

Detailed evaluations coming soon :)

Note: Commit 3bcdc4e replaced the weights with a model trained on proof-pile v1.1; previous commits were trained on v1.0. Commit 9695b51 updated the tokenizer to include bos, eos, and unk tokens.
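
To verify that you have the post-9695b51 tokenizer, you can inspect its special tokens. A quick check, again assuming the standard AutoTokenizer interface:

```python
# Quick check that the tokenizer carries the special tokens added in 9695b51.
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("hoskinson-center/proofGPT-v0.1")
print(tokenizer.bos_token, tokenizer.eos_token, tokenizer.unk_token)
```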
