java_and_text_gpt2

This model is a fine-tuned version of openai-community/gpt2-medium on an unknown dataset. It achieves the following results on the evaluation set:

Model description

More information needed

More information needed

More information needed

The following hyperparameters were used during training:

Training Loss	Epoch	Step	Validation Loss	Accuracy	Input Tokens Seen
No log	0	0	11.0141	0.1944	0
2.3634	0.6173	50	2.2269	0.1944	409600
2.003	1.2346	100	1.7977	0.2222	819200
1.7323	1.8519	150	1.7828	0.2222	1228800
1.6991	2.4691	200	1.7454	0.1806	1638400

Safetensors

Model size

0.4B params

Tensor type

F32

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Base model

Finetuned

(130)

this model