gpt2-xl-lora-multi-512-2 / train_results.json
MHGanainy's picture
MHGanainy/gpt2-xl-lora-multi-512-2
3329f0f verified
raw
history blame contribute delete
207 Bytes
{
"epoch": 1.0,
"total_flos": 1.59124040841796e+18,
"train_loss": 2.4629864172184424,
"train_runtime": 3399.9284,
"train_samples_per_second": 51.387,
"train_steps_per_second": 3.212
}