Built with Axolotl

cb93b276-e076-44fd-9fd8-90e3e518f4b2

This model is a fine-tuned version of unsloth/Phi-3.5-mini-instruct on an unspecified dataset. It achieves the following results on the evaluation set:

  • Loss: 12.6055
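
Since this is a PEFT adapter trained on top of unsloth/Phi-3.5-mini-instruct, it can be used by attaching the adapter to the base model. The snippet below is a minimal, untested loading sketch; it assumes the adapter repo id listed on this page and that the adapter is stored in standard PEFT format, and the dtype and generation settings may need adjusting for your hardware.

```python
# Minimal sketch: load the base model, attach this PEFT adapter, and generate.
# Assumptions: adapter repo id as listed on this page; bfloat16 is supported
# on your device (drop torch_dtype or move the model to GPU as needed).
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import PeftModel

base_id = "unsloth/Phi-3.5-mini-instruct"
adapter_id = "lesso/cb93b276-e076-44fd-9fd8-90e3e518f4b2"

tokenizer = AutoTokenizer.from_pretrained(base_id)
base = AutoModelForCausalLM.from_pretrained(base_id, torch_dtype=torch.bfloat16)
model = PeftModel.from_pretrained(base, adapter_id)

inputs = tokenizer("Hello, how are you?", return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```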

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training (a configuration sketch follows this list):

  • learning_rate: 0.0001001
  • train_batch_size: 4
  • eval_batch_size: 4
  • seed: 42
  • gradient_accumulation_steps: 2
  • total_train_batch_size: 8
  • optimizer: ADAMW_BNB with betas=(0.9, 0.999) and epsilon=1e-08; optimizer_args: adam_beta1=0.9, adam_beta2=0.95, adam_epsilon=1e-5
  • lr_scheduler_type: linear
  • lr_scheduler_warmup_steps: 10
  • training_steps: 200
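
For orientation, here is a rough sketch of how these settings could be expressed with Hugging Face TrainingArguments. The run itself was produced with Axolotl, so the output path, the exact argument names, and anything not listed above are assumptions rather than the original configuration.

```python
# Approximate mapping of the reported hyperparameters to TrainingArguments.
# This is not the original Axolotl config; output_dir and unlisted options are assumptions.
from transformers import TrainingArguments

args = TrainingArguments(
    output_dir="outputs",              # assumption: not reported in the card
    learning_rate=1.001e-4,
    per_device_train_batch_size=4,
    per_device_eval_batch_size=4,
    gradient_accumulation_steps=2,     # effective train batch size: 4 * 2 = 8
    seed=42,
    optim="adamw_bnb_8bit",            # ADAMW_BNB, per the optimizer line above
    adam_beta1=0.9,
    adam_beta2=0.95,
    adam_epsilon=1e-5,
    lr_scheduler_type="linear",
    warmup_steps=10,
    max_steps=200,
)
```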

Training results

| Training Loss | Epoch  | Step | Validation Loss |
|:-------------:|:------:|:----:|:---------------:|
| 15.0473       | 0.0002 | 1    | 13.6138         |
| 7.2455        | 0.0076 | 50   | 13.5470         |
| 7.872         | 0.0152 | 100  | 13.0380         |
| 7.3184        | 0.0227 | 150  | 12.9848         |
| 5.7976        | 0.0303 | 200  | 12.6055         |

Framework versions

  • PEFT 0.13.2
  • Transformers 4.46.0
  • Pytorch 2.5.0+cu124
  • Datasets 3.0.1
  • Tokenizers 0.20.1
