agenttuning_v2_15k_tag5

This model is a fine-tuned version of Qwen/Qwen2.5-7B on the agenttuning_v2_15k_tag5 dataset. It achieves the following results on the evaluation set:

Loss: 0.4974

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

learning_rate: 5e-06
train_batch_size: 1
eval_batch_size: 1
seed: 42
distributed_type: multi-GPU
num_devices: 4
total_train_batch_size: 4
total_eval_batch_size: 4
optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
lr_scheduler_type: cosine
num_epochs: 1

Training results

Training Loss	Epoch	Step	Validation Loss
0.7388	0.0281	100	0.6871
0.777	0.0562	200	0.6474
0.4864	0.0842	300	0.6196
0.3694	0.1123	400	0.6075
0.4812	0.1404	500	0.6115
0.3701	0.1685	600	0.6015
0.5908	0.1966	700	0.5969
0.4855	0.2247	800	0.5896
0.5579	0.2527	900	0.5785
0.4595	0.2808	1000	0.5758
0.5582	0.3089	1100	0.5599
0.5098	0.3370	1200	0.5567
0.5145	0.3651	1300	0.5483
0.4158	0.3931	1400	0.5537
0.4482	0.4212	1500	0.5465
0.4242	0.4493	1600	0.5418
0.4617	0.4774	1700	0.5373
0.4453	0.5055	1800	0.5287
0.5564	0.5336	1900	0.5272
0.4766	0.5616	2000	0.5178
0.5296	0.5897	2100	0.5196
0.4192	0.6178	2200	0.5167
0.4948	0.6459	2300	0.5122
0.7306	0.6740	2400	0.5103
0.3524	0.7020	2500	0.5115
0.4697	0.7301	2600	0.5072
0.3702	0.7582	2700	0.5064
0.3974	0.7863	2800	0.5037
0.4755	0.8144	2900	0.5012
0.4405	0.8425	3000	0.5000
0.4119	0.8705	3100	0.4984
0.391	0.8986	3200	0.4986
0.4877	0.9267	3300	0.4984
0.4928	0.9548	3400	0.4977
0.3771	0.9829	3500	0.4973

Framework versions

Transformers 4.46.1
Pytorch 2.6.0+cu124
Datasets 3.1.0
Tokenizers 0.20.3

lemonhat
/

Qwen2.5-7B-agenttuning_v2_15k_tag5

agenttuning_v2_15k_tag5

Model description

Intended uses & limitations

Training and evaluation data

Training procedure

Training hyperparameters

Training results

Framework versions

Model tree for lemonhat/Qwen2.5-7B-agenttuning_v2_15k_tag5

Evaluation results