agenttuning_v2_15k_tag5

This model is a fine-tuned version of Qwen/Qwen2.5-7B on the agenttuning_v2_15k_tag5 dataset. It achieves the following results on the evaluation set:

  • Loss: 0.4974
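The card does not show how to run the checkpoint, so the following is a minimal loading sketch rather than the author's documented usage. It assumes the repository id shown in the model tree (`lemonhat/Qwen2.5-7B-agenttuning_v2_15k_tag5`), the standard Transformers causal-LM classes, and that `accelerate` is installed for `device_map="auto"`. The helper name `generate`, the plain-text prompt, and the `max_new_tokens` default are illustrative assumptions; the card does not specify a prompt format.

```python
# Repository id taken from the model tree on this card.
MODEL_ID = "lemonhat/Qwen2.5-7B-agenttuning_v2_15k_tag5"


def generate(prompt: str, max_new_tokens: int = 128) -> str:
    """Load the checkpoint and return a completion for a plain-text prompt.

    Hypothetical helper for illustration; no prompt template is assumed.
    """
    # Heavy dependencies are imported lazily so that importing this module
    # does not require torch or transformers to be installed.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = AutoModelForCausalLM.from_pretrained(
        MODEL_ID,
        torch_dtype=torch.bfloat16,  # the card lists BF16 weights
        device_map="auto",           # assumption: accelerate is available
    )

    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    with torch.no_grad():
        output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)

    # Decode only the newly generated tokens, not the echoed prompt.
    new_tokens = output_ids[0][inputs["input_ids"].shape[1]:]
    return tokenizer.decode(new_tokens, skip_special_tokens=True)
```

Calling `generate("...")` downloads roughly 15 GB of BF16 weights on first use, so a GPU with enough memory (or CPU offloading) is needed.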

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 5e-06
  • train_batch_size: 1
  • eval_batch_size: 1
  • seed: 42
  • distributed_type: multi-GPU
  • num_devices: 4
  • total_train_batch_size: 4
  • total_eval_batch_size: 4
  • optimizer: adamw_torch with betas=(0.9, 0.999) and epsilon=1e-08 (no additional optimizer arguments)
  • lr_scheduler_type: cosine
  • num_epochs: 1
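
The listed values imply two small calculations, sketched below: the total batch size, and the shape of a cosine learning-rate schedule. This is an illustration under stated assumptions, not the training script: gradient accumulation is taken to be 1 (it is not listed, but 1 × 4 devices matches the reported total of 4), warmup is taken to be 0 (not listed), and the schedule is assumed to decay to zero over a half cosine period. The `total_steps` value in the usage note is hypothetical.

```python
import math

PER_DEVICE_TRAIN_BATCH_SIZE = 1
NUM_DEVICES = 4
GRADIENT_ACCUMULATION_STEPS = 1  # assumption: not listed in the card
BASE_LR = 5e-6


def total_train_batch_size(per_device: int, num_devices: int, grad_accum: int) -> int:
    """Number of examples consumed per optimizer step across all devices."""
    return per_device * num_devices * grad_accum


def cosine_lr(step: int, total_steps: int, base_lr: float = BASE_LR,
              warmup_steps: int = 0) -> float:
    """Cosine decay from base_lr to 0, with optional linear warmup.

    warmup_steps=0 is an assumption; the card does not list a warmup setting.
    """
    if step < warmup_steps:
        return base_lr * step / max(1, warmup_steps)
    progress = (step - warmup_steps) / max(1, total_steps - warmup_steps)
    progress = min(1.0, progress)  # clamp so steps past the end stay at 0
    return base_lr * 0.5 * (1.0 + math.cos(math.pi * progress))
```

For example, with a hypothetical `total_steps=1000`, `cosine_lr(0, 1000)` returns the full 5e-06, `cosine_lr(500, 1000)` returns about half of it, and `cosine_lr(1000, 1000)` returns 0.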

Training results

Training Loss   Epoch    Step   Validation Loss
0.7388          0.0281   100    0.6871
0.777           0.0562   200    0.6474
0.4864          0.0842   300    0.6196
0.3694          0.1123   400    0.6075
0.4812          0.1404   500    0.6115
0.3701          0.1685   600    0.6015
0.5908          0.1966   700    0.5969
0.4855          0.2247   800    0.5896
0.5579          0.2527   900    0.5785
0.4595          0.2808   1000   0.5758
0.5582          0.3089   1100   0.5599
0.5098          0.3370   1200   0.5567
0.5145          0.3651   1300   0.5483
0.4158          0.3931   1400   0.5537
0.4482          0.4212   1500   0.5465
0.4242          0.4493   1600   0.5418
0.4617          0.4774   1700   0.5373
0.4453          0.5055   1800   0.5287
0.5564          0.5336   1900   0.5272
0.4766          0.5616   2000   0.5178
0.5296          0.5897   2100   0.5196
0.4192          0.6178   2200   0.5167
0.4948          0.6459   2300   0.5122
0.7306          0.6740   2400   0.5103
0.3524          0.7020   2500   0.5115
0.4697          0.7301   2600   0.5072
0.3702          0.7582   2700   0.5064
0.3974          0.7863   2800   0.5037
0.4755          0.8144   2900   0.5012
0.4405          0.8425   3000   0.5000
0.4119          0.8705   3100   0.4984
0.391           0.8986   3200   0.4986
0.4877          0.9267   3300   0.4984
0.4928          0.9548   3400   0.4977
0.3771          0.9829   3500   0.4973
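
The Epoch and Step columns together hint at the length of one epoch, which the card does not state. The sketch below searches for the steps-per-epoch value consistent with a few logged rows. It assumes the logged epoch equals step divided by steps per epoch, rounded to four decimals, and that each optimizer step consumes the reported total batch of 4 examples. The resulting example count is therefore an estimate, not a figure from the card.

```python
import math

# (step, logged epoch) pairs copied from the training results table.
LOGGED_ROWS = [(100, 0.0281), (1000, 0.2808), (3500, 0.9829)]
TOTAL_TRAIN_BATCH_SIZE = 4  # from the hyperparameter list


def consistent_steps_per_epoch(rows, low=3000, high=4000):
    """Return every integer N in [low, high] where round(step / N, 4)
    reproduces the logged epoch for all rows."""
    return [
        n for n in range(low, high + 1)
        if all(math.isclose(round(step / n, 4), epoch, abs_tol=1e-9)
               for step, epoch in rows)
    ]


candidates = consistent_steps_per_epoch(LOGGED_ROWS)
print(candidates)  # → [3561]

# Approximate number of training examples seen in the single epoch.
approx_examples = candidates[0] * TOTAL_TRAIN_BATCH_SIZE
print(approx_examples)  # → 14244
```

Under these assumptions one epoch is 3561 optimizer steps, or roughly 14.2k training examples, which is in line with the "15k" in the dataset name if part of the data was held out for evaluation.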

Framework versions

  • Transformers 4.46.1
  • PyTorch 2.6.0+cu124
  • Datasets 3.1.0
  • Tokenizers 0.20.3

Model size: 7.62B params (Safetensors)
Tensor type: BF16

Model tree for lemonhat/Qwen2.5-7B-agenttuning_v2_15k_tag5

Base model: Qwen/Qwen2.5-7B (this model is a fine-tune of it)