End of training

Files changed (4) hide show

README.md CHANGED Viewed

@@ -18,8 +18,8 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [Qwen/Qwen2.5-1.5B](https://huggingface.co/Qwen/Qwen2.5-1.5B) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.7776
-- Accuracy: 0.5500
 ## Model description
@@ -38,7 +38,7 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 5e-06
 - train_batch_size: 4
 - eval_batch_size: 8
 - seed: 42
@@ -48,13 +48,17 @@ The following hyperparameters were used during training:
 - total_eval_batch_size: 16
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 1
 ### Training results
-| Training Loss | Epoch | Step | Validation Loss | Accuracy |
-|:-------------:|:-----:|:----:|:---------------:|:--------:|
-| 0.6501        | 1.0   | 5310 | 0.7776          | 0.5500   |
 ### Framework versions

 This model is a fine-tuned version of [Qwen/Qwen2.5-1.5B](https://huggingface.co/Qwen/Qwen2.5-1.5B) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.8474
+- Accuracy: 0.5493
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 0.0003
 - train_batch_size: 4
 - eval_batch_size: 8
 - seed: 42
 - total_eval_batch_size: 16
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- num_epochs: 5
 ### Training results
+| Training Loss | Epoch | Step  | Validation Loss | Accuracy |
+|:-------------:|:-----:|:-----:|:---------------:|:--------:|
+| 0.5645        | 1.0   | 5310  | 0.7363          | 0.5588   |
+| 0.6449        | 2.0   | 10620 | 0.7377          | 0.5521   |
+| 0.6083        | 3.0   | 15930 | 0.7829          | 0.5561   |
+| 0.6265        | 4.0   | 21240 | 0.7739          | 0.5490   |
+| 0.4989        | 5.0   | 26550 | 0.8474          | 0.5493   |
 ### Framework versions

adapter_config.json CHANGED Viewed

@@ -24,8 +24,8 @@
   "rank_pattern": {},
   "revision": null,
   "target_modules": [
-    "v_proj",
-    "q_proj"
   ],
   "task_type": "SEQ_CLS",
   "use_dora": false,

   "rank_pattern": {},
   "revision": null,
   "target_modules": [
+    "q_proj",
+    "v_proj"
   ],
   "task_type": "SEQ_CLS",
   "use_dora": false,

adapter_model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:78651619e89c321327795d9ddbbb0c45cad17ab3672ef81d976b780075ba465b
 size 8737368

 version https://git-lfs.github.com/spec/v1
+oid sha256:e64217fa491ed5ef53306cedb2db5a7207cd984c928dd22f3a4ce68c8b1d6d7b
 size 8737368

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:889640abc330ffd87b3503c4d12b345e7c1eb1d939a3eda63b38ea2d5e6f593d
 size 4728

 version https://git-lfs.github.com/spec/v1
+oid sha256:4fc7f78ac48ac363b0489cb03a5f8ab2fa4f68282043c3ca2125bee226a1e5e0
 size 4728