rngusry committed on
Commit 12b24e9 · verified · 1 Parent(s): 25d6c40

End of training

README.md CHANGED
@@ -18,8 +18,8 @@ should probably proofread and complete it, then remove this comment. -->
 
 This model is a fine-tuned version of [Qwen/Qwen2.5-1.5B](https://huggingface.co/Qwen/Qwen2.5-1.5B) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.7776
-- Accuracy: 0.5500
+- Loss: 0.8474
+- Accuracy: 0.5493
 
 ## Model description
 
@@ -38,7 +38,7 @@ More information needed
 ### Training hyperparameters
 
 The following hyperparameters were used during training:
-- learning_rate: 5e-06
+- learning_rate: 0.0003
 - train_batch_size: 4
 - eval_batch_size: 8
 - seed: 42
@@ -48,13 +48,17 @@ The following hyperparameters were used during training:
 - total_eval_batch_size: 16
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 1
+- num_epochs: 5
 
 ### Training results
 
-| Training Loss | Epoch | Step | Validation Loss | Accuracy |
-|:-------------:|:-----:|:----:|:---------------:|:--------:|
-| 0.6501 | 1.0 | 5310 | 0.7776 | 0.5500 |
+| Training Loss | Epoch | Step | Validation Loss | Accuracy |
+|:-------------:|:-----:|:-----:|:---------------:|:--------:|
+| 0.5645 | 1.0 | 5310 | 0.7363 | 0.5588 |
+| 0.6449 | 2.0 | 10620 | 0.7377 | 0.5521 |
+| 0.6083 | 3.0 | 15930 | 0.7829 | 0.5561 |
+| 0.6265 | 4.0 | 21240 | 0.7739 | 0.5490 |
+| 0.4989 | 5.0 | 26550 | 0.8474 | 0.5493 |
 
 
 ### Framework versions
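
A note on reading the new results table: the headline `Loss: 0.8474` and `Accuracy: 0.5493` match the final (epoch 5) row, while the lowest validation loss in the table is at epoch 1. The sketch below is only a rough illustration of that comparison. The rows are copied by hand from the table in this commit, and the helper name `best_by_validation_loss` is hypothetical, not part of any library.

```python
# Rows copied from the "Training results" table added in this commit:
# (training_loss, epoch, step, validation_loss, accuracy)
rows = [
    (0.5645, 1.0, 5310, 0.7363, 0.5588),
    (0.6449, 2.0, 10620, 0.7377, 0.5521),
    (0.6083, 3.0, 15930, 0.7829, 0.5561),
    (0.6265, 4.0, 21240, 0.7739, 0.5490),
    (0.4989, 5.0, 26550, 0.8474, 0.5493),
]


def best_by_validation_loss(table):
    """Return the row with the smallest validation loss (hypothetical helper)."""
    return min(table, key=lambda row: row[3])


best = best_by_validation_loss(rows)
final = rows[-1]

# Steps per epoch implied by the first row; every later row is a multiple of it.
steps_per_epoch = rows[0][2]

print(f"best epoch by validation loss: {best[1]} (loss {best[3]})")
print(f"final epoch: {final[1]} (loss {final[3]})")
print(f"validation loss increase, best to final: {final[3] - best[3]:.4f}")
print(f"steps per epoch: {steps_per_epoch}")
```

Falling training loss alongside rising validation loss is a pattern often associated with overfitting, though the table alone does not establish the cause.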
adapter_config.json CHANGED
@@ -24,8 +24,8 @@
   "rank_pattern": {},
   "revision": null,
   "target_modules": [
-    "v_proj",
-    "q_proj"
+    "q_proj",
+    "v_proj"
   ],
   "task_type": "SEQ_CLS",
   "use_dora": false,
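
The `adapter_config.json` hunk appears to change only the order of the two entries in `target_modules`. A minimal check of that reading follows, assuming the order of module names is not significant to the adapter (an assumption here, not something this diff states). The function name `same_modules` is hypothetical.

```python
# Values copied from the removed (-) and added (+) lines of the hunk.
old_target_modules = ["v_proj", "q_proj"]
new_target_modules = ["q_proj", "v_proj"]


def same_modules(old, new):
    """True when both lists name the same modules, ignoring order (hypothetical helper)."""
    return set(old) == set(new)


print(same_modules(old_target_modules, new_target_modules))
```

If the check holds, the substantive changes in this commit are the retrained weights in `adapter_model.safetensors` and the new hyperparameters, not the set of adapted projections.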
adapter_model.safetensors CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:78651619e89c321327795d9ddbbb0c45cad17ab3672ef81d976b780075ba465b
+oid sha256:e64217fa491ed5ef53306cedb2db5a7207cd984c928dd22f3a4ce68c8b1d6d7b
 size 8737368
training_args.bin CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:889640abc330ffd87b3503c4d12b345e7c1eb1d939a3eda63b38ea2d5e6f593d
+oid sha256:4fc7f78ac48ac363b0489cb03a5f8ab2fa4f68282043c3ca2125bee226a1e5e0
 size 4728