bobbyw committed on
Commit 71e64b2 · verified · 1 Parent(s): 482b258

End of training

Files changed (2)
  1. README.md +30 -13
  2. model.safetensors +1 -1
README.md CHANGED
@@ -20,10 +20,10 @@ should probably proofread and complete it, then remove this comment. -->
 
  This model is a fine-tuned version of [microsoft/deberta-v3-small](https://huggingface.co/microsoft/deberta-v3-small) on an unknown dataset.
  It achieves the following results on the evaluation set:
- - Loss: 0.0139
- - Accuracy: 0.0014
- - F1: 0.0027
- - Precision: 0.0014
+ - Loss: 0.0374
+ - Accuracy: 0.0069
+ - F1: 0.0137
+ - Precision: 0.0069
  - Recall: 1.0
 
  ## Model description
@@ -43,26 +43,43 @@ More information needed
  ### Training hyperparameters
 
  The following hyperparameters were used during training:
- - learning_rate: 2e-05
+ - learning_rate: 0.002
  - train_batch_size: 16
  - eval_batch_size: 16
  - seed: 42
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  - lr_scheduler_type: linear
- - num_epochs: 3
+ - num_epochs: 20
 
  ### Training results
 
  | Training Loss | Epoch | Step | Validation Loss | Accuracy | F1 | Precision | Recall |
  |:-------------:|:-----:|:----:|:---------------:|:--------:|:------:|:---------:|:------:|
- | No log | 1.0 | 172 | 0.0372 | 0.0014 | 0.0027 | 0.0014 | 1.0 |
- | No log | 2.0 | 344 | 0.0159 | 0.0014 | 0.0027 | 0.0014 | 1.0 |
- | 0.1035 | 3.0 | 516 | 0.0139 | 0.0014 | 0.0027 | 0.0014 | 1.0 |
+ | No log | 1.0 | 39 | 0.0447 | 0.0069 | 0.0137 | 0.0069 | 1.0 |
+ | No log | 2.0 | 78 | 0.0401 | 0.0069 | 0.0137 | 0.0069 | 1.0 |
+ | No log | 3.0 | 117 | 0.0401 | 0.0069 | 0.0137 | 0.0069 | 1.0 |
+ | No log | 4.0 | 156 | 0.0395 | 0.0069 | 0.0137 | 0.0069 | 1.0 |
+ | No log | 5.0 | 195 | 0.0379 | 0.0069 | 0.0137 | 0.0069 | 1.0 |
+ | No log | 6.0 | 234 | 0.0374 | 0.0069 | 0.0137 | 0.0069 | 1.0 |
+ | No log | 7.0 | 273 | 0.0383 | 0.0069 | 0.0137 | 0.0069 | 1.0 |
+ | No log | 8.0 | 312 | 0.0378 | 0.0069 | 0.0137 | 0.0069 | 1.0 |
+ | No log | 9.0 | 351 | 0.0384 | 0.0069 | 0.0137 | 0.0069 | 1.0 |
+ | No log | 10.0 | 390 | 0.0371 | 0.0069 | 0.0137 | 0.0069 | 1.0 |
+ | No log | 11.0 | 429 | 0.0378 | 0.0069 | 0.0137 | 0.0069 | 1.0 |
+ | No log | 12.0 | 468 | 0.0371 | 0.0069 | 0.0137 | 0.0069 | 1.0 |
+ | 0.037 | 13.0 | 507 | 0.0373 | 0.0069 | 0.0137 | 0.0069 | 1.0 |
+ | 0.037 | 14.0 | 546 | 0.0380 | 0.0069 | 0.0137 | 0.0069 | 1.0 |
+ | 0.037 | 15.0 | 585 | 0.0390 | 0.0069 | 0.0137 | 0.0069 | 1.0 |
+ | 0.037 | 16.0 | 624 | 0.0373 | 0.0069 | 0.0137 | 0.0069 | 1.0 |
+ | 0.037 | 17.0 | 663 | 0.0376 | 0.0069 | 0.0137 | 0.0069 | 1.0 |
+ | 0.037 | 18.0 | 702 | 0.0371 | 0.0069 | 0.0137 | 0.0069 | 1.0 |
+ | 0.037 | 19.0 | 741 | 0.0375 | 0.0069 | 0.0137 | 0.0069 | 1.0 |
+ | 0.037 | 20.0 | 780 | 0.0374 | 0.0069 | 0.0137 | 0.0069 | 1.0 |
 
 
  ### Framework versions
 
- - Transformers 4.35.2
- - Pytorch 2.1.0+cu121
- - Datasets 2.17.0
- - Tokenizers 0.15.1
+ - Transformers 4.40.0
+ - Pytorch 2.2.1+cu121
+ - Datasets 2.19.0
+ - Tokenizers 0.19.1
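
Review note on the README.md metrics: in every epoch, recall is 1.0 while accuracy equals precision. That pattern is what you would see if every evaluation example were predicted positive, although the card does not say so. The sketch below checks the arithmetic under stated assumptions: a binary task, and a hypothetical evaluation set of 10,000 examples with 69 positives. Neither the task type nor the set size appears in the card, and `binary_metrics` is an illustrative helper, not part of the training code.

```python
def binary_metrics(tp, fp, fn, tn):
    """Compute accuracy, precision, recall and F1 from confusion-matrix counts."""
    total = tp + fp + fn + tn
    accuracy = (tp + tn) / total
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return {"accuracy": accuracy, "precision": precision, "recall": recall, "f1": f1}


# Hypothetical counts (assumption): 69 positives out of 10,000 examples,
# and a model that predicts "positive" for every example.
always_positive = binary_metrics(tp=69, fp=9931, fn=0, tn=0)

# Round to four decimals, the precision used in the model card.
rounded = {name: round(value, 4) for name, value in always_positive.items()}
print(rounded)
```

With these counts the rounded values match the new card's Accuracy 0.0069, F1 0.0137, Precision 0.0069 and Recall 1.0. If the assumption holds, the unchanged metrics across all 20 epochs would reflect the label distribution rather than anything the model learned.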
model.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:61bae392a8394d53d4409e1f02f994c5993fe672bfc55845e1db77ca3e3bb484
+ oid sha256:872a582ceffe7e9972cd55dd206f3dd9fc836c1d833476dc0dc501d854e84e59
  size 567740204
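
Review note on the hyperparameter change in README.md: this commit raises `learning_rate` from 2e-05 to 0.002 and keeps `lr_scheduler_type: linear`, with the last logged step at 780. The following is a minimal sketch of what a linear decay looks like over that run. It assumes zero warmup steps and decay to exactly zero at step 780, neither of which the card states. `linear_lr` is a hand-written illustration, not the scheduler code the Trainer actually ran.

```python
def linear_lr(step, base_lr=0.002, total_steps=780, warmup_steps=0):
    """Learning rate at `step` for linear warmup followed by linear decay to zero.

    warmup_steps=0 is an assumption; the model card does not list a warmup value.
    """
    if step < warmup_steps:
        # Ramp up linearly from 0 to base_lr during warmup.
        return base_lr * step / max(1, warmup_steps)
    # Decay linearly from base_lr to 0 over the remaining steps.
    remaining = max(0, total_steps - step)
    return base_lr * remaining / max(1, total_steps - warmup_steps)


# Learning rate at the end of a few epochs (39 optimizer steps per epoch in the card).
for epoch in (0, 5, 10, 15, 20):
    step = epoch * 39
    print(f"epoch {epoch:>2}  step {step:>3}  lr {linear_lr(step):.6f}")
```

Under these assumptions the rate starts at 0.002, is about 0.001 at step 390 (epoch 10), and reaches zero at step 780.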