judy93536 committed on
Commit 6b4c2c7 · 1 Parent(s): 7eb265d

End of training

Files changed (2)
  1. README.md +106 -0
  2. model.safetensors +1 -1
README.md ADDED
@@ -0,0 +1,106 @@
+ ---
+ license: apache-2.0
+ base_model: judy93536/distilroberta-rbm231k-ep20-op40
+ tags:
+ - generated_from_trainer
+ datasets:
+ - financial_phrasebank
+ metrics:
+ - accuracy
+ model-index:
+ - name: distilroberta-rbm231k-ep20-op40-all-agree_2p2k
+   results:
+   - task:
+       name: Text Classification
+       type: text-classification
+     dataset:
+       name: financial_phrasebank
+       type: financial_phrasebank
+       config: sentences_allagree
+       split: train
+       args: sentences_allagree
+     metrics:
+     - name: Accuracy
+       type: accuracy
+       value: 0.9602649006622517
+ ---
+
+ <!-- This model card has been generated automatically according to the information the Trainer had access to. You
+ should probably proofread and complete it, then remove this comment. -->
+
+ # distilroberta-rbm231k-ep20-op40-all-agree_2p2k
+
+ This model is a fine-tuned version of [judy93536/distilroberta-rbm231k-ep20-op40](https://huggingface.co/judy93536/distilroberta-rbm231k-ep20-op40) on the financial_phrasebank dataset.
+ It achieves the following results on the evaluation set:
+ - Loss: 0.1320
+ - Accuracy: 0.9603
+
+ ## Model description
+
+ More information needed
+
+ ## Intended uses & limitations
+
+ More information needed
+
+ ## Training and evaluation data
+
+ More information needed
+
+ ## Training procedure
+
+ ### Training hyperparameters
+
+ The following hyperparameters were used during training:
+ - learning_rate: 1.253335054745316e-06
+ - train_batch_size: 16
+ - eval_batch_size: 16
+ - seed: 42
+ - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
+ - lr_scheduler_type: linear
+ - lr_scheduler_warmup_ratio: 0.4
+ - num_epochs: 30
+ - mixed_precision_training: Native AMP
+
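The scheduler settings listed above (linear decay with a 0.4 warmup ratio) can be illustrated with a small pure-Python sketch. It assumes the usual "linear warmup, then linear decay to zero" shape and takes the total of 3420 optimizer steps from the final row of the training results table. The helper names `warmup_steps` and `linear_lr` are hypothetical and are not part of the commit or of any library.

```python
import math

BASE_LR = 1.253335054745316e-06  # learning_rate from the model card
TOTAL_STEPS = 3420               # last "Step" value in the results table (30 epochs x 114 steps)
WARMUP_RATIO = 0.4               # lr_scheduler_warmup_ratio from the model card


def warmup_steps(total_steps: int, ratio: float) -> int:
    """Number of warmup steps, assuming the ratio is applied to the total and rounded up."""
    return math.ceil(total_steps * ratio)


def linear_lr(step: int,
              base_lr: float = BASE_LR,
              total_steps: int = TOTAL_STEPS,
              ratio: float = WARMUP_RATIO) -> float:
    """Learning rate at a given optimizer step under linear warmup and linear decay."""
    warmup = warmup_steps(total_steps, ratio)
    if step < warmup:
        # Warmup phase: ramp linearly from 0 up to base_lr.
        return base_lr * step / max(1, warmup)
    # Decay phase: ramp linearly from base_lr down to 0 at total_steps.
    return base_lr * max(0.0, (total_steps - step) / max(1, total_steps - warmup))


if __name__ == "__main__":
    print("warmup steps:", warmup_steps(TOTAL_STEPS, WARMUP_RATIO))
    for epoch in (1, 6, 12, 21, 30):
        step = epoch * 114
        print(f"epoch {epoch:2d} (step {step:4d}): lr = {linear_lr(step):.3e}")
```

Under these assumptions the peak learning rate is only reached at step 1368, the end of epoch 12, which may help explain why accuracy in the results table moves slowly over the first several epochs.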
+ ### Training results
+
+ | Training Loss | Epoch | Step | Validation Loss | Accuracy |
+ |:-------------:|:-----:|:----:|:---------------:|:--------:|
+ | No log | 1.0 | 114 | 1.0789 | 0.4327 |
+ | No log | 2.0 | 228 | 1.0442 | 0.6115 |
+ | No log | 3.0 | 342 | 0.9709 | 0.6137 |
+ | No log | 4.0 | 456 | 0.8693 | 0.6115 |
+ | 1.0223 | 5.0 | 570 | 0.8346 | 0.6115 |
+ | 1.0223 | 6.0 | 684 | 0.7876 | 0.6115 |
+ | 1.0223 | 7.0 | 798 | 0.7355 | 0.6203 |
+ | 1.0223 | 8.0 | 912 | 0.6974 | 0.6733 |
+ | 0.7904 | 9.0 | 1026 | 0.6535 | 0.7219 |
+ | 0.7904 | 10.0 | 1140 | 0.6045 | 0.7550 |
+ | 0.7904 | 11.0 | 1254 | 0.5653 | 0.7770 |
+ | 0.7904 | 12.0 | 1368 | 0.5122 | 0.7859 |
+ | 0.7904 | 13.0 | 1482 | 0.4652 | 0.7881 |
+ | 0.5806 | 14.0 | 1596 | 0.4319 | 0.7991 |
+ | 0.5806 | 15.0 | 1710 | 0.3951 | 0.8057 |
+ | 0.5806 | 16.0 | 1824 | 0.3557 | 0.8168 |
+ | 0.5806 | 17.0 | 1938 | 0.3174 | 0.8565 |
+ | 0.3751 | 18.0 | 2052 | 0.2652 | 0.9007 |
+ | 0.3751 | 19.0 | 2166 | 0.2188 | 0.9404 |
+ | 0.3751 | 20.0 | 2280 | 0.1797 | 0.9470 |
+ | 0.3751 | 21.0 | 2394 | 0.1822 | 0.9492 |
+ | 0.1873 | 22.0 | 2508 | 0.1523 | 0.9514 |
+ | 0.1873 | 23.0 | 2622 | 0.1425 | 0.9581 |
+ | 0.1873 | 24.0 | 2736 | 0.1394 | 0.9581 |
+ | 0.1873 | 25.0 | 2850 | 0.1396 | 0.9603 |
+ | 0.1873 | 26.0 | 2964 | 0.1345 | 0.9603 |
+ | 0.1072 | 27.0 | 3078 | 0.1334 | 0.9603 |
+ | 0.1072 | 28.0 | 3192 | 0.1322 | 0.9603 |
+ | 0.1072 | 29.0 | 3306 | 0.1316 | 0.9603 |
+ | 0.1072 | 30.0 | 3420 | 0.1320 | 0.9603 |
+
+
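The card does not state how the data was split, but the numbers in the table constrain it. The sketch below is a rough consistency check, not the author's procedure. It assumes a single device, no gradient accumulation, and that the last partial batch is kept, so that steps per epoch equals ceil(train_size / batch_size). The function names are hypothetical.

```python
from fractions import Fraction

REPORTED_ACCURACY = 0.9602649006622517  # value from the model-index metadata
STEPS_PER_EPOCH = 114                   # Step column at epoch 1.0
TRAIN_BATCH_SIZE = 16                   # train_batch_size from the card


def accuracy_fraction(accuracy: float, max_eval_size: int = 1000) -> Fraction:
    """Simplest correct/total ratio that reproduces the reported accuracy."""
    return Fraction(accuracy).limit_denominator(max_eval_size)


def train_size_bounds(steps: int, batch_size: int) -> tuple[int, int]:
    """Range of training-set sizes n for which ceil(n / batch_size) == steps."""
    return (steps - 1) * batch_size + 1, steps * batch_size


frac = accuracy_fraction(REPORTED_ACCURACY)
low, high = train_size_bounds(STEPS_PER_EPOCH, TRAIN_BATCH_SIZE)

print(frac)       # 145/151
print(low, high)  # 1809 1824
```

The reduced fraction 145/151 means the evaluation set size should be a multiple of 151. One candidate is 453 sentences with 435 classified correctly, which together with 1809 to 1824 training sentences would fit an approximately 80/20 split. This is an inference from the reported numbers, not something the card states.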
+ ### Framework versions
+
+ - Transformers 4.35.2
+ - Pytorch 2.1.0+cu118
+ - Datasets 2.15.0
+ - Tokenizers 0.15.0
model.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:b43ccb210e86005d0a8673d7decb667d6fa650f16a291953a7be9ee80046f782
+ oid sha256:d00b1f2968c4dc0bc5bb942dbf8c29271a53f12b98b84032d0f2a071f947ad6c
  size 328495356
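The `model.safetensors` change above touches only a Git LFS pointer: the size is unchanged and only the SHA-256 object ID differs, so the weights were replaced by a file of identical length. As a minimal sketch, assuming the three-field pointer layout shown in the diff (`version`, `oid`, `size`), a downloaded file could be checked against the pointer roughly as follows. `parse_lfs_pointer` and `matches_pointer` are hypothetical helper names, not part of Git LFS or any Hugging Face library.

```python
import hashlib

# Pointer text for the new revision, copied from the diff without the +/- markers.
NEW_POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:d00b1f2968c4dc0bc5bb942dbf8c29271a53f12b98b84032d0f2a071f947ad6c
size 328495356
"""


def parse_lfs_pointer(text: str) -> dict:
    """Split a pointer file into its version, hash algorithm, digest, and size."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(data: bytes, pointer: dict) -> bool:
    """Return True if the bytes have the size and SHA-256 digest named by the pointer."""
    if pointer["algo"] != "sha256":
        raise ValueError(f"unsupported hash algorithm: {pointer['algo']}")
    return (len(data) == pointer["size"]
            and hashlib.sha256(data).hexdigest() == pointer["digest"])


pointer = parse_lfs_pointer(NEW_POINTER)
print(pointer["algo"], pointer["size"])
```

For a file of this size (about 328 MB) it would be more memory-friendly to hash in chunks rather than read all bytes at once. The in-memory version is kept here only for brevity.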