devhem committed on
Commit a59c198 · verified · 1 Parent(s): 918020d

Model save

Files changed (2)
  1. README.md +36 -36
  2. model.safetensors +1 -1
README.md CHANGED
@@ -18,8 +18,8 @@ should probably proofread and complete it, then remove this comment. -->
 
  This model is a fine-tuned version of [distilbert/distilbert-base-uncased](https://huggingface.co/distilbert/distilbert-base-uncased) on an unknown dataset.
  It achieves the following results on the evaluation set:
- - Loss: 0.6733
- - Accuracy: 0.7685
+ - Loss: 0.8936
+ - Accuracy: 0.7084
 
  ## Model description
 
@@ -38,8 +38,8 @@ More information needed
  ### Training hyperparameters
 
  The following hyperparameters were used during training:
- - learning_rate: 2e-06
- - train_batch_size: 32
+ - learning_rate: 0.0003
+ - train_batch_size: 16
  - eval_batch_size: 8
  - seed: 42
  - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
@@ -52,38 +52,38 @@ The following hyperparameters were used during training:
 
  | Training Loss | Epoch | Step | Validation Loss | Accuracy |
  |:-------------:|:-----:|:-----:|:---------------:|:--------:|
- | 2.1739 | 1.0 | 666 | 2.0326 | 0.2935 |
- | 1.9721 | 2.0 | 1332 | 1.3851 | 0.6387 |
- | 1.5155 | 3.0 | 1998 | 0.8932 | 0.7512 |
- | 0.8468 | 4.0 | 2664 | 0.7398 | 0.7696 |
- | 0.7476 | 5.0 | 3330 | 0.6910 | 0.7739 |
- | 0.6958 | 6.0 | 3996 | 0.6723 | 0.7773 |
- | 0.628 | 7.0 | 4662 | 0.6542 | 0.7823 |
- | 0.6108 | 8.0 | 5328 | 0.6435 | 0.7812 |
- | 0.5897 | 9.0 | 5994 | 0.6433 | 0.7795 |
- | 0.5641 | 10.0 | 6660 | 0.6367 | 0.7808 |
- | 0.5473 | 11.0 | 7326 | 0.6393 | 0.7827 |
- | 0.5367 | 12.0 | 7992 | 0.6329 | 0.7803 |
- | 0.5164 | 13.0 | 8658 | 0.6391 | 0.7769 |
- | 0.5133 | 14.0 | 9324 | 0.6371 | 0.7797 |
- | 0.5006 | 15.0 | 9990 | 0.6371 | 0.7767 |
- | 0.4928 | 16.0 | 10656 | 0.6410 | 0.7784 |
- | 0.4898 | 17.0 | 11322 | 0.6473 | 0.7792 |
- | 0.4661 | 18.0 | 11988 | 0.6443 | 0.7743 |
- | 0.463 | 19.0 | 12654 | 0.6516 | 0.7745 |
- | 0.456 | 20.0 | 13320 | 0.6573 | 0.7754 |
- | 0.4643 | 21.0 | 13986 | 0.6613 | 0.7750 |
- | 0.4471 | 22.0 | 14652 | 0.6608 | 0.7722 |
- | 0.4525 | 23.0 | 15318 | 0.6601 | 0.7718 |
- | 0.4395 | 24.0 | 15984 | 0.6645 | 0.7703 |
- | 0.4396 | 25.0 | 16650 | 0.6647 | 0.7690 |
- | 0.4371 | 26.0 | 17316 | 0.6682 | 0.7709 |
- | 0.4234 | 27.0 | 17982 | 0.6669 | 0.7707 |
- | 0.4282 | 28.0 | 18648 | 0.6706 | 0.7700 |
- | 0.4181 | 29.0 | 19314 | 0.6705 | 0.7690 |
- | 0.4275 | 30.0 | 19980 | 0.6727 | 0.7694 |
- | 0.4218 | 31.0 | 20646 | 0.6731 | 0.7685 |
- | 0.423 | 32.0 | 21312 | 0.6733 | 0.7685 |
+ | 0.7163 | 1.0 | 1332 | 0.7270 | 0.7649 |
+ | 0.6975 | 2.0 | 2664 | 0.7528 | 0.7467 |
+ | 0.6864 | 3.0 | 3996 | 0.8722 | 0.7360 |
+ | 0.7611 | 4.0 | 5328 | 1.0374 | 0.7241 |
+ | 0.814 | 5.0 | 6660 | 1.0346 | 0.6928 |
+ | 0.72 | 6.0 | 7992 | 1.1184 | 0.7023 |
+ | 0.9093 | 7.0 | 9324 | 1.1419 | 0.6240 |
+ | 1.1656 | 8.0 | 10656 | 1.3607 | 0.5707 |
+ | 1.0431 | 9.0 | 11988 | 1.1602 | 0.6464 |
+ | 0.9917 | 10.0 | 13320 | 1.2718 | 0.6244 |
+ | 1.1101 | 11.0 | 14652 | 1.1973 | 0.6158 |
+ | 1.1094 | 12.0 | 15984 | 1.1642 | 0.6128 |
+ | 1.0501 | 13.0 | 17316 | 1.2592 | 0.6205 |
+ | 0.9821 | 14.0 | 18648 | 1.1294 | 0.6543 |
+ | 1.026 | 15.0 | 19980 | 1.1774 | 0.6338 |
+ | 1.0622 | 16.0 | 21312 | 1.2379 | 0.6338 |
+ | 1.0199 | 17.0 | 22644 | 1.2025 | 0.6111 |
+ | 0.9903 | 18.0 | 23976 | 1.1224 | 0.6233 |
+ | 0.9544 | 19.0 | 25308 | 1.1009 | 0.6436 |
+ | 0.977 | 20.0 | 26640 | 1.0633 | 0.6500 |
+ | 0.9161 | 21.0 | 27972 | 1.0481 | 0.6507 |
+ | 0.8816 | 22.0 | 29304 | 1.0135 | 0.6620 |
+ | 0.8664 | 23.0 | 30636 | 1.0119 | 0.6830 |
+ | 0.8187 | 24.0 | 31968 | 0.9681 | 0.6915 |
+ | 0.7799 | 25.0 | 33300 | 1.0124 | 0.6719 |
+ | 0.7501 | 26.0 | 34632 | 0.9501 | 0.6928 |
+ | 0.7308 | 27.0 | 35964 | 0.9140 | 0.6963 |
+ | 0.6957 | 28.0 | 37296 | 0.9413 | 0.7007 |
+ | 0.6812 | 29.0 | 38628 | 0.9235 | 0.7055 |
+ | 0.6701 | 30.0 | 39960 | 0.9108 | 0.7065 |
+ | 0.649 | 31.0 | 41292 | 0.9012 | 0.7084 |
+ | 0.6345 | 32.0 | 42624 | 0.8936 | 0.7084 |
 
 
  ### Framework versions
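
Reading the two training logs in this README.md diff side by side can be sketched in a few lines of Python. The snippet below is only an illustration, not part of the commit: the helper names `best_epoch` and `max_examples_per_epoch` are hypothetical, the validation-loss values are copied from the tables, and the example-count bound assumes a single device and no gradient accumulation, neither of which the diff shows.

```python
# Validation loss per epoch (epochs 1..32), copied from the two training logs.
VAL_LOSS_BEFORE = [
    2.0326, 1.3851, 0.8932, 0.7398, 0.6910, 0.6723, 0.6542, 0.6435,
    0.6433, 0.6367, 0.6393, 0.6329, 0.6391, 0.6371, 0.6371, 0.6410,
    0.6473, 0.6443, 0.6516, 0.6573, 0.6613, 0.6608, 0.6601, 0.6645,
    0.6647, 0.6682, 0.6669, 0.6706, 0.6705, 0.6727, 0.6731, 0.6733,
]
VAL_LOSS_AFTER = [
    0.7270, 0.7528, 0.8722, 1.0374, 1.0346, 1.1184, 1.1419, 1.3607,
    1.1602, 1.2718, 1.1973, 1.1642, 1.2592, 1.1294, 1.1774, 1.2379,
    1.2025, 1.1224, 1.1009, 1.0633, 1.0481, 1.0135, 1.0119, 0.9681,
    1.0124, 0.9501, 0.9140, 0.9413, 0.9235, 0.9108, 0.9012, 0.8936,
]


def best_epoch(val_losses):
    """Return (epoch, loss) for the lowest validation loss; epochs are 1-based."""
    index = min(range(len(val_losses)), key=val_losses.__getitem__)
    return index + 1, val_losses[index]


def max_examples_per_epoch(train_batch_size, steps_per_epoch):
    """Upper bound on training examples seen per epoch.

    Assumption: one device and no gradient accumulation (not shown in the diff).
    """
    return train_batch_size * steps_per_epoch


if __name__ == "__main__":
    for name, losses in (("before", VAL_LOSS_BEFORE), ("after", VAL_LOSS_AFTER)):
        epoch, loss = best_epoch(losses)
        print(f"{name}: best epoch {epoch} (loss {loss}), final loss {losses[-1]}")
    # Batch size 32 with 666 steps/epoch vs. batch size 16 with 1332 steps/epoch.
    print(max_examples_per_epoch(32, 666), max_examples_per_epoch(16, 1332))
```

Under these assumptions both runs cover the same number of examples per epoch, so halving the batch size simply doubles the step count. In both logs the headline evaluation numbers match the final epoch rather than the epoch with the lowest validation loss (epoch 12 before this commit, epoch 1 after it).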
model.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:4ef70d67da781f4198aa35653b13e81fca247e3d00c02aa041ccae36a3521432
+ oid sha256:8f5e88fde78431e1f1ef1c896bc9c48e5ff63475c3039c22d57ada28291b6301
  size 267854100
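
The model.safetensors change touches only a Git LFS pointer file, so it can be compared without downloading the weights. A minimal sketch, assuming the three-line `version` / `oid` / `size` pointer layout shown in this diff; `parse_lfs_pointer` is a hypothetical helper, not a Git LFS or Hugging Face API.

```python
def parse_lfs_pointer(text):
    """Parse a Git LFS pointer file into its version, hash algorithm, digest, and size."""
    fields = {}
    for line in text.strip().splitlines():
        # Each pointer line is "<key> <value>".
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


# Pointer contents before and after this commit, copied from the diff.
OLD_POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:4ef70d67da781f4198aa35653b13e81fca247e3d00c02aa041ccae36a3521432
size 267854100
"""
NEW_POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:8f5e88fde78431e1f1ef1c896bc9c48e5ff63475c3039c22d57ada28291b6301
size 267854100
"""

old = parse_lfs_pointer(OLD_POINTER)
new = parse_lfs_pointer(NEW_POINTER)

if __name__ == "__main__":
    print("same size:", old["size"] == new["size"])
    print("same content hash:", old["digest"] == new["digest"])
```

The byte size is identical while the SHA-256 digest differs, which is consistent with the same architecture being saved with different weights.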