bbunijieun commited on
Commit
fbca165
·
verified ·
1 Parent(s): 44e2bf6

Initial model training

Browse files
Files changed (3) hide show
  1. README.md +63 -63
  2. generation_config.json +4 -0
  3. model.safetensors +1 -1
README.md CHANGED
@@ -13,7 +13,7 @@ should probably proofread and complete it, then remove this comment. -->
13
 
14
  This model is a fine-tuned version of [](https://huggingface.co/) on an unknown dataset.
15
  It achieves the following results on the evaluation set:
16
- - Loss: 7.0179
17
 
18
  ## Model description
19
 
@@ -48,68 +48,68 @@ The following hyperparameters were used during training:
48
 
49
  | Training Loss | Epoch | Step | Validation Loss |
50
  |:-------------:|:------:|:----:|:---------------:|
51
- | 10.4831 | 0.0480 | 10 | 10.4151 |
52
- | 10.3329 | 0.0959 | 20 | 10.1628 |
53
- | 10.0504 | 0.1439 | 30 | 9.8340 |
54
- | 9.7481 | 0.1918 | 40 | 9.5347 |
55
- | 9.4747 | 0.2398 | 50 | 9.3035 |
56
- | 9.3049 | 0.2878 | 60 | 9.1514 |
57
- | 9.1632 | 0.3357 | 70 | 9.0464 |
58
- | 9.0804 | 0.3837 | 80 | 8.9639 |
59
- | 8.9878 | 0.4317 | 90 | 8.8934 |
60
- | 8.9067 | 0.4796 | 100 | 8.8282 |
61
- | 8.8528 | 0.5276 | 110 | 8.7566 |
62
- | 8.7656 | 0.5755 | 120 | 8.6847 |
63
- | 8.7035 | 0.6235 | 130 | 8.6125 |
64
- | 8.616 | 0.6715 | 140 | 8.5304 |
65
- | 8.5265 | 0.7194 | 150 | 8.4505 |
66
- | 8.4497 | 0.7674 | 160 | 8.3641 |
67
- | 8.3703 | 0.8153 | 170 | 8.2788 |
68
- | 8.2564 | 0.8633 | 180 | 8.1889 |
69
- | 8.165 | 0.9113 | 190 | 8.0972 |
70
- | 8.0651 | 0.9592 | 200 | 8.0038 |
71
- | 7.9712 | 1.0072 | 210 | 7.9094 |
72
- | 7.8945 | 1.0552 | 220 | 7.8145 |
73
- | 7.8069 | 1.1031 | 230 | 7.7206 |
74
- | 7.709 | 1.1511 | 240 | 7.6291 |
75
- | 7.6057 | 1.1990 | 250 | 7.5450 |
76
- | 7.528 | 1.2470 | 260 | 7.4612 |
77
- | 7.4328 | 1.2950 | 270 | 7.3849 |
78
- | 7.3504 | 1.3429 | 280 | 7.3170 |
79
- | 7.3039 | 1.3909 | 290 | 7.2504 |
80
- | 7.2391 | 1.4388 | 300 | 7.1946 |
81
- | 7.2135 | 1.4868 | 310 | 7.1505 |
82
- | 7.1504 | 1.5348 | 320 | 7.1107 |
83
- | 7.1093 | 1.5827 | 330 | 7.0849 |
84
- | 7.0655 | 1.6307 | 340 | 7.0671 |
85
- | 7.0598 | 1.6787 | 350 | 7.0549 |
86
- | 7.0578 | 1.7266 | 360 | 7.0409 |
87
- | 7.0134 | 1.7746 | 370 | 7.0385 |
88
- | 7.0117 | 1.8225 | 380 | 7.0349 |
89
- | 7.0547 | 1.8705 | 390 | 7.0268 |
90
- | 7.0369 | 1.9185 | 400 | 7.0253 |
91
- | 7.0407 | 1.9664 | 410 | 7.0232 |
92
- | 7.0116 | 2.0144 | 420 | 7.0217 |
93
- | 7.0118 | 2.0624 | 430 | 7.0267 |
94
- | 6.9988 | 2.1103 | 440 | 7.0231 |
95
- | 7.0221 | 2.1583 | 450 | 7.0222 |
96
- | 6.9723 | 2.2062 | 460 | 7.0265 |
97
- | 7.0126 | 2.2542 | 470 | 7.0244 |
98
- | 7.0252 | 2.3022 | 480 | 7.0218 |
99
- | 7.0111 | 2.3501 | 490 | 7.0246 |
100
- | 6.9657 | 2.3981 | 500 | 7.0247 |
101
- | 7.0107 | 2.4460 | 510 | 7.0267 |
102
- | 7.0056 | 2.4940 | 520 | 7.0277 |
103
- | 6.9891 | 2.5420 | 530 | 7.0231 |
104
- | 7.0021 | 2.5899 | 540 | 7.0228 |
105
- | 6.9999 | 2.6379 | 550 | 7.0227 |
106
- | 6.977 | 2.6859 | 560 | 7.0192 |
107
- | 6.9834 | 2.7338 | 570 | 7.0195 |
108
- | 6.9978 | 2.7818 | 580 | 7.0207 |
109
- | 6.9647 | 2.8297 | 590 | 7.0199 |
110
- | 6.9833 | 2.8777 | 600 | 7.0196 |
111
- | 7.0092 | 2.9257 | 610 | 7.0182 |
112
- | 6.9901 | 2.9736 | 620 | 7.0179 |
113
 
114
 
115
  ### Framework versions
 
13
 
14
  This model is a fine-tuned version of [](https://huggingface.co/) on an unknown dataset.
15
  It achieves the following results on the evaluation set:
16
+ - Loss: 7.0175
17
 
18
  ## Model description
19
 
 
48
 
49
  | Training Loss | Epoch | Step | Validation Loss |
50
  |:-------------:|:------:|:----:|:---------------:|
51
+ | 10.5072 | 0.0480 | 10 | 10.4491 |
52
+ | 10.3574 | 0.0959 | 20 | 10.1991 |
53
+ | 10.0831 | 0.1439 | 30 | 9.8790 |
54
+ | 9.7946 | 0.1918 | 40 | 9.5780 |
55
+ | 9.5118 | 0.2398 | 50 | 9.3344 |
56
+ | 9.3333 | 0.2878 | 60 | 9.1722 |
57
+ | 9.1888 | 0.3357 | 70 | 9.0610 |
58
+ | 9.0913 | 0.3837 | 80 | 8.9742 |
59
+ | 9.0007 | 0.4317 | 90 | 8.9005 |
60
+ | 8.9134 | 0.4796 | 100 | 8.8328 |
61
+ | 8.8583 | 0.5276 | 110 | 8.7615 |
62
+ | 8.7722 | 0.5755 | 120 | 8.6873 |
63
+ | 8.7092 | 0.6235 | 130 | 8.6137 |
64
+ | 8.6223 | 0.6715 | 140 | 8.5340 |
65
+ | 8.5312 | 0.7194 | 150 | 8.4538 |
66
+ | 8.4582 | 0.7674 | 160 | 8.3681 |
67
+ | 8.3748 | 0.8153 | 170 | 8.2801 |
68
+ | 8.2637 | 0.8633 | 180 | 8.1936 |
69
+ | 8.1704 | 0.9113 | 190 | 8.1001 |
70
+ | 8.0697 | 0.9592 | 200 | 8.0079 |
71
+ | 7.9792 | 1.0072 | 210 | 7.9126 |
72
+ | 7.9 | 1.0552 | 220 | 7.8175 |
73
+ | 7.8134 | 1.1031 | 230 | 7.7236 |
74
+ | 7.7153 | 1.1511 | 240 | 7.6328 |
75
+ | 7.6087 | 1.1990 | 250 | 7.5477 |
76
+ | 7.5328 | 1.2470 | 260 | 7.4634 |
77
+ | 7.4347 | 1.2950 | 270 | 7.3862 |
78
+ | 7.3531 | 1.3429 | 280 | 7.3179 |
79
+ | 7.3059 | 1.3909 | 290 | 7.2513 |
80
+ | 7.2403 | 1.4388 | 300 | 7.1955 |
81
+ | 7.2128 | 1.4868 | 310 | 7.1506 |
82
+ | 7.1508 | 1.5348 | 320 | 7.1105 |
83
+ | 7.1104 | 1.5827 | 330 | 7.0835 |
84
+ | 7.067 | 1.6307 | 340 | 7.0655 |
85
+ | 7.0594 | 1.6787 | 350 | 7.0558 |
86
+ | 7.0591 | 1.7266 | 360 | 7.0411 |
87
+ | 7.0129 | 1.7746 | 370 | 7.0381 |
88
+ | 7.0107 | 1.8225 | 380 | 7.0344 |
89
+ | 7.0549 | 1.8705 | 390 | 7.0268 |
90
+ | 7.0358 | 1.9185 | 400 | 7.0249 |
91
+ | 7.0395 | 1.9664 | 410 | 7.0242 |
92
+ | 7.0105 | 2.0144 | 420 | 7.0215 |
93
+ | 7.0113 | 2.0624 | 430 | 7.0259 |
94
+ | 6.9985 | 2.1103 | 440 | 7.0213 |
95
+ | 7.0218 | 2.1583 | 450 | 7.0218 |
96
+ | 6.9735 | 2.2062 | 460 | 7.0275 |
97
+ | 7.0132 | 2.2542 | 470 | 7.0254 |
98
+ | 7.0241 | 2.3022 | 480 | 7.0219 |
99
+ | 7.0127 | 2.3501 | 490 | 7.0238 |
100
+ | 6.9644 | 2.3981 | 500 | 7.0249 |
101
+ | 7.0103 | 2.4460 | 510 | 7.0259 |
102
+ | 7.006 | 2.4940 | 520 | 7.0266 |
103
+ | 6.9882 | 2.5420 | 530 | 7.0235 |
104
+ | 7.0016 | 2.5899 | 540 | 7.0235 |
105
+ | 7.002 | 2.6379 | 550 | 7.0217 |
106
+ | 6.9782 | 2.6859 | 560 | 7.0196 |
107
+ | 6.9833 | 2.7338 | 570 | 7.0198 |
108
+ | 6.9967 | 2.7818 | 580 | 7.0202 |
109
+ | 6.9644 | 2.8297 | 590 | 7.0196 |
110
+ | 6.9825 | 2.8777 | 600 | 7.0199 |
111
+ | 7.0097 | 2.9257 | 610 | 7.0178 |
112
+ | 6.9909 | 2.9736 | 620 | 7.0175 |
113
 
114
 
115
  ### Framework versions
generation_config.json ADDED
@@ -0,0 +1,4 @@
 
 
 
 
 
1
+ {
2
+ "_from_model_config": true,
3
+ "transformers_version": "4.40.2"
4
+ }
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:b81ab0d114f7855b982991e403d732f922a63067e08bb7fe466cae434818944e
3
  size 310717424
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:eac20e408df210e7827c6763a81ce9a581784fd8de0e774fc23489cae990224f
3
  size 310717424