Anish13 commited on
Commit
79d1c45
·
verified ·
1 Parent(s): ab26567

End of training

Browse files
README.md CHANGED
@@ -15,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
15
 
16
  This model is a fine-tuned version of [gpt2](https://huggingface.co/gpt2) on the None dataset.
17
  It achieves the following results on the evaluation set:
18
- - Loss: 6.8305
19
 
20
  ## Model description
21
 
@@ -34,7 +34,7 @@ More information needed
34
  ### Training hyperparameters
35
 
36
  The following hyperparameters were used during training:
37
- - learning_rate: 0.0003
38
  - train_batch_size: 32
39
  - eval_batch_size: 32
40
  - seed: 42
@@ -47,16 +47,14 @@ The following hyperparameters were used during training:
47
 
48
  | Training Loss | Epoch | Step | Validation Loss |
49
  |:-------------:|:-----:|:----:|:---------------:|
50
- | 6.4795 | 0.94 | 500 | 6.1043 |
51
- | 5.3968 | 1.87 | 1000 | 5.7712 |
52
- | 4.7369 | 2.81 | 1500 | 5.6812 |
53
- | 4.1696 | 3.75 | 2000 | 5.7365 |
54
- | 3.6165 | 4.68 | 2500 | 5.8735 |
55
- | 3.098 | 5.62 | 3000 | 6.0607 |
56
- | 2.595 | 6.55 | 3500 | 6.3035 |
57
- | 2.1458 | 7.49 | 4000 | 6.5112 |
58
- | 1.7782 | 8.43 | 4500 | 6.7049 |
59
- | 1.5026 | 9.36 | 5000 | 6.8305 |
60
 
61
 
62
  ### Framework versions
 
15
 
16
  This model is a fine-tuned version of [gpt2](https://huggingface.co/gpt2) on the None dataset.
17
  It achieves the following results on the evaluation set:
18
+ - Loss: 5.4593
19
 
20
  ## Model description
21
 
 
34
  ### Training hyperparameters
35
 
36
  The following hyperparameters were used during training:
37
+ - learning_rate: 0.0005
38
  - train_batch_size: 32
39
  - eval_batch_size: 32
40
  - seed: 42
 
47
 
48
  | Training Loss | Epoch | Step | Validation Loss |
49
  |:-------------:|:-----:|:----:|:---------------:|
50
+ | 6.5045 | 1.17 | 500 | 6.1462 |
51
+ | 5.5972 | 2.35 | 1000 | 5.7812 |
52
+ | 5.1108 | 3.52 | 1500 | 5.6020 |
53
+ | 4.7389 | 4.69 | 2000 | 5.4971 |
54
+ | 4.4098 | 5.87 | 2500 | 5.4530 |
55
+ | 4.1016 | 7.04 | 3000 | 5.4385 |
56
+ | 3.8119 | 8.22 | 3500 | 5.4539 |
57
+ | 3.5917 | 9.39 | 4000 | 5.4593 |
 
 
58
 
59
 
60
  ### Framework versions
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:333f300ba3b1ec0e899de7e7cff2719ba64cea68d9a31749c24d7fad6cc20d53
3
  size 497783424
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:fa01c4ab463c41ea0581d8efe8ad6fc49dc764694409fa3f4e328c9b10471728
3
  size 497783424
runs/Feb10_02-02-48_nlp-gpu-01.be.ucsc.edu/events.out.tfevents.1707559369.nlp-gpu-01.be.ucsc.edu.93660.0 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:4d113b27da60e709f27322aad82b1e47559495023cbe7f545151b62c4802a219
3
- size 7880
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:20822a187079c1e30055316c4ff6556915920f4871d94e763f1306e0066ce9b0
3
+ size 8234