Anish13 committed on
Commit cdfad30 · verified · 1 Parent(s): f3c8887

End of training

README.md CHANGED
@@ -15,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
 
  This model is a fine-tuned version of [gpt2](https://huggingface.co/gpt2) on the None dataset.
  It achieves the following results on the evaluation set:
- - Loss: 5.4963
+ - Loss: 5.4775
 
  ## Model description
 
@@ -47,14 +47,14 @@ The following hyperparameters were used during training:
 
  | Training Loss | Epoch | Step | Validation Loss |
  |:-------------:|:-----:|:----:|:---------------:|
- | 6.497 | 1.17 | 500 | 6.1065 |
- | 5.565 | 2.35 | 1000 | 5.7556 |
- | 5.0933 | 3.52 | 1500 | 5.6007 |
- | 4.7317 | 4.69 | 2000 | 5.5007 |
- | 4.4097 | 5.87 | 2500 | 5.4714 |
- | 4.1095 | 7.04 | 3000 | 5.4652 |
- | 3.8288 | 8.22 | 3500 | 5.4882 |
- | 3.6166 | 9.39 | 4000 | 5.4963 |
+ | 6.4897 | 1.17 | 500 | 6.1071 |
+ | 5.5542 | 2.35 | 1000 | 5.7496 |
+ | 5.0727 | 3.52 | 1500 | 5.5820 |
+ | 4.6986 | 4.69 | 2000 | 5.4866 |
+ | 4.3625 | 5.87 | 2500 | 5.4438 |
+ | 4.0439 | 7.04 | 3000 | 5.4475 |
+ | 3.7449 | 8.22 | 3500 | 5.4655 |
+ | 3.5157 | 9.39 | 4000 | 5.4775 |
 
 
  ### Framework versions
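
Reviewer note on the README.md change: in the updated table, training loss is still falling at step 4000, while validation loss is lowest at step 2500 (5.4438) and ends at 5.4775, the value reported as the final evaluation loss. The sketch below picks out the lowest-loss step and converts a loss to perplexity. It assumes the reported loss is mean per-token cross-entropy in nats, which the README does not state; the names `VALIDATION_LOSS`, `perplexity`, and `best_step` are illustrative and not part of the repository.

```python
import math

# (step, validation loss) pairs copied from the updated README table.
VALIDATION_LOSS = [
    (500, 6.1071),
    (1000, 5.7496),
    (1500, 5.5820),
    (2000, 5.4866),
    (2500, 5.4438),
    (3000, 5.4475),
    (3500, 5.4655),
    (4000, 5.4775),
]


def perplexity(loss: float) -> float:
    """Convert a loss to perplexity.

    Assumption: the loss is mean per-token cross-entropy in nats.
    """
    return math.exp(loss)


def best_step(history):
    """Return the (step, loss) pair with the lowest validation loss."""
    return min(history, key=lambda pair: pair[1])


if __name__ == "__main__":
    step, loss = best_step(VALIDATION_LOSS)
    final_step, final_loss = VALIDATION_LOSS[-1]
    print(f"lowest validation loss {loss:.4f} at step {step}")
    print(f"perplexity at step {step}: {perplexity(loss):.1f}")
    print(f"perplexity at step {final_step}: {perplexity(final_loss):.1f}")
```

If the gap between the two steps matters for downstream use, keeping the step-2500 checkpoint rather than the final one may be worth considering; whether that checkpoint was saved is not visible in this commit.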
model.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:cf05e58521b31d1cc361ac680db484854d7c4d77b2e2f3597013553239ea73d1
+ oid sha256:8e72b980fd0fb1dd19191d68fbf2474bfd1a58a28f934b622cec33bad7a2cf53
  size 497783424
runs/Feb12_11-49-55_nlp-gpu-01.be.ucsc.edu/events.out.tfevents.1707767396.nlp-gpu-01.be.ucsc.edu.163223.1 CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:c81d946bd32656431304829cf7f59d8bb26d58e3806c282de9674ac134e78a86
- size 7880
+ oid sha256:e2e18b9ed49c83fdb485e0314e5f97258e95b65ae1da91da0490b99417287673
+ size 8234
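
Reviewer note on the two Git LFS changes: the diffs above touch only pointer files, where `oid sha256:` is the SHA-256 digest of the real file content and `size` is its length in bytes. A minimal sketch for checking a downloaded file against its pointer follows. The helper names `parse_lfs_pointer` and `matches_pointer` are illustrative, and any local file path passed in is the caller's own; nothing here is part of the repository.

```python
import hashlib


def parse_lfs_pointer(text: str) -> dict:
    """Parse Git LFS pointer text into a dict with version, oid, and size."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    algorithm, _, digest = fields["oid"].partition(":")
    if algorithm != "sha256":
        raise ValueError(f"unsupported oid algorithm: {algorithm!r}")
    return {
        "version": fields["version"],
        "oid": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path, pointer: dict) -> bool:
    """Return True if the file at `path` has the pointer's size and SHA-256."""
    digest = hashlib.sha256()
    size = 0
    with open(path, "rb") as handle:
        # Read in 1 MiB chunks so large weight files are not loaded at once.
        for chunk in iter(lambda: handle.read(1 << 20), b""):
            digest.update(chunk)
            size += len(chunk)
    return size == pointer["size"] and digest.hexdigest() == pointer["oid"]


# New pointer for model.safetensors, copied from this commit.
MODEL_POINTER = parse_lfs_pointer(
    "version https://git-lfs.github.com/spec/v1\n"
    "oid sha256:8e72b980fd0fb1dd19191d68fbf2474bfd1a58a28f934b622cec33bad7a2cf53\n"
    "size 497783424\n"
)
```

With a local copy of the weights, `matches_pointer("model.safetensors", MODEL_POINTER)` should return `True` only if the file is the one this commit points to.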