theta committed on
Commit 3fea100 · 1 Parent(s): ebe0d6a

update model card README.md

Files changed (1)
  1. README.md +7 -15
README.md CHANGED
@@ -11,9 +11,14 @@ should probably proofread and complete it, then remove this comment. -->
 
  # gpt2-reporter
 
- This model is a fine-tuned version of [uer/gpt2-chinese-cluecorpussmall](https://huggingface.co/uer/gpt2-chinese-cluecorpussmall) on an unknown dataset.
+ This model is a fine-tuned version of [theta/gpt2-reporter](https://huggingface.co/theta/gpt2-reporter) on an unknown dataset.
  It achieves the following results on the evaluation set:
- - Loss: 2.4819
+ - eval_loss: 2.4767
+ - eval_runtime: 105.1103
+ - eval_samples_per_second: 23.632
+ - eval_steps_per_second: 2.959
+ - epoch: 0.25
+ - step: 4000
 
  ## Model description
 
@@ -41,19 +46,6 @@ The following hyperparameters were used during training:
  - lr_scheduler_warmup_steps: 500
  - num_epochs: 2
 
- ### Training results
-
- | Training Loss | Epoch | Step | Validation Loss |
- |:-------------:|:-----:|:----:|:---------------:|
- | 2.7694 | 0.28 | 400 | 2.5751 |
- | 2.6336 | 0.56 | 800 | 2.5318 |
- | 2.5564 | 0.84 | 1200 | 2.5071 |
- | 2.482 | 1.12 | 1600 | 2.4993 |
- | 2.4243 | 1.4 | 2000 | 2.4910 |
- | 2.4009 | 1.68 | 2400 | 2.4850 |
- | 2.3865 | 1.96 | 2800 | 2.4819 |
-
-
  ### Framework versions
 
  - Transformers 4.25.1
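
Reviewer note: the new evaluation block reports raw metrics only. The sketch below shows how those numbers could be read, assuming (the card does not say so) that `eval_loss` is the mean per-token cross-entropy in nats and that the throughput figures cover the whole evaluation run. The variable and function names are illustrative and do not come from the repository.

```python
import math

# Values copied from the updated model card in this commit.
eval_loss = 2.4767
eval_runtime_s = 105.1103
eval_samples_per_second = 23.632
eval_steps_per_second = 2.959


def perplexity(mean_nll: float) -> float:
    """Convert a mean negative log-likelihood (in nats) to perplexity."""
    return math.exp(mean_nll)


# Assumption: eval_loss is a per-token mean cross-entropy in nats.
ppl = perplexity(eval_loss)

# Assumption: throughput is averaged over the full evaluation run,
# so runtime * rate approximates the totals.
approx_samples = eval_runtime_s * eval_samples_per_second
approx_steps = eval_runtime_s * eval_steps_per_second
approx_batch_size = approx_samples / approx_steps

print(f"perplexity        ~ {ppl:.2f}")
print(f"eval samples      ~ {approx_samples:.0f}")
print(f"eval steps        ~ {approx_steps:.0f}")
print(f"samples per step  ~ {approx_batch_size:.1f}")
```

Under the same assumption, the previous card's `Loss: 2.4819` can be passed to `perplexity` for a like-for-like comparison.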