tilyupo commited on
Commit
1fc2f08
·
1 Parent(s): 4e73068

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +13 -7
README.md CHANGED
@@ -1,21 +1,23 @@
1
  ---
2
  license: apache-2.0
3
- base_model: tilyupo/t5-xl-trivia-gpu-ca2q
4
  tags:
5
  - generated_from_keras_callback
6
  model-index:
7
- - name: t5-xl-trivia-gpu-v2-ca2q
8
  results: []
9
  ---
10
 
11
  <!-- This model card has been generated automatically according to the information Keras had access to. You should
12
  probably proofread and complete it, then remove this comment. -->
13
 
14
- # t5-xl-trivia-gpu-v2-ca2q
15
 
16
- This model is a fine-tuned version of [tilyupo/t5-xl-trivia-gpu-ca2q](https://huggingface.co/tilyupo/t5-xl-trivia-gpu-ca2q) on an unknown dataset.
17
  It achieves the following results on the evaluation set:
18
-
 
 
19
 
20
  ## Model description
21
 
@@ -34,11 +36,15 @@ More information needed
34
  ### Training hyperparameters
35
 
36
  The following hyperparameters were used during training:
37
- - optimizer: None
38
- - training_precision: float32
39
 
40
  ### Training results
41
 
 
 
 
 
42
 
43
 
44
  ### Framework versions
 
1
  ---
2
  license: apache-2.0
3
+ base_model: google/flan-t5-xl
4
  tags:
5
  - generated_from_keras_callback
6
  model-index:
7
+ - name: t5-xl-trivia-gpu-ca2q
8
  results: []
9
  ---
10
 
11
  <!-- This model card has been generated automatically according to the information Keras had access to. You should
12
  probably proofread and complete it, then remove this comment. -->
13
 
14
+ # t5-xl-trivia-gpu-ca2q
15
 
16
+ This model is a fine-tuned version of [google/flan-t5-xl](https://huggingface.co/google/flan-t5-xl) on an unknown dataset.
17
  It achieves the following results on the evaluation set:
18
+ - Train Loss: 0.6788
19
+ - Validation Loss: 1.0558
20
+ - Epoch: 1
21
 
22
  ## Model description
23
 
 
36
  ### Training hyperparameters
37
 
38
  The following hyperparameters were used during training:
39
+ - optimizer: {'name': 'Adafactor', 'weight_decay': None, 'clipnorm': None, 'global_clipnorm': None, 'clipvalue': None, 'use_ema': False, 'ema_momentum': 0.99, 'ema_overwrite_frequency': None, 'jit_compile': True, 'is_legacy_optimizer': False, 'learning_rate': 0.0002, 'beta_2_decay': -0.8, 'epsilon_1': 1e-30, 'epsilon_2': 0.001, 'clip_threshold': 1.0, 'relative_step': False}
40
+ - training_precision: mixed_bfloat16
41
 
42
  ### Training results
43
 
44
+ | Train Loss | Validation Loss | Epoch |
45
+ |:----------:|:---------------:|:-----:|
46
+ | 1.0151 | 0.9828 | 0 |
47
+ | 0.6788 | 1.0558 | 1 |
48
 
49
 
50
  ### Framework versions