tilyupo/t5-xl-trivia-ca2q

tilyupo committed on Aug 7, 2023

Commit 1fc2f08 · 1 Parent(s): 4e73068

Update README.md

Files changed (1)

README.md +13 -7

@@ -1,21 +1,23 @@
 ---
 license: apache-2.0
-base_model: tilyupo/t5-xl-trivia-gpu-ca2q
 tags:
 - generated_from_keras_callback
 model-index:
-- name: t5-xl-trivia-gpu-v2-ca2q
   results: []
 ---
 <!-- This model card has been generated automatically according to the information Keras had access to. You should
 probably proofread and complete it, then remove this comment. -->
-# t5-xl-trivia-gpu-v2-ca2q
-This model is a fine-tuned version of [tilyupo/t5-xl-trivia-gpu-ca2q](https://huggingface.co/tilyupo/t5-xl-trivia-gpu-ca2q) on an unknown dataset.
 It achieves the following results on the evaluation set:
 ## Model description
@@ -34,11 +36,15 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- optimizer: None
-- training_precision: float32
 ### Training results
 ### Framework versions

 ---
 license: apache-2.0
+base_model: google/flan-t5-xl
 tags:
 - generated_from_keras_callback
 model-index:
+- name: t5-xl-trivia-gpu-ca2q
   results: []
 ---
 <!-- This model card has been generated automatically according to the information Keras had access to. You should
 probably proofread and complete it, then remove this comment. -->
+# t5-xl-trivia-gpu-ca2q
+This model is a fine-tuned version of [google/flan-t5-xl](https://huggingface.co/google/flan-t5-xl) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Train Loss: 0.6788
+- Validation Loss: 1.0558
+- Epoch: 1
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- optimizer: {'name': 'Adafactor', 'weight_decay': None, 'clipnorm': None, 'global_clipnorm': None, 'clipvalue': None, 'use_ema': False, 'ema_momentum': 0.99, 'ema_overwrite_frequency': None, 'jit_compile': True, 'is_legacy_optimizer': False, 'learning_rate': 0.0002, 'beta_2_decay': -0.8, 'epsilon_1': 1e-30, 'epsilon_2': 0.001, 'clip_threshold': 1.0, 'relative_step': False}
+- training_precision: mixed_bfloat16
 ### Training results
+| Train Loss | Validation Loss | Epoch |
+|:----------:|:---------------:|:-----:|
+| 1.0151     | 0.9828          | 0     |
+| 0.6788     | 1.0558          | 1     |
 ### Framework versions
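
The training results added in this commit show the train loss falling while the validation loss rises between epoch 0 and epoch 1. A minimal sketch of how that could be checked, assuming only the two rows in the table above (the `results` list and the `loss_deltas` helper are illustrative names, not part of the repository):

```python
# Per-epoch losses copied from the "Training results" table in the new README.
results = [
    {"epoch": 0, "train_loss": 1.0151, "val_loss": 0.9828},
    {"epoch": 1, "train_loss": 0.6788, "val_loss": 1.0558},
]


def loss_deltas(rows):
    """Return (train_delta, val_delta) between the first and last epoch."""
    first, last = rows[0], rows[-1]
    return (
        round(last["train_loss"] - first["train_loss"], 4),
        round(last["val_loss"] - first["val_loss"], 4),
    )


train_delta, val_delta = loss_deltas(results)

# A falling train loss combined with a rising validation loss is a common
# (though not conclusive) sign of overfitting.
diverging = train_delta < 0 and val_delta > 0

print(f"train: {train_delta:+.4f}, validation: {val_delta:+.4f}, diverging: {diverging}")
```

With only two epochs recorded, this is a hint rather than a firm conclusion; the model card does not state which checkpoint was ultimately kept.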