gokulsabari
/

paligemma-adapter

Generated from Trainer

text-generation-inference

Model card Files Files and versions

Metrics Training metrics Community

gokulsabari commited on Sep 6, 2024

Commit

ffa4c39

·

verified ·

1 Parent(s): 87db0e7

End of training

Files changed (1) hide show

README.md +8 -12

README.md CHANGED Viewed

@@ -16,7 +16,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [google/paligemma-3b-pt-224](https://huggingface.co/google/paligemma-3b-pt-224) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.0083
 ## Model description
@@ -44,22 +44,18 @@ The following hyperparameters were used during training:
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 2
-- num_epochs: 10
 ### Training results
 | Training Loss | Epoch | Step  | Validation Loss |
 |:-------------:|:-----:|:-----:|:---------------:|
-| 1.0288        | 1.0   | 8560  | 1.0359          |
-| 1.0149        | 2.0   | 17120 | 1.0133          |
-| 1.0197        | 3.0   | 25680 | 1.0096          |
-| 1.0073        | 4.0   | 34240 | 1.0086          |
-| 1.0054        | 5.0   | 42800 | 1.0084          |
-| 1.0112        | 6.0   | 51360 | 1.0083          |
-| 1.0173        | 7.0   | 59920 | 1.0083          |
-| 1.0175        | 8.0   | 68480 | 1.0083          |
-| 1.0181        | 9.0   | 77040 | 1.0082          |
-| 1.0022        | 10.0  | 85600 | 1.0083          |
 ### Framework versions

 This model is a fine-tuned version of [google/paligemma-3b-pt-224](https://huggingface.co/google/paligemma-3b-pt-224) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.9127
 ## Model description
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 2
+- num_epochs: 100
 ### Training results
 | Training Loss | Epoch | Step  | Validation Loss |
 |:-------------:|:-----:|:-----:|:---------------:|
+| 0.9767        | 1.0   | 8560  | 0.9846          |
+| 0.9463        | 2.0   | 17120 | 0.9456          |
+| 0.9415        | 3.0   | 25680 | 0.9300          |
+| 0.9208        | 4.0   | 34240 | 0.9215          |
+| 0.9138        | 5.0   | 42800 | 0.9163          |
+| 0.9138        | 6.0   | 51360 | 0.9127          |
 ### Framework versions