End of training
README.md CHANGED

@@ -2,6 +2,7 @@
 license: other
 library_name: peft
 tags:
+- axolotl
 - generated_from_trainer
 base_model: Qwen/Qwen1.5-0.5B-Chat
 model-index:
@@ -119,7 +120,9 @@ special_tokens:
 
 # Qwen1.5-Capybara-0.5B-Chat
 
-This model is a fine-tuned version of [Qwen/Qwen1.5-0.5B-Chat](https://huggingface.co/Qwen/Qwen1.5-0.5B-Chat) on
+This model is a fine-tuned version of [Qwen/Qwen1.5-0.5B-Chat](https://huggingface.co/Qwen/Qwen1.5-0.5B-Chat) on the None dataset.
+It achieves the following results on the evaluation set:
+- Loss: 1.0419
 
 ## Model description
 
@@ -152,6 +155,16 @@ The following hyperparameters were used during training:
 - lr_scheduler_warmup_steps: 15
 - num_epochs: 1
 
+### Training results
+
+| Training Loss | Epoch | Step | Validation Loss |
+|:-------------:|:-----:|:----:|:---------------:|
+| 1.164         | 0.0   | 1    | 1.2662          |
+| 0.759         | 0.25  | 343  | 1.0705          |
+| 0.6798        | 0.5   | 686  | 1.0525          |
+| 1.2828        | 0.75  | 1029 | 1.0419          |
+
+
 ### Framework versions
 
 - PEFT 0.9.1.dev0
|