End of training

Files changed (3)

README.md CHANGED

@@ -16,7 +16,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [google/flan-t5-base](https://huggingface.co/google/flan-t5-base) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.3182
+- Loss: 0.3085
 ## Model description
@@ -47,16 +47,16 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 5.7579        | 1.0   | 30   | 4.5640          |
-| 3.5101        | 2.0   | 60   | 2.1591          |
-| 1.3903        | 3.0   | 90   | 0.7851          |
-| 0.8212        | 4.0   | 120  | 0.5676          |
-| 0.6335        | 5.0   | 150  | 0.4714          |
-| 0.5552        | 6.0   | 180  | 0.4110          |
-| 0.4997        | 7.0   | 210  | 0.3694          |
-| 0.4455        | 8.0   | 240  | 0.3406          |
-| 0.4249        | 9.0   | 270  | 0.3239          |
-| 0.4           | 10.0  | 300  | 0.3182          |
+| 5.7293        | 1.0   | 30   | 4.6146          |
+| 3.3709        | 2.0   | 60   | 2.0158          |
+| 1.382         | 3.0   | 90   | 0.7932          |
+| 0.8337        | 4.0   | 120  | 0.5675          |
+| 0.6557        | 5.0   | 150  | 0.4715          |
+| 0.5561        | 6.0   | 180  | 0.4106          |
+| 0.5042        | 7.0   | 210  | 0.3627          |
+| 0.4905        | 8.0   | 240  | 0.3346          |
+| 0.4602        | 9.0   | 270  | 0.3145          |
+| 0.39          | 10.0  | 300  | 0.3085          |
 ### Framework versions
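
Reviewer note: the README hunks record two runs with the same schedule (10 epochs, 30 steps each). A minimal sketch for comparing the per-epoch validation loss of the two runs, using only the values from the tables in this diff. The names `old_val`, `new_val`, and `relative_change` are illustrative and do not come from the repository.

```python
# Validation loss per epoch, copied from the removed (-) and added (+) table rows.
old_val = [4.5640, 2.1591, 0.7851, 0.5676, 0.4714, 0.4110, 0.3694, 0.3406, 0.3239, 0.3182]
new_val = [4.6146, 2.0158, 0.7932, 0.5675, 0.4715, 0.4106, 0.3627, 0.3346, 0.3145, 0.3085]


def relative_change(old: float, new: float) -> float:
    """Fractional change from old to new; negative means the loss went down."""
    return (new - old) / old


# Per-epoch absolute difference (new minus old), rounded to the table's precision.
deltas = [round(n - o, 4) for o, n in zip(old_val, new_val)]
final_change = relative_change(old_val[-1], new_val[-1])

for epoch, delta in enumerate(deltas, start=1):
    print(f"epoch {epoch:2d}: {delta:+.4f}")
print(f"final validation loss: {final_change:+.2%}")
```

Differences this small between otherwise identical runs may simply reflect run-to-run variation; the diff itself does not say what changed between the two trainings.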

adapter_config.json CHANGED

@@ -23,8 +23,8 @@
   "rank_pattern": {},
   "revision": null,
   "target_modules": [
-    "v",
-    "q"
+    "q",
+    "v"
   ],
   "task_type": "SEQ_2_SEQ_LM",
   "use_dora": false,
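
Reviewer note: the only change in this hunk is the order of the two entries in `target_modules`. Assuming the list is consumed as an unordered collection of module names (PEFT's `LoraConfig` appears to convert a list of target modules into a set, which would explain why the serialized order differs between runs), the two configurations are equivalent. A minimal sketch of an order-insensitive comparison using only the standard library; the JSON fragments are trimmed to fields visible in the diff, and `normalized` is a hypothetical helper.

```python
import json

# Fragments reduced to the fields shown in this hunk (the full file has more keys).
old_fragment = '{"target_modules": ["v", "q"], "task_type": "SEQ_2_SEQ_LM", "use_dora": false}'
new_fragment = '{"target_modules": ["q", "v"], "task_type": "SEQ_2_SEQ_LM", "use_dora": false}'


def normalized(config_text: str) -> dict:
    """Parse a config and sort target_modules so list order is ignored."""
    config = json.loads(config_text)
    config["target_modules"] = sorted(config["target_modules"])
    return config


same = normalized(old_fragment) == normalized(new_fragment)
print(same)
```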

adapter_model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:afd7d6f468c508d392ef3a10b4ad71ce4e51336b6544a0ee1b7914fce88f7719
+oid sha256:55df64e77ed208b2c2837230afa689a8fdf6b275ddd3e80b8ba86500d6188eb4
 size 6655648
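
Reviewer note: `adapter_model.safetensors` is stored as a Git LFS pointer, so the diff shows only the SHA-256 object ID and the byte size. The size is unchanged (6655648) while the `oid` differs, which is consistent with retrained weights of the same shape. A minimal sketch for parsing such a pointer and checking a downloaded file against it; `parse_lfs_pointer` and `matches_pointer` are hypothetical helpers, not part of any library.

```python
import hashlib

POINTER_TEXT = """version https://git-lfs.github.com/spec/v1
oid sha256:55df64e77ed208b2c2837230afa689a8fdf6b275ddd3e80b8ba86500d6188eb4
size 6655648
"""


def parse_lfs_pointer(text: str) -> dict:
    """Split each 'key value' line of a pointer file into a small dict."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(data: bytes, pointer: dict) -> bool:
    """True if the bytes have the size and hash recorded in the pointer."""
    if len(data) != pointer["size"]:
        return False
    return hashlib.new(pointer["algo"], data).hexdigest() == pointer["digest"]


pointer = parse_lfs_pointer(POINTER_TEXT)
print(pointer["algo"], pointer["size"])
```

To verify a local copy, read the file in binary mode and pass its bytes to `matches_pointer` together with the parsed pointer.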