cnatale/Mistral-7B-Instruct-v0.1-Txt-2-Presto-SQL

Files changed (5) hide show

README.md CHANGED Viewed

@@ -20,7 +20,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [mistralai/Mistral-7B-Instruct-v0.1](https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.1) on the generator dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.7162
 ## Model description
@@ -46,19 +46,19 @@ The following hyperparameters were used during training:
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: constant
 - lr_scheduler_warmup_steps: 0.03
-- training_steps: 120
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 1.0447        | 1.54  | 20   | 0.9470          |
-| 0.7971        | 3.08  | 40   | 0.7493          |
-| 0.6729        | 4.62  | 60   | 0.6838          |
-| 0.6072        | 6.15  | 80   | 0.6743          |
-| 0.5361        | 7.69  | 100  | 0.6896          |
-| 0.4948        | 9.23  | 120  | 0.7162          |
 ### Framework versions

 This model is a fine-tuned version of [mistralai/Mistral-7B-Instruct-v0.1](https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.1) on the generator dataset.
 It achieves the following results on the evaluation set:
+- Loss: 1.2714
 ## Model description
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: constant
 - lr_scheduler_warmup_steps: 0.03
+- training_steps: 360
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| 0.6738        | 4.62  | 60   | 0.6807          |
+| 0.4764        | 9.23  | 120  | 0.7272          |
+| 0.3485        | 13.85 | 180  | 0.8233          |
+| 0.2559        | 18.46 | 240  | 0.9627          |
+| 0.1664        | 23.08 | 300  | 1.1172          |
+| 0.08          | 27.69 | 360  | 1.2714          |
 ### Framework versions

adapter_config.json CHANGED Viewed

@@ -19,8 +19,8 @@
   "rank_pattern": {},
   "revision": null,
   "target_modules": [
-    "q_proj",
-    "v_proj"
   ],
   "task_type": "CAUSAL_LM"
 }

   "rank_pattern": {},
   "revision": null,
   "target_modules": [
+    "v_proj",
+    "q_proj"
   ],
   "task_type": "CAUSAL_LM"
 }

adapter_model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:d20308eb027f024b2c4f159fc918733a3e3afc4b92044abe2d7f62a89ba5d6b9
 size 109069176

 version https://git-lfs.github.com/spec/v1
+oid sha256:bbed0401300b52a40073e3bd78c99aae96fea476ed824b123527fd92e2c1f839
 size 109069176

runs/Jan03_04-22-22_543a4a17d0f5/events.out.tfevents.1704255743.543a4a17d0f5.1870.0 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:344ec4a8a2321105c31f283a1c61a7fba0a7766bf353639a47e97adccbf40602
+size 12411

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:213d342d0418ccabc526799a32cb2ac3e9d264c39a32aa4f28a98d1a70f02b1c
-size 4728

 version https://git-lfs.github.com/spec/v1
+oid sha256:2790fe88c06b0adeb9d587662a4a123ebd0bb0dc8a93f3a3574cc65dead43f61
+size 4792