<deboramachadoandrade>/mistral-7binstruct-summary-100s

Files changed (5) hide show

README.md CHANGED Viewed

@@ -20,7 +20,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [mistralai/Mistral-7B-Instruct-v0.2](https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.2) on the generator dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.5221
 ## Model description
@@ -46,20 +46,16 @@ The following hyperparameters were used during training:
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: cosine
 - lr_scheduler_warmup_steps: 3
-- training_steps: 200
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 1.6694        | 0.03  | 25   | 1.6401          |
-| 1.5146        | 0.05  | 50   | 1.5519          |
-| 1.5803        | 0.08  | 75   | 1.5408          |
-| 1.5458        | 0.11  | 100  | 1.5345          |
-| 1.4207        | 0.13  | 125  | 1.5274          |
-| 1.5952        | 0.16  | 150  | 1.5242          |
-| 1.5299        | 0.19  | 175  | 1.5224          |
-| 1.5765        | 0.22  | 200  | 1.5221          |
 ### Framework versions

 This model is a fine-tuned version of [mistralai/Mistral-7B-Instruct-v0.2](https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.2) on the generator dataset.
 It achieves the following results on the evaluation set:
+- Loss: 1.4587
 ## Model description
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: cosine
 - lr_scheduler_warmup_steps: 3
+- training_steps: 100
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| 1.6938        | 0.22  | 25   | 1.5762          |
+| 1.5801        | 0.43  | 50   | 1.4713          |
+| 1.5327        | 0.65  | 75   | 1.4617          |
+| 1.5411        | 0.86  | 100  | 1.4587          |
 ### Framework versions

adapter_config.json CHANGED Viewed

@@ -19,8 +19,8 @@
   "rank_pattern": {},
   "revision": null,
   "target_modules": [
-    "v_proj",
-    "q_proj"
   ],
   "task_type": "CAUSAL_LM",
   "use_dora": false,

   "rank_pattern": {},
   "revision": null,
   "target_modules": [
+    "q_proj",
+    "v_proj"
   ],
   "task_type": "CAUSAL_LM",
   "use_dora": false,

adapter_model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:61bcd0a374135e4c249d5aec3ce1b64e10a87bf1955f8b777da2bbcad519c9e8
 size 27280152

 version https://git-lfs.github.com/spec/v1
+oid sha256:8144a3b3e7bfa33c88b35a1d1f62f6401a643fd009b109774c7142df3d8b7e3f
 size 27280152

runs/Mar04_23-52-56_19d606c365e8/events.out.tfevents.1709596378.19d606c365e8.3636.0 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:fea8a296479e639e34d608e3d77e8b6541533dfece70ddec2a8dcb4232c2c60f
+size 8529

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:c450e5d116b364cdc20420171fc71cf9dd93b778b22228cb000fb5706d15ef16
 size 4920

 version https://git-lfs.github.com/spec/v1
+oid sha256:f545718b2b127ca21456e3fbf192f30e091286e3cfebd02cd9980d5a052a8062
 size 4920