ai-maker-space/mistral-7binstruct-summary-100s

Files changed (5) hide show

README.md CHANGED Viewed

@@ -20,7 +20,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [mistralai/Mistral-7B-Instruct-v0.2](https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.2) on the generator dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.4679
 ## Model description
@@ -52,14 +52,14 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 1.679         | 0.22  | 25   | 1.5459          |
-| 1.557         | 0.43  | 50   | 1.4679          |
 ### Framework versions
-- PEFT 0.8.2
-- Transformers 4.38.1
-- Pytorch 2.1.0+cu121
-- Datasets 2.17.1
 - Tokenizers 0.15.2

 This model is a fine-tuned version of [mistralai/Mistral-7B-Instruct-v0.2](https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.2) on the generator dataset.
 It achieves the following results on the evaluation set:
+- Loss: 1.4676
 ## Model description
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| 1.6649        | 0.22  | 25   | 1.5493          |
+| 1.6041        | 0.43  | 50   | 1.4676          |
 ### Framework versions
+- PEFT 0.10.0
+- Transformers 4.39.3
+- Pytorch 2.2.1+cu121
+- Datasets 2.18.0
 - Tokenizers 0.15.2

adapter_config.json CHANGED Viewed

@@ -6,6 +6,7 @@
   "fan_in_fan_out": false,
   "inference_mode": true,
   "init_lora_weights": true,
   "layers_pattern": null,
   "layers_to_transform": null,
   "loftq_config": {},
@@ -23,5 +24,6 @@
     "q_proj"
   ],
   "task_type": "CAUSAL_LM",
   "use_rslora": false
 }

   "fan_in_fan_out": false,
   "inference_mode": true,
   "init_lora_weights": true,
+  "layer_replication": null,
   "layers_pattern": null,
   "layers_to_transform": null,
   "loftq_config": {},
     "q_proj"
   ],
   "task_type": "CAUSAL_LM",
+  "use_dora": false,
   "use_rslora": false
 }

adapter_model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:371726668e0352eb0ddb0ee13fdd91e3101b1998bbcb619f9fa9e6c74e97e859
 size 27280152

 version https://git-lfs.github.com/spec/v1
+oid sha256:524f35bc029cca42390cf052de4f6c8956b77cd297ad794fc1ea7c1adaf502ad
 size 27280152

runs/Apr16_22-25-52_75d617b1173a/events.out.tfevents.1713306385.75d617b1173a.402.0 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:26ed5f4c9554450a61dc6801f76771475d7e852e324051cd78ca7142ae590163
+size 7037

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:bcc87dd0b7311063683f72f4c258a74b2d48fa674885727d339c7a9f93382ace
 size 4920

 version https://git-lfs.github.com/spec/v1
+oid sha256:7e724461c3b1bc07935f66fbe93443f4fbd5943ebfe5a0ec0629c37875208d8f
 size 4920