siqi00 committed on
Commit 77d3f36 · 1 Parent(s): 5c77577

change model hyperparameters

README.md CHANGED
@@ -8,14 +8,14 @@ tags:
  datasets:
  - siqi00/mistral_ultrafeedback_unhelpful_chatprompt_0.7_1.0_50_320
  model-index:
- - name: mistral-feedbuhcp2-dft-lr2e-6-tau0.3-u_init0-s2-e2-gamma0.95
+ - name: mistral-feedbuhcp2-dft-lr2e-6-tau0.3-u_init0-s2-e2-gamma0.90-rf
  results: []
  ---
 
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
  should probably proofread and complete it, then remove this comment. -->
 
- # mistral-feedbuhcp2-dft-lr2e-6-tau0.3-u_init0-s2-e2-gamma0.95
+ # mistral-feedbuhcp2-dft-lr2e-6-tau0.3-u_init0-s2-e2-gamma0.90-rf
 
  This model is a fine-tuned version of [mistralai/Mistral-7B-v0.1](https://huggingface.co/mistralai/Mistral-7B-v0.1) on the siqi00/mistral_ultrafeedback_unhelpful_chatprompt_0.7_1.0_50_320 dataset.
 
model-00001-of-00003.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:fb1a0385840f207faf8be4fb6c57b6ab3bdaf54ce9fc258f3ba989db09b8a686
+ oid sha256:ccfdf0b423d6201b8e9dcfb2092e5fce0296ce35957bbe4c67f3c3c9ec86aba1
  size 4943162336
model-00002-of-00003.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:b1e574b9c8ea7fb4653cf6871d31864eabc848d9ab4c4eef32de708f2e3315ed
+ oid sha256:be47fbe3584fb4415099a7677f073e1d6b5aa55466a478a073cc4a071b8710c5
  size 4999819336
model-00003-of-00003.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:ea0c07ecbcaad2eadf3530908976cf2a4791af032c298ed7f399db4fa9999962
+ oid sha256:1f7066d160c2ed57b12fb648db44f427d8c2f7bff07d63b99ca6e9945917afed
  size 4540516344
trainer_state.json CHANGED
The diff for this file is too large to render.
 
training_args.bin CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:350fc7e7453fe42bc06751a388b8f1ba16a9ccb0a10a7afb8f935d1456378784
+ oid sha256:8c1eeb2d9b66ab2e35755fab6632cb3a3886c0a5a2c919b56baaa04fb86513e5
  size 8120
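
The binary files in this commit are stored as Git LFS pointers: three lines giving the spec version, a `sha256` object id, and the size in bytes. Because the sizes are unchanged and only the `oid` values differ, a checksum is the only way to tell the old and new weights apart. The following is a minimal sketch of checking a downloaded file against its pointer. The helper names `parse_lfs_pointer` and `verify_file` are hypothetical and not part of any library; the pointer text is copied from the new `training_args.bin` entry above.

```python
import hashlib
from pathlib import Path


def parse_lfs_pointer(text: str) -> dict:
    """Parse the 'key value' lines of a Git LFS pointer file (hypothetical helper)."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    # The oid field has the form '<algorithm>:<hex digest>'.
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def verify_file(path: Path, pointer: dict, chunk_size: int = 1 << 20) -> bool:
    """Return True if the file's size and hash match the pointer (hypothetical helper)."""
    if path.stat().st_size != pointer["size"]:
        return False
    hasher = hashlib.new(pointer["algo"])
    with path.open("rb") as handle:
        # Read in chunks so multi-gigabyte shards are not loaded into memory at once.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["digest"]


# Pointer contents for training_args.bin after this commit.
POINTER_TEXT = """\
version https://git-lfs.github.com/spec/v1
oid sha256:8c1eeb2d9b66ab2e35755fab6632cb3a3886c0a5a2c919b56baaa04fb86513e5
size 8120
"""

pointer = parse_lfs_pointer(POINTER_TEXT)
```

Assuming the file has been downloaded to a local path of your choosing, `verify_file(Path("training_args.bin"), pointer)` would report whether it matches this commit's version.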