dsakerkwq committed on
Commit ceebddc · verified · 1 Parent(s): 7178419

End of training

Files changed (2)
  1. README.md +11 -11
  2. adapter_model.bin +1 -1
README.md CHANGED
@@ -97,7 +97,7 @@ xformers_attention: false
 
  This model is a fine-tuned version of [zake7749/gemma-2-2b-it-chinese-kyara-dpo](https://huggingface.co/zake7749/gemma-2-2b-it-chinese-kyara-dpo) on the None dataset.
  It achieves the following results on the evaluation set:
- - Loss: 2.8558
+ - Loss: 2.8571
 
  ## Model description
 
@@ -132,16 +132,16 @@ The following hyperparameters were used during training:
  | Training Loss | Epoch | Step | Validation Loss |
  |:-------------:|:------:|:----:|:---------------:|
  | 3.0339 | 0.0007 | 1 | 3.9516 |
- | 3.0089 | 0.0020 | 3 | 3.9466 |
- | 3.0971 | 0.0040 | 6 | 3.9206 |
- | 3.0654 | 0.0060 | 9 | 3.8324 |
- | 2.9844 | 0.0080 | 12 | 3.6591 |
- | 2.8532 | 0.0100 | 15 | 3.4738 |
- | 2.7595 | 0.0121 | 18 | 3.3276 |
- | 2.8455 | 0.0141 | 21 | 3.1981 |
- | 2.826 | 0.0161 | 24 | 3.0683 |
- | 2.7123 | 0.0181 | 27 | 2.9545 |
- | 2.8022 | 0.0201 | 30 | 2.8558 |
+ | 3.0085 | 0.0020 | 3 | 3.9458 |
+ | 3.0973 | 0.0040 | 6 | 3.9202 |
+ | 3.0628 | 0.0060 | 9 | 3.8294 |
+ | 2.9805 | 0.0080 | 12 | 3.6536 |
+ | 2.8508 | 0.0100 | 15 | 3.4703 |
+ | 2.7599 | 0.0121 | 18 | 3.3247 |
+ | 2.841 | 0.0141 | 21 | 3.1959 |
+ | 2.8249 | 0.0161 | 24 | 3.0669 |
+ | 2.715 | 0.0181 | 27 | 2.9560 |
+ | 2.8028 | 0.0201 | 30 | 2.8571 |
 
 
  ### Framework versions
adapter_model.bin CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:2c47a64f40b1597c41816e64dd8ebac49022e69b45783c98a7408c9674234565
+ oid sha256:2ee588d63e5d61b811d1c21afb5b48359958270ba89a86944cd4ced100f5ff03
  size 83197882
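
The adapter_model.bin entry above is a Git LFS pointer, not the weights themselves: it records a SHA-256 `oid` and a byte `size` for the real file. A minimal sketch of checking a downloaded file against such a pointer follows. The helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical and are not part of any Hugging Face or Git LFS tooling.

```python
import hashlib
import os


def parse_lfs_pointer(text):
    """Parse the 'key value' lines of a Git LFS pointer into a dict."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    return fields


def matches_pointer(path, pointer_text):
    """Return True if the file at `path` has the size and SHA-256 named in the pointer."""
    fields = parse_lfs_pointer(pointer_text)
    algo, _, expected_digest = fields["oid"].partition(":")
    if algo != "sha256":
        raise ValueError(f"unsupported oid algorithm: {algo}")
    # Cheap check first: a size mismatch rules the file out without hashing.
    if os.path.getsize(path) != int(fields["size"]):
        return False
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        # Hash in 1 MiB chunks so large weight files are not loaded into memory at once.
        for chunk in iter(lambda: handle.read(1 << 20), b""):
            digest.update(chunk)
    return digest.hexdigest() == expected_digest
```

Both sides of this diff report the same size (83197882 bytes), so the size check alone cannot tell the old adapter from the new one; only the `oid` comparison can.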