stojchet committed · Commit 9ecf0a6 · verified · 1 Parent(s): 883ae82

End of training

Files changed (1)
  1. README.md +7 -7
README.md CHANGED
@@ -17,12 +17,12 @@ should probably proofread and complete it, then remove this comment. -->
 
  This model is a fine-tuned version of [deepseek-ai/deepseek-coder-1.3b-base](https://huggingface.co/deepseek-ai/deepseek-coder-1.3b-base) on the None dataset.
  It achieves the following results on the evaluation set:
- - Loss: 0.3991
- - Eval/rewards/chosen: -1.8560
- - Eval/logps/chosen: -121.7269
- - Eval/rewards/rejected: -11.4368
- - Eval/logps/rejected: -245.1979
- - Eval/rewards/margins: 9.5809
+ - Loss: 0.4138
+ - Eval/rewards/chosen: -2.1155
+ - Eval/logps/chosen: -124.3213
+ - Eval/rewards/rejected: -11.6851
+ - Eval/logps/rejected: -247.6807
+ - Eval/rewards/margins: 9.5696
  - Eval/kl: 0.0
 
  ## Model description
@@ -59,7 +59,7 @@ The following hyperparameters were used during training:
 
  | Training Loss | Epoch | Step | Validation Loss | |
  |:-------------:|:------:|:----:|:---------------:|:---:|
- | 0.1295 | 1.7058 | 100 | 0.3991 | 0.0 |
+ | 0.1296 | 1.7058 | 100 | 0.4138 | 0.0 |
 
 
  ### Framework versions
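
The metric changes in this commit can be sanity-checked with a few lines of arithmetic. The sketch below is illustrative only: the values are copied from the diff, the dictionary keys and helper names (`implied_margin`, `deltas`) are hypothetical, and it assumes, as the metric names suggest but the README does not state, that the reward margin is the chosen reward minus the rejected reward.

```python
# Metric values copied from the README diff (old = removed lines, new = added lines).
old = {
    "loss": 0.3991,
    "rewards_chosen": -1.8560,
    "rewards_rejected": -11.4368,
    "rewards_margins": 9.5809,
}
new = {
    "loss": 0.4138,
    "rewards_chosen": -2.1155,
    "rewards_rejected": -11.6851,
    "rewards_margins": 9.5696,
}


def implied_margin(metrics):
    """Assumed definition: margin = chosen reward - rejected reward."""
    return metrics["rewards_chosen"] - metrics["rewards_rejected"]


def deltas(before, after):
    """Per-metric change (after - before), rounded to the README's 4 decimals."""
    return {key: round(after[key] - before[key], 4) for key in before}


if __name__ == "__main__":
    for label, metrics in (("old", old), ("new", new)):
        gap = abs(implied_margin(metrics) - metrics["rewards_margins"])
        print(f"{label}: implied margin {implied_margin(metrics):.4f}, "
              f"reported {metrics['rewards_margins']:.4f}, gap {gap:.4f}")
    for key, change in deltas(old, new).items():
        print(f"{key}: {change:+.4f}")
```

Under that assumption the reported margins agree with the chosen and rejected rewards to within rounding, and the commit moves the evaluation loss up by 0.0147 while the margin drops by 0.0113.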