hlillemark commited on
Commit
e0be075
·
verified ·
1 Parent(s): 506bac8

Model save

Browse files
Files changed (1) hide show
  1. README.md +15 -15
README.md CHANGED
@@ -16,9 +16,9 @@ should probably proofread and complete it, then remove this comment. -->
16
 
17
  # sft_mc
18
 
19
- This model is a fine-tuned version of [meta-llama/Meta-Llama-3-8B-Instruct](https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct) on the identity and the data_mc datasets.
20
  It achieves the following results on the evaluation set:
21
- - Loss: 2.3274
22
 
23
  ## Model description
24
 
@@ -55,19 +55,19 @@ The following hyperparameters were used during training:
55
 
56
  | Training Loss | Epoch | Step | Validation Loss |
57
  |:-------------:|:------:|:----:|:---------------:|
58
- | 1.053 | 0.7463 | 50 | 1.2667 |
59
- | 0.8178 | 1.4925 | 100 | 1.3230 |
60
- | 0.3803 | 2.2388 | 150 | 1.5491 |
61
- | 0.436 | 2.9851 | 200 | 1.5159 |
62
- | 0.2529 | 3.7313 | 250 | 1.6716 |
63
- | 0.1555 | 4.4776 | 300 | 1.8479 |
64
- | 0.078 | 5.2239 | 350 | 2.0375 |
65
- | 0.0726 | 5.9701 | 400 | 1.8966 |
66
- | 0.0411 | 6.7164 | 450 | 2.1223 |
67
- | 0.0127 | 7.4627 | 500 | 2.1808 |
68
- | 0.0083 | 8.2090 | 550 | 2.2392 |
69
- | 0.0049 | 8.9552 | 600 | 2.3008 |
70
- | 0.0035 | 9.7015 | 650 | 2.3232 |
71
 
72
 
73
  ### Framework versions
 
16
 
17
  # sft_mc
18
 
19
+ This model is a fine-tuned version of [meta-llama/Meta-Llama-3-8B-Instruct](https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct) on the None dataset.
20
  It achieves the following results on the evaluation set:
21
+ - Loss: 2.3016
22
 
23
  ## Model description
24
 
 
55
 
56
  | Training Loss | Epoch | Step | Validation Loss |
57
  |:-------------:|:------:|:----:|:---------------:|
58
+ | 1.0534 | 0.7463 | 50 | 1.2635 |
59
+ | 0.8118 | 1.4925 | 100 | 1.3805 |
60
+ | 0.3889 | 2.2388 | 150 | 1.6007 |
61
+ | 0.4361 | 2.9851 | 200 | 1.5327 |
62
+ | 0.265 | 3.7313 | 250 | 1.6067 |
63
+ | 0.1347 | 4.4776 | 300 | 1.8177 |
64
+ | 0.0857 | 5.2239 | 350 | 1.9771 |
65
+ | 0.0709 | 5.9701 | 400 | 1.9008 |
66
+ | 0.0474 | 6.7164 | 450 | 2.1317 |
67
+ | 0.0286 | 7.4627 | 500 | 2.2199 |
68
+ | 0.0091 | 8.2090 | 550 | 2.2086 |
69
+ | 0.0054 | 8.9552 | 600 | 2.2865 |
70
+ | 0.0038 | 9.7015 | 650 | 2.3016 |
71
 
72
 
73
  ### Framework versions