End of training
README.md
CHANGED
@@ -1,6 +1,9 @@
 ---
+library_name: transformers
 base_model: damienbenveniste/mistral-supervised
 tags:
+- trl
+- reward-trainer
 - generated_from_trainer
 model-index:
 - name: mistral-reward
@@ -45,7 +48,7 @@ The following hyperparameters were used during training:
 
 ### Framework versions
 
-- Transformers 4.
-- Pytorch 2.
-- Datasets 2.
-- Tokenizers 0.
+- Transformers 4.44.2
+- Pytorch 2.4.0
+- Datasets 2.21.0
+- Tokenizers 0.19.1