Commit
·
230c204
1
Parent(s):
d4b5bd8
Update README.md
Browse files
README.md
CHANGED
@@ -53,7 +53,7 @@ Remember that with lower parameter sizes, the structure of the prompt becomes mo
|
|
53 |
- 2048 7B version
|
54 |
- 512 variants of 13B and 7B
|
55 |
- merged ggml models for 13B and 7B
|
56 |
-
- Tweet fix for 13B and 7B
|
57 |
|
58 |
### Citations
|
59 |
Alpaca COT datasets
|
|
|
53 |
- 2048 7B version
|
54 |
- 512 variants of 13B and 7B
|
55 |
- merged ggml models for 13B and 7B
|
56 |
+
- Tweet fix for 13B and 7B - lower model sizes seem to be extremely sensitive to hashtags at the end of training data responses, especially at longer cutoffs
|
57 |
|
58 |
### Citations
|
59 |
Alpaca COT datasets
|