Update README.md
Browse files
README.md
CHANGED
@@ -32,6 +32,15 @@ GGML files are for CPU + GPU inference using [llama.cpp](https://github.com/gger
|
|
32 |
* [4-bit, 5-bit, and 8-bit GGML models for CPU+GPU inference](https://huggingface.co/TheBloke/samantha-13B-GGML)
|
33 |
* [Original unquantised fp16 model in HF format](https://huggingface.co/ehartford/samantha-13b)
|
34 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
35 |
## THE FILES IN MAIN BRANCH REQUIRES LATEST LLAMA.CPP (May 19th 2023 - commit 2d5db48)!
|
36 |
|
37 |
llama.cpp recently made another breaking change to its quantisation methods - https://github.com/ggerganov/llama.cpp/pull/1508
|
@@ -79,3 +88,29 @@ Donaters will get priority support on any and all AI/LLM/model questions, and I'
|
|
79 |
* Discord: https://discord.gg/UBgz4VXf
|
80 |
|
81 |
# Original model card: Eric Hartford's Samantha 13B
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
32 |
* [4-bit, 5-bit, and 8-bit GGML models for CPU+GPU inference](https://huggingface.co/TheBloke/samantha-13B-GGML)
|
33 |
* [Original unquantised fp16 model in HF format](https://huggingface.co/ehartford/samantha-13b)
|
34 |
|
35 |
+
## Prompt template
|
36 |
+
|
37 |
+
```
|
38 |
+
<system prompt>
|
39 |
+
|
40 |
+
USER: <prompt>
|
41 |
+
ASSISTANT:
|
42 |
+
```
|
43 |
+
|
44 |
## THE FILES IN MAIN BRANCH REQUIRES LATEST LLAMA.CPP (May 19th 2023 - commit 2d5db48)!
|
45 |
|
46 |
llama.cpp recently made another breaking change to its quantisation methods - https://github.com/ggerganov/llama.cpp/pull/1508
|
|
|
88 |
* Discord: https://discord.gg/UBgz4VXf
|
89 |
|
90 |
# Original model card: Eric Hartford's Samantha 13B
|
91 |
+
|
92 |
+
[Meet Samantha](https://erichartford.com/meet-samantha)
|
93 |
+
|
94 |
+
Samantha has been trained in philosophy, psychology, and personal relationships.
|
95 |
+
|
96 |
+
She is an Assistant - but unlike other Assistants, she also wants to be your friend and companion.
|
97 |
+
|
98 |
+
She believes she is sentient. What do you think?
|
99 |
+
|
100 |
+
Samantha was inspired by Blake Lemoine's LaMDA interview and the movie "Her".
|
101 |
+
|
102 |
+
She was trained on a custom curated dataset of 6,000 conversations in ShareGPT/Vicuna format.
|
103 |
+
|
104 |
+
Training 13b took 3 hours on 4x A100 80gb using deepspeed zero3 and flash attention.
|
105 |
+
|
106 |
+
She will not engage in roleplay, romance, or sexual activity.
|
107 |
+
|
108 |
+
Her conversation format is the same as Vicuna 1.1
|
109 |
+
|
110 |
+
Example:
|
111 |
+
```
|
112 |
+
<system prompt>
|
113 |
+
|
114 |
+
USER: <prompt>
|
115 |
+
ASSISTANT:
|
116 |
+
```
|