Text Generation
Transformers
Safetensors
Japanese
English
mistral
conversational
text-generation-inference
Inference Endpoints
ptrdvn commited on
Commit
f16b57e
·
1 Parent(s): 6d6526c

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +64 -0
README.md ADDED
@@ -0,0 +1,64 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ datasets:
4
+ - OpenAssistant/oasst1
5
+ - zetavg/ShareGPT-Processed
6
+ - augmxnt/ultra-orca-boros-en-ja-v1
7
+ language:
8
+ - ja
9
+ - en
10
+ ---
11
+
12
+
13
+ <p align="center">
14
+ <img src="https://cdn-uploads.huggingface.co/production/uploads/64b63f8ad57e02621dc93c8b/3uLNwKHFwEgT2YQ-BGOiH.png" alt="drawing" width="600"/>
15
+ </p>
16
+
17
+ # How to use
18
+
19
+ ```python
20
+ from transformers import AutoTokenizer, AutoModelForCausalLM
21
+ import torch
22
+
23
+ tokenizer = AutoTokenizer.from_pretrained("lightblue/karasu-7B-chat")
24
+ model = AutoModelForCausalLM.from_pretrained("lightblue/karasu-7B-chat", torch_dtype=torch.bfloat16, device_map="auto")
25
+
26
+ pipe = pipeline("text-generation", model=model, tokenizer=tokenizer)
27
+
28
+ messages = [{"role": "system", "content": "あなたはAIアシスタントです。"}]
29
+ messages.append({"role": "user", "content": "イギリスの首相は誰ですか?"})
30
+
31
+ prompt = tokenizer.apply_chat_template(conversation=messages, add_generation_prompt=True, tokenize=False)
32
+
33
+ pipe(prompt, max_new_tokens=100, do_sample=False, temperature=0.0, return_full_text=False)
34
+ ```
35
+
36
+ # Base checkpoint
37
+ [lightblue/karasu-7B](https://huggingface.co/lightblue/karasu-7B)
38
+
39
+ # Training datasets (total ~7B)
40
+ * Lightblue's suite of Kujira datasets (unreleased)
41
+ * Lightblue's own question-based datasets (unreleased)
42
+ * Lightblue's own category-based datasets (unreleased)
43
+ * [OASST](https://huggingface.co/datasets/OpenAssistant/oasst1) (Japanese chats only)
44
+ * [ShareGPT](https://huggingface.co/datasets/zetavg/ShareGPT-Processed) (Japanese chats only)
45
+ * [augmxnt/ultra-orca-boros-en-ja-v1](https://huggingface.co/datasets/augmxnt/ultra-orca-boros-en-ja-v1) (['airoboros', 'slimorca', 'ultrafeedback', 'airoboros_ja_new'] only)
46
+
47
+ # Developed by
48
+
49
+ <a href="https://www.lightblue-tech.com">
50
+ <img src="https://www.lightblue-tech.com/wp-content/uploads/2021/10/LBlogo-scaled.jpg" alt="Lightblue technology logo" width="400"/>
51
+ </a>
52
+
53
+ ### Engineers
54
+ Peter Devine
55
+
56
+ Sho Higuchi
57
+
58
+ ### Advisors
59
+ Yuuki Yamanaka
60
+
61
+ Atom Sonoda
62
+
63
+ ### Dataset evaluator
64
+ Renju Aoki