License question: is it based on the Llama license?
#48 opened about 1 month ago
by
JLouisBiz
chat_template not set in tokenizer_config
#47 opened 2 months ago
by
vizzard110
Is there a checkpoint fine-tuned only on `ultrachat_200k` that we could use for research on alignment algorithms?
#45 opened 5 months ago
by
AIR-hl
Interview request: genAI evaluation & documentation
#44 opened 5 months ago
by
evatang
Adding Evaluation Results
#42 opened 6 months ago
by
leaderboard-pr-bot
How to remove input tokens to get only output tokens?
#41 opened 7 months ago
by
ducknificient
Multilingual model
#40 opened 7 months ago
by
ducknificient
Instruction Tuning Model
2
#39 opened 7 months ago
by
ducknificient
Request: DOI
#38 opened 7 months ago
by
climbingm
Adding Evaluation Results
#37 opened 8 months ago
by
leaderboard-pr-bot
CUDA assertion error when trying to train
#36 opened 8 months ago
by
brianwilcken
Can you upload the SFT version as well?
#34 opened 9 months ago
by
jiwan-chung
Adding Evaluation Results
#33 opened 9 months ago
by
leaderboard-pr-bot
Adding Evaluation Results
#32 opened 10 months ago
by
asck
Adding Evaluation Results
#31 opened 10 months ago
by
leaderboard-pr-bot
It wrote a credible new recipe for spiced frog salad
#30 opened 11 months ago
by
MartialTerran
Write a story....
#29 opened 11 months ago
by
MartialTerran
Too many junk vocab words in the vocab.json.
8
#28 opened 11 months ago
by
MartialTerran
Bing (ChatGPT4) analyzes the "def fibonacci_sequence_to_digits(n)" example code.
#27 opened 11 months ago
by
MartialTerran
Update widget example
#26 opened 11 months ago
by
Xenova
Adding Evaluation Results
#25 opened 11 months ago
by
leaderboard-pr-bot
Deployment?
3
#24 opened 11 months ago
by
huggingface9837
[AUTOMATED] Model Memory Requirements
#22 opened 12 months ago
by
model-sizer-bot
[AUTOMATED] Model Memory Requirements
#21 opened 12 months ago
by
model-sizer-bot
[AUTOMATED] Model Memory Requirements
#20 opened 12 months ago
by
model-sizer-bot
Dataset for DPO, with a Template?
1
#17 opened about 1 year ago
by
ewqr2130
Prompt format?
4
#16 opened about 1 year ago
by
anuragrawal
Minimum supported device?
2
#15 opened about 1 year ago
by
sachinmyneni
Transformers unable to load the model
#14 opened about 1 year ago
by
iammayur
BFloat16 is not supported on MPS
8
#13 opened about 1 year ago
by
nhannn
ImportError: cannot import name 'LlamaTokenizer' from 'transformers' (/Library/Frameworks/Python.framework/Versions/3.11/lib/python3.11/site-packages/transformers/__init__.py)
1
#12 opened about 1 year ago
by
gmdl007
Training on corpus of text (astronomy) - without templates
1
#11 opened about 1 year ago
by
demetera
What are the use cases? It is deranged like Joe Biden
2
#10 opened about 1 year ago
by
froilo
What is the context size?
1
#9 opened about 1 year ago
by
streamerbtw1002
Is it on the leaderboard?
3
#8 opened about 1 year ago
by
AIWintermuteAI
You know what we are going to ask
1
#6 opened about 1 year ago
by
LaferriereJC
Fine Tuning
3
#5 opened about 1 year ago
by
ybsid
You should try training a model with 2B parameters and context length 32000.
1
#3 opened about 1 year ago
by
win10
Fantastic work guys!
2
#1 opened about 1 year ago
by
dillfrescott