error:size mismatch for model.layers.0.self_attn.k_proj.weight: copying a param with shape torch.Size([256, 2048]) from checkpoint, the shape in current model is torch.Size([2048, 2048]).
#60 opened 10 days ago
		by
		
				
							
						Kanade-jie
	
Add link to Neuron-optimized version
#59 opened about 2 months ago
		by
		
				
							
						badaoui
	
testing
#58 opened 2 months ago
		by
		
				
							
						robinhassan
	
Request: DOI
#57 opened 4 months ago
		by
		
				
							
						VCoder1410
	
Rename README.md to Orbital-AI β TinyLLaMA 1.1B Chat (Fine-Tune Ready)
#56 opened 4 months ago
		by
		
				
							
						PlanetX95
	
leave-letter-generator
#54 opened 6 months ago
		by
		
				
							
						jeyakeerthanaa
	
What is the original license of 1.1B Llama ?
								1
#52 opened 7 months ago
		by
		
				
							
						JLouisBiz
	
Adding Evaluation Results
#51 opened 8 months ago
		by
		
				
							
						MythX7
	
Adding ONNX file of this model
#50 opened 8 months ago
		by
		
				
							
						WRCREX
	
Create Tinyllama
#49 opened 8 months ago
		by
		
				
							
						Meenakshi14
	
License question, is it based on Llama license?
								1
#48 opened 10 months ago
		by
		
				
							
						JLouisBiz
	
chat_template not set in tokenizer_config
#47 opened 11 months ago
		by
		
				
							
						vizzard110
	
Is there a checkpoint after fine-tuning only on `ultrachat_200k`, which we would like to use it to do research on alignment algorithms?
#45 opened about 1 year ago
		by
		
				
							
						AIR-hl
	
Interview request: genAI evaluation & documentation
#44 opened about 1 year ago
		by
		
				
							
						evatang
	
Adding Evaluation Results
#42 opened about 1 year ago
		by
		
				
							
						leaderboard-pr-bot
	
How to remove input token to get only output token ?
β
							
						2
				#41 opened over 1 year ago
		by
		
				
							
						ducknificient
	
Multilingual model
#40 opened over 1 year ago
		by
		
				
							
						ducknificient
	
Instruction Tuning Model
								2
#39 opened over 1 year ago
		by
		
				
							
						ducknificient
	
Request: DOI
#38 opened over 1 year ago
		by
		
				
							
						climbingm
	
Adding Evaluation Results
#37 opened over 1 year ago
		by
		
				
							
						leaderboard-pr-bot
	
CUDA assertion error when trying to train
#36 opened over 1 year ago
		by
		
				
							
						brianwilcken
	
Can you upload the SFT version as well?
#34 opened over 1 year ago
		by
		
				
							
						jiwan-chung
	
Adding Evaluation Results
#33 opened over 1 year ago
		by
		
				
							
						leaderboard-pr-bot
	
Adding Evaluation Results
#32 opened over 1 year ago
		by
		
				
							
						asck
	
Adding Evaluation Results
#31 opened over 1 year ago
		by
		
				
							
						leaderboard-pr-bot
	
It wrote an credible new recipe for spiced frog salad
#30 opened over 1 year ago
		by
		
				
							
						MartialTerran
	
Write a story....
#29 opened over 1 year ago
		by
		
				
							
						MartialTerran
	
Too much Junk vocab words in the vocab.json.
								8
#28 opened over 1 year ago
		by
		
				
							
						MartialTerran
	
Bing (ChatGPT4) analyzes the "def fibonacci_sequence_to_digits(n)" example code.
#27 opened over 1 year ago
		by
		
				
							
						MartialTerran
	
Update widget example
#26 opened over 1 year ago
		by
		
				
							
						Xenova
	
Adding Evaluation Results
#25 opened over 1 year ago
		by
		
				
							
						leaderboard-pr-bot
	
Deployment?
π
							
						1
				
								3
#24 opened over 1 year ago
		by
		
				
							
						huggingface9837
	
[AUTOMATED] Model Memory Requirements
#22 opened over 1 year ago
		by
		
				
							
						model-sizer-bot
	
[AUTOMATED] Model Memory Requirements
#21 opened over 1 year ago
		by
		
				
							
						model-sizer-bot
	
[AUTOMATED] Model Memory Requirements
#20 opened over 1 year ago
		by
		
				
							
						model-sizer-bot
	
Dataset for DPO, with a Template?
								1
#17 opened almost 2 years ago
		by
		
				
							
						ewqr2130
	
Prompt format?
								4
#16 opened almost 2 years ago
		by
		
				
							
						anuragrawal
	
Minimum supported device?
								2
#15 opened almost 2 years ago
		by
		
				
							
						sachinmyneni
	
Transformers unable to load the model
#14 opened almost 2 years ago
		by
		
				
							
						iammayur
	
BFloat16 is not supported on MPS
								8
#13 opened almost 2 years ago
		by
		
				
							
						nhannn
	
ImportError: cannot import name 'LlamaTokenizer' from 'transformers' (/Library/Frameworks/Python.framework/Versions/3.11/lib/python3.11/site-packages/transformers/__init__.py)
									1
	#12 opened almost 2 years ago
		by
		
				
							
						gmdl007
	
Training on corpus of text (astronomy) - without templates
								1
#11 opened almost 2 years ago
		by
		
				
							
						demetera
	
what are use cases , it is deranged like Joe Biden
π
							π
							
						4
				
								2
#10 opened almost 2 years ago
		by
		
				
							
						froilo
	
What is the context size?
								1
#9 opened almost 2 years ago
		by
		
				
							
						streamerbtw1002
	
Is it on the leaderboard?
								3
#8 opened almost 2 years ago
		by
		
				
							
						AIWintermuteAI
	
You know what we are going to ask
π
							
						3
				
								1
#6 opened almost 2 years ago
		by
		
				
							
						LaferriereJC
	
Fine Tuning
π€
							
						2
				
								3
#5 opened almost 2 years ago
		by
		
				
							
						ybsid