Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
deepseek-ai
/
DeepSeek-R1-Distill-Qwen-1.5B
like
771
Follow
DeepSeek
32k
Text Generation
Transformers
Safetensors
qwen2
conversational
text-generation-inference
Inference Endpoints
arxiv:
2501.12948
License:
mit
Model card
Files
Files and versions
Community
20
Train
Deploy
Use this model
refs/pr/7
DeepSeek-R1-Distill-Qwen-1.5B
3 contributors
History:
9 commits
nielsr
HF staff
Add pipeline tag, link to paper
528cc7d
verified
19 days ago
figures
Release DeepSeek-R1
23 days ago
.gitattributes
1.52 kB
initial commit
23 days ago
LICENSE
1.06 kB
Release DeepSeek-R1
23 days ago
README.md
18.8 kB
Add pipeline tag, link to paper
19 days ago
config.json
679 Bytes
Add files using upload-large-folder tool
23 days ago
generation_config.json
181 Bytes
Add generation_config.json
22 days ago
model.safetensors
3.55 GB
LFS
Add files using upload-large-folder tool
23 days ago
tokenizer.json
7.03 MB
Add files using upload-large-folder tool
23 days ago
tokenizer_config.json
3.06 kB
Update tokenizer_config.json
21 days ago