Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Peacemann
/
deepseek-ai_DeepSeek-R1-0528-Qwen3-8B_LMUL
like
0
Text Generation
qwen3
lmul
research
experimental
conversational
License:
mit
Model card
Files
Files and versions
Community
main
deepseek-ai_DeepSeek-R1-0528-Qwen3-8B_LMUL
Ctrl+K
Ctrl+K
1 contributor
History:
3 commits
Peacemann
Create README.md
27737c8
verified
3 months ago
.gitattributes
Safe
1.57 kB
Add lmul-attention version of deepseek-ai/DeepSeek-R1-0528-Qwen3-8B
3 months ago
README.md
3.56 kB
Create README.md
3 months ago
chat_template.jinja
Safe
3.13 kB
Add lmul-attention version of deepseek-ai/DeepSeek-R1-0528-Qwen3-8B
3 months ago
config.json
Safe
860 Bytes
Add lmul-attention version of deepseek-ai/DeepSeek-R1-0528-Qwen3-8B
3 months ago
lmul.py
2.42 kB
Add lmul-attention version of deepseek-ai/DeepSeek-R1-0528-Qwen3-8B
3 months ago
special_tokens_map.json
Safe
485 Bytes
Add lmul-attention version of deepseek-ai/DeepSeek-R1-0528-Qwen3-8B
3 months ago
tokenizer.json
Safe
11.4 MB
LFS
Add lmul-attention version of deepseek-ai/DeepSeek-R1-0528-Qwen3-8B
3 months ago
tokenizer_config.json
Safe
5.59 kB
Add lmul-attention version of deepseek-ai/DeepSeek-R1-0528-Qwen3-8B
3 months ago