deepseek-ai/DeepSeek-R1-Distill-Llama-8B
Text Generation • Updated • 353k • 451
This is a collection of Llama- and Qwen-based models, ranging from 1.5B to 70B parameters, that are distilled from DeepSeek's new R1 models.