DeepSeek-R1-Distill Collection This is a collection of Llama and Qwen-based models ranging from 1.5B to 70B parameters with are distilled from DeepSeek's new R1 models. • 6 items • Updated 18 days ago