Lansechen/Qwen2.5-3B-Instruct-Distill-bs17k-fhm32768-batch32-epoch3-8192 Text Generation • Updated about 4 hours ago
Lansechen/Qwen2.5-3B-Instruct-Distill-bs17k-fem600-batch32-epoch3-8192 Text Generation • Updated about 5 hours ago
Lansechen/Qwen2.5-3B-Instruct-Distill-bs17k-fhm600-batch32-epoch3-8192 Text Generation • Updated about 10 hours ago • 21
Lansechen/Qwen2.5-3B-Instruct-Distill-bs17k-batch32-epoch3-8192-addthinktoken-new Text Generation • Updated 5 days ago • 21
Lansechen/Qwen2.5-3B-Instruct-Distill-bs17k-batch32-epoch3-8192-addthinktoken Text Generation • Updated 5 days ago • 15
Lansechen/OLMoE-1B-7B-012-Distill-or-math220k-batch32-epoch3-8192 Text Generation • Updated 5 days ago • 4
Lansechen/Qwen2.5-3B-Instruct-Distill-bs17k-batch32-epoch3-8192 Text Generation • Updated 6 days ago • 53
Lansechen/bs17k_collection_filtered_hard_maxlength600 Viewer • Updated about 11 hours ago • 6.55k • 5
Lansechen/details_Lansechen__Qwen2.5-7B-Open-R1-Distill_private Viewer • Updated 25 days ago • 124 • 138