T-pro 2.0: An Efficient Russian Hybrid-Reasoning Model and Playground Paper • 2512.10430 • Published 25 days ago • 113
DevQuasar/ai-sage.GigaChat3-702B-A36B-preview-bf16-GGUF Text Generation • 702B • Updated Nov 24, 2025 • 201 • 5
SWE-rebench: An Automated Pipeline for Task Collection and Decontaminated Evaluation of Software Engineering Agents Paper • 2505.20411 • Published May 26, 2025 • 92
deepcogito/cogito-v1-preview-qwen-32B Text Generation • 33B • Updated Apr 8, 2025 • 965 • 115
Feature-Level Insights into Artificial Text Detection with Sparse Autoencoders Paper • 2503.03601 • Published Mar 5, 2025 • 232
Deepseek Papers Collection Deepseek papers collection • 28 items • Updated 38 minutes ago • 295
Vikhrmodels/QVikhr-2.5-1.5B-Instruct-SMPO Text Generation • 2B • Updated Feb 3, 2025 • 13 • 16
Qwen/Qwen2.5-VL-72B-Instruct Image-Text-to-Text • 73B • Updated Jun 6, 2025 • 62.7k • • 578