STream3R: Scalable Sequential 3D Reconstruction with Causal Transformer Paper • 2508.10893 • Published 19 days ago • 30
MolmoAct: Action Reasoning Models that can Reason in Space Paper • 2508.07917 • Published 22 days ago • 40
Voost: A Unified and Scalable Diffusion Transformer for Bidirectional Virtual Try-On and Try-Off Paper • 2508.04825 • Published 27 days ago • 57
GenRecal: Generation after Recalibration from Large to Small Vision-Language Models Paper • 2506.15681 • Published Jun 18 • 40
view article Article 🪆 Introduction to Matryoshka Embedding Models By tomaarsen and 2 others • Feb 23, 2024 • 159
Running 3.14k 3.14k The Ultra-Scale Playbook 🌌 The ultimate guide to training LLM on large GPU Clusters