R-HORIZON Collection The training and evaluation datasets for the paper "How Far Can Your Large Reasoning Model Really Go in Breadth and Depth?" • 6 items • Updated Oct 22, 2025 • 7
LLM-I: LLMs are Naturally Interleaved Multimodal Creators Paper • 2509.13642 • Published Sep 17, 2025 • 9
The Invisible Leash: Why RLVR May Not Escape Its Origin Paper • 2507.14843 • Published Jul 20, 2025 • 85
WaveUI Collection WaveUI is a collection of datasets and tools to improve UI object detection • 6 items • Updated Jul 31, 2024 • 10
ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models Paper • 2505.24864 • Published May 30, 2025 • 143
BLIP3-KALE: Knowledge Augmented Large-Scale Dense Captions Paper • 2411.07461 • Published Nov 12, 2024 • 23
To Code, or Not To Code? Exploring Impact of Code in Pre-training Paper • 2408.10914 • Published Aug 20, 2024 • 45
Transfusion: Predict the Next Token and Diffuse Images with One Multi-Modal Model Paper • 2408.11039 • Published Aug 20, 2024 • 63
xGen-MM (BLIP-3): A Family of Open Large Multimodal Models Paper • 2408.08872 • Published Aug 16, 2024 • 101
JPEG-LM: LLMs as Image Generators with Canonical Codec Representations Paper • 2408.08459 • Published Aug 15, 2024 • 45
XGen-MM-1 models and datasets Collection A collection of all XGen-MM (Foundation LMM) models! • 18 items • Updated Nov 5, 2025 • 39
Gemma 2 2B Release Collection The 2.6B parameter version of Gemma 2. • 6 items • Updated Jul 10, 2025 • 83
🍃 MINT-1T Collection Data for "MINT-1T: Scaling Open-Source Multimodal Data by 10x: A Multimodal Dataset with One Trillion Tokens" • 14 items • Updated Oct 22, 2025 • 64
4M Tokenizers Collection Multimodal tokenizers from https://4m.epfl.ch/ • 15 items • Updated Mar 7, 2025 • 6
MINT-1T: Scaling Open-Source Multimodal Data by 10x: A Multimodal Dataset with One Trillion Tokens Paper • 2406.11271 • Published Jun 17, 2024 • 21