Intern-S1: A Scientific Multimodal Foundation Model Paper β’ 2508.15763 β’ Published 12 days ago β’ 242
view article Article Welcome GPT OSS, the new open-source model family from OpenAI! By reach-vb and 11 others β’ 28 days ago β’ 481
Vision as a Dialect: Unifying Visual Understanding and Generation via Text-Aligned Representations Paper β’ 2506.18898 β’ Published Jun 23 β’ 33
Tar Collection Unifying Visual Understanding and Generation via Text-Aligned Representations β’ 5 items β’ Updated Jul 2 β’ 15
Open LLM Leaderboard best models β€οΈβπ₯ Collection A daily uploaded list of models with best evaluations on the LLM leaderboard: β’ 65 items β’ Updated Mar 20 β’ 634
MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention Paper β’ 2506.13585 β’ Published Jun 16 β’ 263
Don't Look Only Once: Towards Multimodal Interactive Reasoning with Selective Visual Revisitation Paper β’ 2505.18842 β’ Published May 24 β’ 37
Omni-R1: Reinforcement Learning for Omnimodal Reasoning via Two-System Collaboration Paper β’ 2505.20256 β’ Published May 26 β’ 17
QwenLong-L1: Towards Long-Context Large Reasoning Models with Reinforcement Learning Paper β’ 2505.17667 β’ Published May 23 β’ 89
BLIP3-o: A Family of Fully Open Unified Multimodal Models-Architecture, Training and Dataset Paper β’ 2505.09568 β’ Published May 14 β’ 97