view article Article SigLIP 2: A better multilingual vision language encoder By ariG23498 and 2 others • 26 days ago • 139
SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features Paper • 2502.14786 • Published 26 days ago • 130
view article Article PaliGemma 2 Mix - New Instruction Vision Language Models by Google By ariG23498 and 2 others • 28 days ago • 65
view article Article Introducing smolagents: simple agents that write actions in code. Dec 31, 2024 • 884
view article Article Welcome PaliGemma 2 – New vision language models by Google By merve and 3 others • Dec 5, 2024 • 148
view article Article Welcome PaliGemma 2 – New vision language models by Google By merve and 3 others • Dec 5, 2024 • 148
PaliGemma Release Collection Pretrained and mix checkpoints for PaliGemma • 16 items • Updated 7 days ago • 145
view article Article PaliGemma – Google's Cutting-Edge Open Vision Language Model By merve and 2 others • May 14, 2024 • 243
view article Article PaliGemma 2 Mix - New Instruction Vision Language Models by Google By ariG23498 and 2 others • 28 days ago • 65