LLaVE: Large Language and Vision Embedding Models with Hardness-Weighted Contrastive Learning • Paper • 2503.04812 • Published about 1 month ago
LLaVE Collection • LLaVE is a series of large language and vision embedding models trained on a variety of multimodal embedding datasets • 4 items • Updated 24 days ago