Collections

Collections including paper arxiv:2409.17146
vision language models
papers and models 🙈
multilingual vision models
Papers I read to understand vision models and to add multilingual capabilities to them.
Multimodal LLMs
Collection by Sep 27, 2024
VLM papers
Collection by Jan 16
Cognition
Perception and abstraction. Each modality is tokenized and embedded into vectors for the model to comprehend.