Vision Language Model Alignment in TRL ⚡️ By sergiopaniego and 4 others • 26 days ago • 75
Introducing ColQwen-Omni: Retrieve in every modality By manu and 4 others • Jul 17 • 69
(LoRA) Fine-Tuning FLUX.1-dev on Consumer Hardware By derekl35 and 4 others • Jun 19 • 85
Enhance Your Models in 5 Minutes with the Hugging Face Kernel Hub By drbh and 6 others • Jun 12 • 131
SmolVLA: Efficient Vision-Language-Action Model trained on Lerobot Community Data By danaaubakirova and 8 others • Jun 3 • 240
nanoVLM: The simplest repository to train your VLM in pure PyTorch By ariG23498 and 6 others • May 21 • 207
Vision Language Models (Better, Faster, Stronger) By merve and 4 others • May 12 • 519
Welcoming Llama Guard 4 on Hugging Face Hub By merve and 3 others • Apr 29 • 40
Cohere on Hugging Face Inference Providers 🔥 By burtenshaw and 6 others • Apr 16 • 131
Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM By ariG23498 and 3 others • Mar 12 • 457
SigLIP 2: A better multilingual vision language encoder By ariG23498 and 2 others • Feb 21 • 179
SmolVLM2: Bringing Video Understanding to Every Device By orrzohar and 6 others • Feb 20 • 297
Open-source DeepResearch – Freeing our search agents By m-ric and 4 others • Feb 4 • 1.29k
SmolVLM Grows Smaller – Introducing the 250M & 500M Models! By andito and 2 others • Jan 23 • 182
Introducing smolagents: simple agents that write actions in code. By m-ric and 2 others • Dec 31, 2024 • 1.11k
Welcome PaliGemma 2 – New vision language models by Google By merve and 3 others • Dec 5, 2024 • 158
SmolVLM - small yet mighty Vision Language Model By andito and 4 others • Nov 26, 2024 • 356
Llama can now see and run on your device - welcome Llama 3.2 By merve and 6 others • Sep 25, 2024 • 191
Preference Optimization for Vision Language Models By qgallouedec and 3 others • Jul 10, 2024 • 80