Powerful ASR + diarization + speculative decoding with Hugging Face Inference Endpoints May 1, 2024 β’ 70
view article Article **How biased is Whisper ? Evaluating Whisper Models for Robustness to Diverse English Accents** By Steveeeeeeen β’ 1 day ago β’ 10
view article Article π Build a Qwen 2.5 VL API endpoint with Hugging Face spaces and Docker! By ariG23498 β’ 2 days ago β’ 13
π§ Reasoning datasets Collection Datasets with reasoning traces for math and code released by the community β’ 5 items β’ Updated about 14 hours ago β’ 15
Qwen2.5-VL Collection Vision-language model series based on Qwen2.5 β’ 3 items β’ Updated 4 days ago β’ 287
Qwen2.5-1M Collection The long-context version of Qwen2.5, supporting 1M-token context lengths β’ 2 items β’ Updated 4 days ago β’ 89
view article Article The SOTA Text-to-speech and Zero Shot Voice cloning model that no one knows about... By srinivasbilla β’ 10 days ago β’ 52
SmolVLM 256M & 500M Collection Collection for models & demos for even smoller SmolVLM release β’ 12 items β’ Updated 8 days ago β’ 62
view article Article Yay! Organizations can now publish blog Articles By huggingface β’ 10 days ago β’ 30
Eagle 2 Collection Eagle 2 is a family of frontier vision-language models with vision-centric design. The model supports 4K HD input, long-context video, and grounding. β’ 9 items β’ Updated 8 days ago β’ 27
view article Article Upgrading Kokoro: natural TTS for short bursts By hexgrad β’ Nov 22, 2024 β’ 26
view article Article Timm β€οΈ Transformers: Use any timm model with transformers 15 days ago β’ 37
OmniThink: Expanding Knowledge Boundaries in Machine Writing through Thinking Paper β’ 2501.09751 β’ Published 14 days ago β’ 47
view article Article Train 400x faster Static Embedding Models with Sentence Transformers 16 days ago β’ 129
view article Article Introducing multi-backends (TRT-LLM, vLLM) support for Text Generation Inference 15 days ago β’ 61