Article A Deepdive into Aya Vision: Advancing the Frontier of Multilingual Multimodality about 1 month ago • 71
Phi-3 Collection Phi-3 family of small language and multi-modal models. Language models are available in short- and long-context lengths. • 26 items • Updated Jan 8 • 566
Text-to-Image History Collection How Text-to-Image evolved on HF and inspired the Community • 55 items • Updated Nov 7, 2024 • 15
EMO: Emote Portrait Alive - Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions Paper • 2402.17485 • Published Feb 27, 2024 • 193
MobileDiffusion: Subsecond Text-to-Image Generation on Mobile Devices Paper • 2311.16567 • Published Nov 28, 2023 • 21