FastVLM • Collection • Efficient Vision Encoding for Vision Language Models • 9 items • Updated about 7 hours ago • 75
VibeVoice • Collection • Frontier Text-to-Speech Models • https://microsoft.github.io/VibeVoice/ • 5 items • Updated 1 day ago • 59