microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • 6B • Updated 29 days ago • 198k • 1.56k
huihui-ai/Phi-4-multimodal-instruct-abliterated Automatic Speech Recognition • 6B • Updated Mar 3, 2025 • 70 • 27
google/pix2struct-widget-captioning-large Visual Question Answering • 1B • Updated Apr 10, 2024 • 50 • 20