microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition β’ Updated about 24 hours ago β’ 441k β’ 1.12k
Tulu 3 Models Collection All models released with Tulu 3 -- state of the art open post-training recipes. β’ 11 items β’ Updated about 8 hours ago β’ 93
LLaVa-NeXT-Video Collection LLaVa-NeXT-Video extends LLaVa-NeXT for video understanding. β’ 5 items β’ Updated Jun 10, 2024 β’ 9
Running 543 543 Vision Arena (Testing VLMs side-by-side) πΌ Analyze images to detect and label objects