Running on Zero • 10 • Multimodal RAG with Granite Vision • RAG example using Granite [vision, embedding, instruct]
ibm-granite/granite-vision-3.1-2b-preview • Image-Text-to-Text • Updated 20 days ago • 12.5k • 93
microsoft/Phi-4-multimodal-instruct • Automatic Speech Recognition • Updated 6 days ago • 666k • 1.17k
openai/whisper-large-v3-turbo • Automatic Speech Recognition • Updated Oct 4, 2024 • 6.75M • 2.13k
Running • 2.29k • The Ultra-Scale Playbook • The ultimate guide to training LLM on large GPU Clusters