microsoft/Phi-4-multimodal-instruct
Tags: Automatic Speech Recognition, Transformers, Safetensors, 24 languages, phi4mm, text-generation, nlp, code, audio, speech-summarization, speech-translation, visual-question-answering, phi-4-multimodal, phi, phi-4-mini, custom_code
arxiv: 2407.13833
License: mit
llama.cpp support when?
#7 by alanzhuly, opened about 24 hours ago
Discussion
alanzhuly
about 24 hours ago
Great model and use case. I want to try it locally on a Mac!