---
license: apache-2.0
datasets:
- HuggingFaceM4/OBELICS
- HuggingFaceM4/the_cauldron
- HuggingFaceM4/Docmatix
- HuggingFaceM4/WebSight
language:
- en
tags:
- multimodal
- vision
- image-text-to-text
- mlx
library_name: transformers
---
mlx-community/Idefics3-8B-Llama3-4bit
This model was converted to MLX format from HuggingFaceM4/Idefics3-8B-Llama3 using mlx-vlm version 0.1.12.
Refer to the original model card for more details on the model.
Use with mlx
pip install -U mlx-vlm
python -m mlx_vlm.generate --model mlx-community/Idefics3-8B-Llama3-4bit --max-tokens 100 --temp 0.0 --prompt "Describe this image." --image <path_to_image>
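To call the same command from a script, one option is to wrap it with Python's standard `subprocess` module. The sketch below is only an illustration: it reuses exactly the flags shown above, and the helper names `build_command` and `describe` are hypothetical, not part of mlx-vlm. Running it assumes mlx-vlm is installed and that MLX is supported on the machine.

```python
import subprocess
import sys

# Model repository from this card.
MODEL = "mlx-community/Idefics3-8B-Llama3-4bit"


def build_command(image_path, prompt="Describe this image.", max_tokens=100, temp=0.0):
    """Build the argv list for the mlx_vlm.generate CLI call shown above.

    Hypothetical helper: it only assembles the flags from this card.
    """
    return [
        sys.executable, "-m", "mlx_vlm.generate",
        "--model", MODEL,
        "--max-tokens", str(max_tokens),
        "--temp", str(temp),
        "--prompt", prompt,
        "--image", image_path,
    ]


def describe(image_path, **kwargs):
    """Run the CLI and return whatever it writes to stdout.

    Raises subprocess.CalledProcessError if the command exits with an error.
    """
    result = subprocess.run(
        build_command(image_path, **kwargs),
        capture_output=True,
        text=True,
        check=True,
    )
    return result.stdout


if __name__ == "__main__" and len(sys.argv) > 1:
    # Usage: python describe_image.py <path_to_image>
    print(describe(sys.argv[1]))
```

Passing the arguments as a list avoids shell quoting problems when the prompt or the image path contains spaces.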