license: apache-2.0
language:
  - en
pipeline_tag: image-text-to-text
tags:
  - multimodal
  - gui
  - mlx
library_name: transformers

mlx-community/UI-TARS-7B-SFT-4bit

This model was converted to MLX format from bytedance-research/UI-TARS-7B-SFT using mlx-vlm version 0.1.14. Refer to the original model card for more details on the model.

Use with mlx

pip install -U mlx-vlm
python -m mlx_vlm.generate --model mlx-community/UI-TARS-7B-SFT-4bit --max-tokens 100 --temp 0.0 --prompt "Describe this image." --image <path_to_image>
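
To run the same command from a script (for example, over several screenshots), the call can be wrapped with Python's standard `subprocess` module. The following is a minimal sketch, not part of the original model card: the helper names `build_generate_command` and `describe_image` are hypothetical, and it uses only the flags shown in the command above. It assumes `mlx-vlm` is installed in the same Python environment that runs the script.

```python
import subprocess
import sys

# Model ID taken from this model card.
MODEL_ID = "mlx-community/UI-TARS-7B-SFT-4bit"


def build_generate_command(image_path, prompt="Describe this image.",
                           model=MODEL_ID, max_tokens=100, temp=0.0):
    """Build the argv list for the mlx_vlm.generate call shown above.

    Hypothetical helper: it only assembles the flags that appear in
    this README and does not run anything itself.
    """
    return [
        sys.executable, "-m", "mlx_vlm.generate",
        "--model", model,
        "--max-tokens", str(max_tokens),
        "--temp", str(temp),
        "--prompt", prompt,
        "--image", str(image_path),
    ]


def describe_image(image_path, **kwargs):
    """Run the command and return whatever the CLI prints to stdout.

    Hypothetical helper: raises subprocess.CalledProcessError if the
    CLI exits with a non-zero status.
    """
    result = subprocess.run(
        build_generate_command(image_path, **kwargs),
        check=True,
        capture_output=True,
        text=True,
    )
    return result.stdout
```

Calling `describe_image("<path_to_image>")` runs the same command as above and returns its printed output as a string. Passing the arguments as a list avoids shell quoting issues when the prompt contains spaces or quotes.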