Can this do image+prompt to image+text?

#75
by darkbyte - opened

For example, can I send a picture (input image) together with an instruction (input prompt) to segment the image into regions, and get back both an image with solid-color regions (the output image) and a JSON object that says what each color represents (the output text)?

Use Qwen2.5-VL-72B-Instruct instead, with instructions.
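
If the model you end up using only returns text, one possible workaround is to ask it for a JSON list of labelled boxes and paint the solid-color image yourself. The sketch below is only an illustration under that assumption: the JSON schema (`label`, `bbox` as `[x0, y0, x1, y1]`), the `render_regions` helper, the palette, and the example reply are all made up here and are not a documented output format of any model.

```python
import json

# Hypothetical palette; any set of distinct RGB colors would do.
PALETTE = [(230, 25, 75), (60, 180, 75), (0, 130, 200), (255, 225, 25)]
BACKGROUND = (0, 0, 0)


def render_regions(model_text, width, height):
    """Paint each labelled box as a solid color.

    Returns (pixels, legend): pixels is a height x width grid of RGB tuples,
    legend maps a hex color string to the region label.
    """
    regions = json.loads(model_text)
    pixels = [[BACKGROUND] * width for _ in range(height)]
    legend = {}
    for i, region in enumerate(regions):
        # Colors repeat if there are more regions than palette entries.
        color = PALETTE[i % len(PALETTE)]
        x0, y0, x1, y1 = region["bbox"]
        # Clamp the box to the image so bad coordinates cannot raise IndexError.
        x0, x1 = max(0, x0), min(width, x1)
        y0, y1 = max(0, y0), min(height, y1)
        for y in range(y0, y1):
            for x in range(x0, x1):
                pixels[y][x] = color
        legend["#%02x%02x%02x" % color] = region["label"]
    return pixels, legend


# Stand-in for the text a model might return (assumed format, not real output).
example_reply = (
    '[{"label": "sky", "bbox": [0, 0, 4, 2]},'
    ' {"label": "grass", "bbox": [0, 2, 4, 4]}]'
)

pixels, legend = render_regions(example_reply, width=4, height=4)
print(json.dumps(legend, indent=2))
```

The `pixels` grid can then be written out with whatever image library you prefer, and `legend` is the color-to-label JSON from the original question. Boxes are a coarse stand-in for real segmentation masks; the same idea would work with polygons if the model can return them.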
