Can this do image+prompt to image+text?
#75
by
darkbyte
- opened
For example, can I send a picture (input image) together with an instruction (input prompt) to segment the input image into regions in some way by producing an image with solid color regions (the output image) and also return a JSON which tells what each color is (the output text)?
use this instead Qwen2.5-VL-72B-Instruct
with instructions