microsoft/Phi-3-vision-128k-instruct-onnx
Updated
•
58
•
6
None defined yet.
GUI-360: A Comprehensive Dataset and Benchmark for Computer-Using Agents
Latent Sketchpad: Sketching Visual Thoughts to Elicit Multimodal Reasoning in MLLMs