metadata
library_name: transformers
license: mit
datasets:
- array/SAT
- wentao-yuan/robopoint-data
Model Card for Model ID
See the repository at https://github.com/arijitray1993/SAT for instructions on running inference with this model.
If you use this model, please cite:
@misc{ray2024satspatialaptitudetraining,
title={SAT: Spatial Aptitude Training for Multimodal Language Models},
author={Arijit Ray and Jiafei Duan and Reuben Tan and Dina Bashkirova and Rose Hendrix and Kiana Ehsani and Aniruddha Kembhavi and Bryan A. Plummer and Ranjay Krishna and Kuo-Hao Zeng and Kate Saenko},
year={2024},
eprint={2412.07755},
archivePrefix={arXiv},
primaryClass={cs.CV},
url={https://arxiv.org/abs/2412.07755},
}