|
--- |
|
library_name: transformers |
|
license: mit |
|
datasets: |
|
- array/SAT |
|
- wentao-yuan/robopoint-data |
|
--- |
|
|
|
# Model Card for Model ID |
|
|
|
Please check https://github.com/arijitray1993/SAT on how to run inference with this model. |
|
|
|
If you use the model, please cite: |
|
``` |
|
@misc{ray2024satspatialaptitudetraining, |
|
title={SAT: Spatial Aptitude Training for Multimodal Language Models}, |
|
author={Arijit Ray and Jiafei Duan and Reuben Tan and Dina Bashkirova and Rose Hendrix and Kiana Ehsani and Aniruddha Kembhavi and Bryan A. Plummer and Ranjay Krishna and Kuo-Hao Zeng and Kate Saenko}, |
|
year={2024}, |
|
eprint={2412.07755}, |
|
archivePrefix={arXiv}, |
|
primaryClass={cs.CV}, |
|
url={https://arxiv.org/abs/2412.07755}, |
|
} |
|
``` |
|
|
|
|