|
--- |
|
language: |
|
- en |
|
tags: |
|
- vision-language |
|
- clip |
|
- vilt |
|
datasets: |
|
- lil-lab/kilogram-data |
|
|
|
--- |
|
|
|
KiloGram dataset and code repo: https://github.com/lil-lab/kilogram |
|
|
|
Preprocessed training and evaluation data: https://huggingface.co/datasets/lil-lab/kilogram-data |
|
|
|
# Citation |
|
|
|
```bibtex |
|
@misc{ji2022abstractvisualreasoningtangram, |
|
title={Abstract Visual Reasoning with Tangram Shapes}, |
|
author={Anya Ji and Noriyuki Kojima and Noah Rush and Alane Suhr and Wai Keen Vong and Robert D. Hawkins and Yoav Artzi}, |
|
year={2022}, |
|
eprint={2211.16492}, |
|
archivePrefix={arXiv}, |
|
primaryClass={cs.CL}, |
|
url={https://arxiv.org/abs/2211.16492}, |
|
} |
|
``` |