|
--- |
|
language: |
|
- en |
|
base_model: openai/clip-vit-base-patch32 |
|
--- |
|
GUIClip is a vision-language model in GUI domain. |
|
|
|
Code and dataset can be found at https://github.com/Jl-wei/guing |
|
|
|
If you find our work useful, please cite our paper: |
|
```bibtex |
|
@misc{wei2024guing, |
|
title={GUing: A Mobile GUI Search Engine using a Vision-Language Model}, |
|
author={Jialiang Wei and Anne-Lise Courbis and Thomas Lambolais and Binbin Xu and Pierre Louis Bernard and Gérard Dray and Walid Maalej}, |
|
year={2024}, |
|
eprint={2405.00145}, |
|
archivePrefix={arXiv}, |
|
primaryClass={cs.SE} |
|
} |
|
``` |
|
|
|
Please note that the model can only be used for academic purpose. |