Update README.md
Browse files
README.md
CHANGED
|
@@ -143,4 +143,27 @@ print(processor.decode(predictions[0], skip_special_tokens=True))
|
|
| 143 |
|
| 144 |
# Contribution
|
| 145 |
|
| 146 |
-
This model was originally contributed by Kenton Lee, Mandar Joshi et al. and added to the Hugging Face ecosystem by [Younes Belkada](https://huggingface.co/ybelkada).
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 143 |
|
| 144 |
# Contribution
|
| 145 |
|
| 146 |
+
This model was originally contributed by Kenton Lee, Mandar Joshi et al. and added to the Hugging Face ecosystem by [Younes Belkada](https://huggingface.co/ybelkada).
|
| 147 |
+
|
| 148 |
+
# Citation
|
| 149 |
+
|
| 150 |
+
If you want to cite this work, please consider citing the original paper:
|
| 151 |
+
```
|
| 152 |
+
@misc{https://doi.org/10.48550/arxiv.2210.03347,
|
| 153 |
+
doi = {10.48550/ARXIV.2210.03347},
|
| 154 |
+
|
| 155 |
+
url = {https://arxiv.org/abs/2210.03347},
|
| 156 |
+
|
| 157 |
+
author = {Lee, Kenton and Joshi, Mandar and Turc, Iulia and Hu, Hexiang and Liu, Fangyu and Eisenschlos, Julian and Khandelwal, Urvashi and Shaw, Peter and Chang, Ming-Wei and Toutanova, Kristina},
|
| 158 |
+
|
| 159 |
+
keywords = {Computation and Language (cs.CL), Computer Vision and Pattern Recognition (cs.CV), FOS: Computer and information sciences, FOS: Computer and information sciences},
|
| 160 |
+
|
| 161 |
+
title = {Pix2Struct: Screenshot Parsing as Pretraining for Visual Language Understanding},
|
| 162 |
+
|
| 163 |
+
publisher = {arXiv},
|
| 164 |
+
|
| 165 |
+
year = {2022},
|
| 166 |
+
|
| 167 |
+
copyright = {Creative Commons Attribution 4.0 International}
|
| 168 |
+
}
|
| 169 |
+
```
|