--- license: gpl --- This is the official pre-trained model of the paper ''VIRT: Vision Instructed Robotic Transformer for Manipulation Learning''. The model is pre-trained using the robotic imagery pre-training technique on the Droid dataset. If you find this model useful, please cite: ```BibTeX @article{li2024virt, title={VIRT: Vision Instructed Robotic Transformer for Manipulation Learning}, author={xxx}, journal={xxx}, year={2024} } ```