RoomTour3D: Geometry-Aware Video-Instruction Tuning for Embodied Navigation

Model Description

This contains pre-trained checkpoints and finetuned checkpoints for our RoomTour3D-NaviLLM. Please follow the instructions and license here to use these models.


Citation

If you find our work useful for your research, please consider citing the paper

@article{han2024roomtour3d,
      title={RoomTour3D: Geometry-Aware Video-Instruction Tuning for Embodied Navigation}, 
      author={Mingfei Han and Liang Ma and Kamila Zhumakhanova and Ekaterina Radionova and Jingyi Zhang and Xiaojun Chang and Xiaodan Liang and Ivan Laptev},
      journal={arXiv preprint arXiv:2412.08591},
      year={2024}
}
Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model is not currently available via any of the supported third-party Inference Providers, and HF Inference API was unable to determine this model's library.

Model tree for roomtour3d/roomtour3d-navillm-models

Finetuned
(1)
this model

Collection including roomtour3d/roomtour3d-navillm-models