Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
kurogane 's Collections
VLM/Robotics

VLM/Robotics

updated May 28
Upvote
-

  • Chat-UniVi: Unified Visual Representation Empowers Large Language Models with Image and Video Understanding

    Paper • 2311.08046 • Published Nov 14, 2023 • 2

  • nvidia/GR00T-N1-2B

    Robotics • 2B • Updated Jul 8 • 1.72k • 329

  • nvidia/Eagle2-1B

    Image-Text-to-Text • 1B • Updated Apr 27 • 4.45k • 24

  • nvidia/PhysicalAI-Robotics-GR00T-X-Embodiment-Sim

    Updated Jul 11 • 360k • 152

  • lerobot/pi0

    Robotics • 4B • Updated Mar 6 • 14.8k • 292

  • facebook/vc1-base

    Robotics • Updated Apr 7, 2023 • 21 • 13

  • EfficientViM: Efficient Vision Mamba with Hidden State Mixer based State Space Duality

    Paper • 2411.15241 • Published Nov 22, 2024 • 7

  • MobileMamba: Lightweight Multi-Receptive Visual Mamba Network

    Paper • 2411.15941 • Published Nov 24, 2024 • 2

  • timm/shvit_s4.in1k

    Image Classification • 0.0B • Updated May 26 • 59

  • Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model

    Paper • 2401.09417 • Published Jan 17, 2024 • 63

  • Theia: Distilling Diverse Vision Foundation Models for Robot Learning

    Paper • 2407.20179 • Published Jul 29, 2024 • 48
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs