Drive as You Speak: Enabling Human-Like Interaction with Large Language Models in Autonomous Vehicles Paper • 2309.10228 • Published Sep 19, 2023
On-Board Vision-Language Models for Personalized Autonomous Vehicle Motion Control: System Design and Real-World Validation Paper • 2411.11913 • Published Nov 17, 2024
MedSAM3: Delving into Segment Anything with Medical Concepts Paper • 2511.19046 • Published Nov 24 • 49
MedSAM3: Delving into Segment Anything with Medical Concepts Paper • 2511.19046 • Published Nov 24 • 49
Qwen/Qwen3-VL-235B-A22B-Thinking Image-Text-to-Text • 236B • Updated Nov 26 • 35.6k • • 357
SocialGesture: Delving into Multi-person Gesture Understanding Paper • 2504.02244 • Published Apr 3
Fine-Grained Preference Optimization Improves Spatial Reasoning in VLMs Paper • 2506.21656 • Published Jun 26 • 15
Fine-Grained Preference Optimization Improves Spatial Reasoning in VLMs Paper • 2506.21656 • Published Jun 26 • 15