-
Technical Report: Large Language Models can Strategically Deceive their Users when Put Under Pressure
Paper • 2311.07590 • Published • 17 -
Exponentially Faster Language Modelling
Paper • 2311.10770 • Published • 119 -
AppAgent: Multimodal Agents as Smartphone Users
Paper • 2312.13771 • Published • 55 -
DocLLM: A layout-aware generative language model for multimodal document understanding
Paper • 2401.00908 • Published • 189
Vineet Sharma
vineetsharma
AI & ML interests
Generative AI, Computer Vision, Natural Language Processing, Reinforcement Learning
Recent Activity
liked
a model
1 day ago
microsoft/VibeVoice-Large
upvoted
a
collection
4 days ago
FastVLM
upvoted
an
article
6 days ago
SmolVLA: Efficient Vision-Language-Action Model trained on Lerobot Community Data