8 28 76

Jaykumaran R

Jaykumaran17

Jaykumaran

AI & ML interests

None yet

Recent Activity

upvoted a paper about 1 month ago

Unified Vision-Language-Action Model

upvoted a paper about 1 month ago

GenRecal: Generation after Recalibration from Large to Small Vision-Language Models

liked a model about 1 month ago

patrickjohncyh/fashion-clip

View all activity

Organizations

upvoted 2 papers about 1 month ago

Unified Vision-Language-Action Model

Paper • 2506.19850 • Published Jun 24 • 27

GenRecal: Generation after Recalibration from Large to Small Vision-Language Models

Paper • 2506.15681 • Published Jun 18 • 39

liked a model about 1 month ago

patrickjohncyh/fashion-clip

Zero-Shot Image Classification • 0.2B • Updated Sep 17, 2024 • 2.89M • 235

liked a dataset about 1 month ago

RenzKa/simlingo

Updated Jun 17 • 2.8k • 7

liked a model about 1 month ago

nanonets/Nanonets-OCR-s

Image-Text-to-Text • 4B • Updated Jun 20 • 170k • 1.44k

liked a Space about 1 month ago

165

DocScope-R1

📰

cosmos reason1 / docscopeocr / visionocr / captioner relaxed

upvoted a collection about 1 month ago

Qwen2.5-VL

Collection

Vision-language model series based on Qwen2.5 • 11 items • Updated 7 days ago • 513

liked a model about 2 months ago

Qwen/Qwen2.5-VL-7B-Instruct

Image-Text-to-Text • 8B • Updated Apr 6 • 5.7M • • 1.08k

New activity in nvidia/Cosmos-Reason1-7B about 2 months ago

Hi from the NVIDIA PM team

❤️ 2

#2 opened about 2 months ago by

jpenningNVIDIA

upvoted 2 articles about 2 months ago

Article

Post-Training Isaac GR00T N1.5 for LeRobot SO-101 Arm

and 4 others •

Jun 11

• 71

Article

Introducing Training Cluster as a Service - a new collaboration with NVIDIA

and 2 others •

Jun 11

• 24

upvoted a collection about 2 months ago

Vision Language Models Papers 🖼️💬📝

Collection

Papers about vision-language models, most important ones are on top of the list. • 27 items • Updated Apr 30, 2024 • 38

liked 2 models about 2 months ago

unsloth/Cosmos-Reason1-7B-bnb-4bit

Image-to-Text • 5B • Updated May 24 • 10 • 1

Efficient-Large-Model/NVILA-8B

Text Generation • Updated Jan 6 • 11.4k • 6

upvoted a paper about 2 months ago

SmolVLA: A Vision-Language-Action Model for Affordable and Efficient Robotics

Paper • 2506.01844 • Published Jun 2 • 119

upvoted a collection about 2 months ago

SmolVLA

Collection

Small, efficient and light-weight VLAs pretrained on community datasets • 1 item • Updated Jun 1 • 27

liked a model about 2 months ago

HuggingFaceTB/SmolLM2-1.7B-Instruct

Text Generation • 2B • Updated Apr 21 • 55.7k • 660

upvoted a paper about 2 months ago

Mobile ALOHA: Learning Bimanual Mobile Manipulation with Low-Cost Whole-Body Teleoperation

Paper • 2401.02117 • Published Jan 4, 2024 • 34

upvoted an article about 2 months ago

Article

SmolVLA: Efficient Vision-Language-Action Model trained on Lerobot Community Data

and 8 others •

Jun 3

• 211

liked a model about 2 months ago

nvidia/Cosmos-Reason1-7B

Image-to-Text • 8B • Updated Jun 11 • 208k • 111

Jaykumaran R

AI & ML interests

Recent Activity

Organizations

Jaykumaran17's activity

DocScope-R1

Hi from the NVIDIA PM team

Post-Training Isaac GR00T N1.5 for LeRobot SO-101 Arm

Introducing Training Cluster as a Service - a new collaboration with NVIDIA

SmolVLA: Efficient Vision-Language-Action Model trained on Lerobot Community Data