Vaibhav Singh

veb-101

veb-101

AI & ML interests

None yet

Recent Activity

upvoted an article 25 days ago

Encoding the World's Medical Knowledge into 970K

upvoted an article 2 months ago

Easily Build and Share ROCm Kernels with Hugging Face

liked a model 3 months ago

nvidia/NVIDIA-Nemotron-Nano-12B-v2-VL-BF16

View all activity

Organizations

None yet

upvoted an article 25 days ago

Article

Encoding the World's Medical Knowledge into 970K

about 1 month ago

•

upvoted an article 2 months ago

Article

Easily Build and Share ROCm Kernels with Hugging Face

Nov 17, 2025

•

liked a model 3 months ago

nvidia/NVIDIA-Nemotron-Nano-12B-v2-VL-BF16

Image-Text-to-Text • 13B • Updated Dec 2, 2025 • 74.7k • 71

upvoted an article 3 months ago

Article

LightOnOCR-1B: The Case for End-to-End and Efficient Domain-Specific Vision-Language Models for OCR

Oct 23, 2025

•

upvoted a paper 4 months ago

Efficient Multi-modal Large Language Models via Progressive Consistency Distillation

Paper • 2510.00515 • Published Oct 1, 2025 • 39

liked a model 4 months ago

jinaai/jina-embeddings-v4-vllm-retrieval

Visual Document Retrieval • 4B • Updated Sep 17, 2025 • 9.55k • 32

upvoted a collection 4 months ago

jina-embeddings-v4

Collection

Universal Embeddings for Multimodal Multilingual Retrieval • 10 items • Updated Sep 2, 2025 • 2

upvoted 2 papers 4 months ago

Lost in Embeddings: Information Loss in Vision-Language Models

Paper • 2509.11986 • Published Sep 15, 2025 • 28

Color Me Correctly: Bridging Perceptual Color Spaces and Text Embeddings for Improved Diffusion Generation

Paper • 2509.10058 • Published Sep 12, 2025 • 11

upvoted 3 papers 5 months ago

STream3R: Scalable Sequential 3D Reconstruction with Causal Transformer

Paper • 2508.10893 • Published Aug 14, 2025 • 31

MolmoAct: Action Reasoning Models that can Reason in Space

Paper • 2508.07917 • Published Aug 11, 2025 • 44

Voost: A Unified and Scalable Diffusion Transformer for Bidirectional Virtual Try-On and Try-Off

Paper • 2508.04825 • Published Aug 6, 2025 • 59

liked a model 6 months ago

PhysicsWallahAI/Aryabhata-1.0

Text Generation • 8B • Updated Aug 13, 2025 • 159 • 106

upvoted an article 6 months ago

Article

Efficient MultiModal Data Pipeline

Jul 8, 2025

•

upvoted a paper 7 months ago

GenRecal: Generation after Recalibration from Large to Small Vision-Language Models

Paper • 2506.15681 • Published Jun 18, 2025 • 40

upvoted an article 7 months ago

Article

🪆 Introduction to Matryoshka Embedding Models

Feb 23, 2024

•

191

liked a model 7 months ago

sentence-transformers/all-MiniLM-L6-v2

liked a model 9 months ago

rasbt/llama-3.2-from-scratch

Updated Jun 12, 2025 • 283

updated a model 10 months ago

veb-101/Keras-3-apple-mobilevit

Updated Apr 10, 2025

published a model 10 months ago

veb-101/Keras-3-apple-mobilevit

Updated Apr 10, 2025

Vaibhav Singh

AI & ML interests

Recent Activity

Organizations

veb-101's activity

Encoding the World's Medical Knowledge into 970K

Easily Build and Share ROCm Kernels with Hugging Face

LightOnOCR-1B: The Case for End-to-End and Efficient Domain-Specific Vision-Language Models for OCR

Efficient MultiModal Data Pipeline

🪆 Introduction to Matryoshka Embedding Models