Varun Sakunia

Varun-08

AI & ML interests

Python, Machine Learning, Deep Learning, Computer Vision

Recent Activity

upvoted an article about 1 month ago

Open-R1: a fully open reproduction of DeepSeek-R1

liked a model about 2 months ago

deepseek-ai/Janus-Pro-7B

liked a model about 2 months ago

HKUSTAudio/Llasa-3B

View all activity

Organizations

None yet

Varun-08's activity

upvoted an article about 1 month ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

Jan 28

• 803

liked 3 models about 2 months ago

upvoted an article about 2 months ago

Article

Timm ❤️ Transformers: Use any timm model with transformers

Jan 16

• 44

liked a model about 2 months ago

jxm/cde-small-v2

Feature Extraction • Updated Feb 3 • 4.49k • 77

upvoted a paper about 2 months ago

Omni-RGPT: Unifying Image and Video Region-level Understanding via Token Marks

Paper • 2501.08326 • Published Jan 14 • 32

liked a model about 2 months ago

hexgrad/Kokoro-82M

Text-to-Speech • Updated 10 days ago • 1.58M • 3.66k

upvoted 2 papers 2 months ago

Dispider: Enabling Video LLMs with Active Real-Time Interaction via Disentangled Perception, Decision, and Reaction

Paper • 2501.03218 • Published Jan 6 • 36

Task Preference Optimization: Improving Multimodal Large Language Models with Vision Task Alignment

Paper • 2412.19326 • Published Dec 26, 2024 • 18

upvoted a paper 3 months ago

Switti: Designing Scale-Wise Transformers for Text-to-Image Synthesis

Paper • 2412.01819 • Published Dec 2, 2024 • 35

liked a Space 3 months ago

260

Jupyter Agent

🏃

Create and run Jupyter notebooks interactively

upvoted a collection 3 months ago

ModernBERT

Collection

Bringing BERT into modernity via both architecture changes and scaling • 3 items • Updated Dec 19, 2024 • 141

liked 3 models 3 months ago

google/siglip-so400m-patch14-384

Zero-Shot Image Classification • Updated Sep 26, 2024 • 10.5M • • 496

matteogeniaccio/phi-4

Updated Jan 10 • 140 • 187

meta-llama/Llama-3.3-70B-Instruct

Text Generation • Updated Dec 21, 2024 • 855k • • 2.13k

upvoted a collection 3 months ago

PaliGemma 2 Release

Collection

Vision-Language Models available in multiple 3B, 10B and 28B variants. • 32 items • Updated 2 days ago • 145

upvoted a collection 4 months ago

Models for dataset curation

Collection

9 items • Updated Dec 5, 2024 • 17

liked a Space 4 months ago

365

Qwen2.5 Turbo 1M Demo

💻

Upload documents for Q&A

liked a model 4 months ago

NexaAIDev/OmniVLM-968M

Updated Dec 17, 2024 • 1.39k • 513