Taufiq Dwi Purnomo

taufiqdp

https://taufiqdp.com

AI & ML interests

SLM, VLM

Recent Activity

updated a model 2 days ago

taufiqdp/convnext_tiny-arutala

published a model 2 days ago

taufiqdp/convnext_tiny-arutala

taufiqdp's activity

upvoted a paper 2 days ago

Baichuan-Omni-1.5 Technical Report

Paper • 2501.15368 • Published 5 days ago • 45

upvoted an article 2 days ago

Welcome to Inference Providers on the Hub 🔥

Article • 3 days ago • 172

upvoted a paper 3 days ago

Qwen2.5-1M Technical Report

Paper • 2501.15383 • Published 5 days ago • 41

upvoted a collection 3 days ago

Qwen2.5-VL

Collection • Vision-language model series based on Qwen2.5 • 3 items • Updated 4 days ago • 289

upvoted an article 5 days ago

We now support VLMs in smolagents!

Article • 7 days ago • 65

upvoted a paper 7 days ago

Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models

Paper • 2501.13629 • Published 8 days ago • 40

upvoted a collection 8 days ago

SmolVLM 256M & 500M

Collection • Collection for models & demos for even smoller SmolVLM release • 12 items • Updated 8 days ago • 62

upvoted 3 papers 8 days ago

Autonomy-of-Experts Models

Paper • 2501.13074 • Published 8 days ago • 38

Kimi k1.5: Scaling Reinforcement Learning with LLMs

Paper • 2501.12599 • Published 9 days ago • 77

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published 9 days ago • 274

upvoted 2 papers 9 days ago

UI-TARS: Pioneering Automated GUI Interaction with Native Agents

Paper • 2501.12326 • Published 9 days ago • 47

Reasoning Language Models: A Blueprint

Paper • 2501.11223 • Published 11 days ago • 30

upvoted 2 papers 10 days ago

PaSa: An LLM Agent for Comprehensive Academic Paper Search

Paper • 2501.10120 • Published 14 days ago • 41

Evolving Deeper LLM Thinking

Paper • 2501.09891 • Published 14 days ago • 101

upvoted 3 papers 14 days ago

upvoted 2 papers 16 days ago

MiniMax-01: Scaling Foundation Models with Lightning Attention

Paper • 2501.08313 • Published 16 days ago • 271

Tensor Product Attention Is All You Need

Paper • 2501.06425 • Published 20 days ago • 79

upvoted a paper 21 days ago

The GAN is dead; long live the GAN! A Modern GAN Baseline

Paper • 2501.05441 • Published 21 days ago • 87