Building on HF

15 726 285

Taufiq Dwi Purnomo

taufiqdp

https://taufiqdp.com

AI & ML interests

SLM, VLM

Recent Activity

upvoted a paper 6 days ago

mHC: Manifold-Constrained Hyper-Connections

liked a model 6 days ago

MiniMaxAI/MiniMax-M2.1

upvoted a paper 10 days ago

Qwen3-VL Technical Report

View all activity

Organizations

upvoted a paper 6 days ago

mHC: Manifold-Constrained Hyper-Connections

Paper • 2512.24880 • Published 8 days ago • 227

liked a model 6 days ago

MiniMaxAI/MiniMax-M2.1

Text Generation • 229B • Updated 12 days ago • 200k • • 957

upvoted a paper 10 days ago

Qwen3-VL Technical Report

Paper • 2511.21631 • Published Nov 26, 2025 • 149

liked a model 20 days ago

google/functiongemma-270m-it

Text Generation • 0.3B • Updated 21 days ago • 58.9k • 774

upvoted an article 20 days ago

Article

Tokenization in Transformers v5: Simpler, Clearer, and More Modular

22 days ago

•

107

liked a model 22 days ago

apple/Sharp

Image-to-3D • Updated 21 days ago • 5.63k • 318

upvoted a collection 23 days ago

NVIDIA Nemotron v3

Collection

Open, Production-ready Enterprise Models • 6 items • Updated 8 days ago • 115

upvoted an article 24 days ago

Article

New in llama.cpp: Model Management

28 days ago

•

105

upvoted a paper about 1 month ago

DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models

Paper • 2512.02556 • Published Dec 2, 2025 • 245

liked a model about 1 month ago

deepseek-ai/DeepSeek-V3.2

Text Generation • 685B • Updated Dec 1, 2025 • 111k • • 1.08k

upvoted 2 collections about 1 month ago

Ministral 3

Collection

A collection of edge models, with Base, Instruct and Reasoning variants, in 3 different sizes: 3B, 8B and 14B. All with vision capabilities. • 9 items • Updated Dec 2, 2025 • 139

Mistral Large 3

Collection

A state-of-the-art, open-weight, general-purpose multimodal model with a granular Mixture-of-Experts architecture. • 4 items • Updated Dec 2, 2025 • 82

upvoted an article about 1 month ago

Article

Transformers v5: Simple model definitions powering the AI ecosystem

Dec 1, 2025

•

268

upvoted a paper about 2 months ago

SAM 3: Segment Anything with Concepts

Paper • 2511.16719 • Published Nov 20, 2025 • 126

liked a model about 2 months ago

facebook/sam3

Mask Generation • 0.9B • Updated Nov 20, 2025 • 1.46M • 1.32k

upvoted 2 papers about 2 months ago

DoPE: Denoising Rotary Position Embedding

Paper • 2511.09146 • Published Nov 12, 2025 • 95

Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B

Paper • 2511.06221 • Published Nov 9, 2025 • 132

liked a model about 2 months ago

moonshotai/Kimi-K2-Thinking

Text Generation • Updated Nov 8, 2025 • 290k • • 1.6k

upvoted 2 papers 2 months ago

INT v.s. FP: A Comprehensive Study of Fine-Grained Low-bit Quantization Formats

Paper • 2510.25602 • Published Oct 29, 2025 • 77

Every Activation Boosted: Scaling General Reasoner to 1 Trillion Open Language Foundation

Paper • 2510.22115 • Published Oct 25, 2025 • 83

Taufiq Dwi Purnomo

AI & ML interests

Recent Activity

Organizations

taufiqdp's activity

Tokenization in Transformers v5: Simpler, Clearer, and More Modular

New in llama.cpp: Model Management

Transformers v5: Simple model definitions powering the AI ecosystem