Tang

Pingjie

AI & ML interests

None yet

Recent Activity

upvoted a paper 8 days ago

MetaCLIP 2: A Worldwide Scaling Recipe

upvoted an article about 1 month ago

Open-source DeepResearch – Freeing our search agents

upvoted a paper about 1 month ago

A Survey of Context Engineering for Large Language Models

View all activity

Organizations

None yet

upvoted a paper 8 days ago

MetaCLIP 2: A Worldwide Scaling Recipe

Paper • 2507.22062 • Published Jul 29 • 25

upvoted an article about 1 month ago

Article

Open-source DeepResearch – Freeing our search agents

and 4 others •

Feb 4

• 1.29k

upvoted a paper about 1 month ago

A Survey of Context Engineering for Large Language Models

Paper • 2507.13334 • Published Jul 17 • 248

upvoted a collection 3 months ago

Qwen2.5-Omni

Collection

End-to-End Omni (text, audio, image, video, and natural speech interaction) model based Qwen2.5 • 7 items • Updated Jul 21 • 155

upvoted a paper 4 months ago

Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?

Paper • 2504.13837 • Published Apr 18 • 134

upvoted 2 papers 5 months ago

SmolVLM: Redefining small and efficient multimodal models

Paper • 2504.05299 • Published Apr 7 • 199

I Have Covered All the Bases Here: Interpreting Reasoning Features in Large Language Models via Sparse Autoencoders

Paper • 2503.18878 • Published Mar 24 • 121

upvoted 2 articles 6 months ago

Article

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM

and 3 others •

Mar 12

• 457

Article

SigLIP 2: A better multilingual vision language encoder

and 2 others •

Feb 21

• 179

upvoted a paper 6 months ago

Qwen2.5-VL Technical Report

Paper • 2502.13923 • Published Feb 19 • 203

upvoted a collection 6 months ago

Qwen2.5-VL

Collection

Vision-language model series based on Qwen2.5 • 11 items • Updated Jul 21 • 535

upvoted an article 7 months ago

Article

SmolVLM Grows Smaller – Introducing the 250M & 500M Models!

and 2 others •

Jan 23

• 182

upvoted 2 papers 7 months ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published Feb 4 • 242

Chain-of-Retrieval Augmented Generation

Paper • 2501.14342 • Published Jan 24 • 60

upvoted 2 articles 7 months ago

Article

Open-R1: Update #1

and 7 others •

Feb 2

• 305

Article

Open-R1: a fully open reproduction of DeepSeek-R1

and 2 others •

Jan 28

• 878

upvoted 3 papers 8 months ago

MiniMax-01: Scaling Foundation Models with Lightning Attention

Paper • 2501.08313 • Published Jan 14 • 298

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published Jan 8 • 284

Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters

Paper • 2408.03314 • Published Aug 6, 2024 • 64

upvoted a collection 9 months ago

ModernBERT

Collection

Bringing BERT into modernity via both architecture changes and scaling • 3 items • Updated Dec 19, 2024 • 150

Tang

AI & ML interests

Recent Activity

Organizations

Pingjie's activity

Open-source DeepResearch – Freeing our search agents

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM

SigLIP 2: A better multilingual vision language encoder

SmolVLM Grows Smaller – Introducing the 250M & 500M Models!

Open-R1: Update #1

Open-R1: a fully open reproduction of DeepSeek-R1