小明

xiaoming · xiaominghero

AI & ML interests

nlp

Recent Activity

liked a model 1 day ago

stepfun-ai/Step-Audio-2-mini

liked a model 5 days ago

ByteDance-Seed/Seed-OSS-36B-Base

upvoted a paper 8 days ago

InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency

Organizations

None yet

upvoted a paper 8 days ago

InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency

Paper • 2508.18265 • Published 8 days ago • 171

upvoted a paper 11 days ago

DINOv3

Paper • 2508.10104 • Published 20 days ago • 231

upvoted a collection 12 days ago

Nemotron-Pre-Training-Dataset

7 items • Updated about 10 hours ago • 31

upvoted a paper 19 days ago

NextStep-1: Toward Autoregressive Image Generation with Continuous Tokens at Scale

Paper • 2508.10711 • Published 20 days ago • 141

upvoted an article 20 days ago

SmolLM3: smol, multilingual, long-context reasoner

Article • By and 22 others • Jul 8 • 643

upvoted a collection about 1 month ago

SmolDocling datasets

Datasets used to train SmolDocling • 6 items • Updated Jul 31 • 28

upvoted 3 papers about 1 month ago

Seed-Prover: Deep and Broad Reasoning for Automated Theorem Proving

Paper • 2507.23726 • Published Jul 31 • 112

Falcon-H1: A Family of Hybrid-Head Language Models Redefining Efficiency and Performance

Paper • 2507.22448 • Published Jul 30 • 65

MegaScience: Pushing the Frontiers of Post-Training Datasets for Science Reasoning

Paper • 2507.16812 • Published Jul 22 • 62

upvoted a collection about 2 months ago

Kimi-K2

Moonshot's MoE LLMs with 1 trillion parameters, exceptional on agentic intelligence • 2 items • Updated Jul 12 • 117

upvoted 2 papers 3 months ago

MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention

Paper • 2506.13585 • Published Jun 16 • 263

DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents

Paper • 2506.11763 • Published Jun 13 • 70

upvoted a collection 6 months ago

Document AI

25 items • Updated 17 days ago • 3

upvoted a paper 6 months ago

SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion

Paper • 2503.11576 • Published Mar 14 • 117

upvoted a paper 7 months ago

Step-Video-T2V Technical Report: The Practice, Challenges, and Future of Video Foundation Model

Paper • 2502.10248 • Published Feb 14 • 56

upvoted a paper 8 months ago

HowTo100M: Learning a Text-Video Embedding by Watching Hundred Million Narrated Video Clips

Paper • 1906.03327 • Published Jun 7, 2019 • 1