Zuhao Yang's picture

Zuhao Yang

mwxely

·

https://mwxely.github.io/

AI & ML interests

Large Multimodal Models

Recent Activity

upvoted a paper 4 days ago

Robust-R1: Degradation-Aware Reasoning for Robust Visual Understanding

upvoted a paper 4 days ago

The Prism Hypothesis: Harmonizing Semantic and Pixel Representations via Unified Autoencoding

upvoted a paper 10 days ago

Agent Learning via Early Experience

View all activity

Organizations

upvoted 2 papers 4 days ago

Robust-R1: Degradation-Aware Reasoning for Robust Visual Understanding

Paper • 2512.17532 • Published 7 days ago • 62

The Prism Hypothesis: Harmonizing Semantic and Pixel Representations via Unified Autoencoding

Paper • 2512.19693 • Published 4 days ago • 60

upvoted 3 papers 10 days ago

Agent Learning via Early Experience

Paper • 2510.08558 • Published Oct 9 • 269

Kandinsky 5.0: A Family of Foundation Models for Image and Video Generation

Paper • 2511.14993 • Published Nov 19 • 226

Thinking with Video: Video Generation as a Promising Multimodal Reasoning Paradigm

Paper • 2511.04570 • Published Nov 6 • 210

upvoted 2 collections 22 days ago

Multimodal Agent

123 items • Updated 3 days ago • 1

AI Paper of the Day

A collection of papers that I think are interesting, one added each day • 551 items • Updated about 11 hours ago • 73

upvoted a collection 25 days ago

LongVT-HF_Daily_Paper

1 item • Updated 26 days ago • 1

upvoted 2 papers 25 days ago

Script: Graph-Structured and Query-Conditioned Semantic Token Pruning for Multimodal Large Language Models

Paper • 2512.01949 • Published 25 days ago • 8

LongVT: Incentivizing "Thinking with Long Videos" via Native Tool Calling

Paper • 2511.20785 • Published Nov 25 • 162

upvoted a collection about 1 month ago

LongVT

8 items • Updated 16 days ago • 8

upvoted a paper about 1 month ago

OpenMMReasoner: Pushing the Frontiers for Multimodal Reasoning with an Open and General Recipe

Paper • 2511.16334 • Published Nov 20 • 91

upvoted a paper 3 months ago

First Try Matters: Revisiting the Role of Reflection in Reasoning Models

Paper • 2510.08308 • Published Oct 9 • 24

upvoted a paper 5 months ago

MiroMind-M1: An Open-Source Advancement in Mathematical Reasoning via Context-Aware Multi-Stage Policy Optimization

Paper • 2507.14683 • Published Jul 19 • 134

upvoted a paper 8 months ago

100 Days After DeepSeek-R1: A Survey on Replication Studies and More Directions for Reasoning Language Models

Paper • 2505.00551 • Published May 1 • 36