InternLMPrivate

Team

yuhangzang

authored 3 papers about 1 month ago

ARM-Thinker: Reinforcing Multimodal Generative Reward Models with Agentic Tool Use and Visual Reasoning

Paper • 2512.05111 • Published Dec 4, 2025 • 47

LSVOS 2025 Challenge Report: Recent Advances in Complex Video Object Segmentation

Paper • 2510.11063 • Published Oct 13, 2025 • 1

Think Visually, Reason Textually: Vision-Language Synergy in ARC

Paper • 2511.15703 • Published Nov 19, 2025 • 8

yuhangzang

authored 2 papers 2 months ago

Spatial-SSRL: Enhancing Spatial Understanding via Self-Supervised Reinforcement Learning

Paper • 2510.27606 • Published Oct 31, 2025 • 28

STAR-Bench: Probing Deep Spatio-Temporal Reasoning as Audio 4D Intelligence

Paper • 2510.24693 • Published Oct 28, 2025 • 18

yuhangzang

authored a paper 3 months ago

UniGenBench++: A Unified Semantic Evaluation Benchmark for Text-to-Image Generation

Paper • 2510.18701 • Published Oct 21, 2025 • 66

W-Wu

authored 5 papers 3 months ago

Intern-S1: A Scientific Multimodal Foundation Model

Paper • 2508.15763 • Published Aug 21, 2025 • 259

UniFlow-Audio: Unified Flow Matching for Audio Generation from Omni-Modalities

Paper • 2509.24391 • Published Sep 29, 2025

BrainOmni: A Brain Foundation Model for Unified EEG and MEG Signals

Paper • 2505.18185 • Published May 18, 2025 • 1

SciTS: Scientific Time Series Understanding and Generation with LLMs

Paper • 2510.03255 • Published Sep 26, 2025

PicoAudio2: Temporal Controllable Text-to-Audio Generation with Natural Language Description

Paper • 2509.00683 • Published Aug 31, 2025

yuhangzang

authored 5 papers 3 months ago

$\text{G}^2$RPO: Granular GRPO for Precise Reward in Flow Models

Paper • 2510.01982 • Published Oct 2, 2025 • 5

MinerU2.5: A Decoupled Vision-Language Model for Efficient High-Resolution Document Parsing

Paper • 2509.22186 • Published Sep 26, 2025 • 139

SPARK: Synergistic Policy And Reward Co-Evolving Framework

Paper • 2509.22624 • Published Sep 26, 2025 • 17

CapRL: Stimulating Dense Image Caption Capabilities via Reinforcement Learning

Paper • 2509.22647 • Published Sep 26, 2025 • 32

SIM-CoT: Supervised Implicit Chain-of-Thought

Paper • 2509.20317 • Published Sep 24, 2025 • 41

yuhangzang

authored 2 papers 4 months ago

Pref-GRPO: Pairwise Preference Reward-based GRPO for Stable Text-to-Image Reinforcement Learning

Paper • 2508.20751 • Published Aug 28, 2025 • 89

CODA: Coordinating the Cerebrum and Cerebellum for a Dual-Brain Computer Use Agent with Decoupled Reinforcement Learning

Paper • 2508.20096 • Published Aug 27, 2025 • 36

yuhangzang

authored 2 papers 5 months ago

SEAgent: Self-Evolving Computer Use Agent with Autonomous Learning from Experience

Paper • 2508.04700 • Published Aug 6, 2025 • 52

Beyond Fixed: Variable-Length Denoising for Diffusion Large Language Models

Paper • 2508.00819 • Published Aug 1, 2025 • 62