Zhijiang's picture

9

Zhijiang

Zeee

·

AI & ML interests

Natural Language Processing, Machine Learning

Recent Activity

upvoted a paper 4 days ago

CODA: Coordinating the Cerebrum and Cerebellum for a Dual-Brain Computer Use Agent with Decoupled Reinforcement Learning

upvoted a paper 8 days ago

Beyond Pass@1: Self-Play with Variational Problem Synthesis Sustains RLVR

upvoted a paper 12 days ago

DuPO: Enabling Reliable LLM Self-Verification via Dual Preference Optimization

View all activity

Organizations

upvoted a paper 4 days ago

CODA: Coordinating the Cerebrum and Cerebellum for a Dual-Brain Computer Use Agent with Decoupled Reinforcement Learning

Paper • 2508.20096 • Published 6 days ago • 35

upvoted a paper 8 days ago

Beyond Pass@1: Self-Play with Variational Problem Synthesis Sustains RLVR

Paper • 2508.14029 • Published 14 days ago • 116

upvoted a paper 12 days ago

DuPO: Enabling Reliable LLM Self-Verification via Dual Preference Optimization

Paper • 2508.14460 • Published 13 days ago • 79

upvoted 3 papers 3 months ago

Through the Valley: Path to Effective Long CoT Training for Small Language Models

Paper • 2506.07712 • Published Jun 9 • 18

Lingshu: A Generalist Foundation Model for Unified Multimodal Medical Understanding and Reasoning

Paper • 2506.07044 • Published Jun 8 • 112

ScienceBoard: Evaluating Multimodal Autonomous Agents in Realistic Scientific Workflows

Paper • 2505.19897 • Published May 26 • 104

upvoted a paper 5 months ago

Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems

Paper • 2504.01990 • Published Mar 31 • 302

upvoted a paper 6 months ago

FINEREASON: Evaluating and Improving LLMs' Deliberate Reasoning through Reflective Puzzle Solving

Paper • 2502.20238 • Published Feb 27 • 24

upvoted a paper 11 months ago

Personalized Visual Instruction Tuning

Paper • 2410.07113 • Published Oct 9, 2024 • 71