Sirui Zhang
zsr200901
AI & ML interests
None yet
Recent Activity
upvoted a paper about 20 hours ago
R-4B: Incentivizing General-Purpose Auto-Thinking Capability in MLLMs via Bi-Mode Annealing and Reinforce Learning
upvoted a paper 28 days ago
The Promise of RL for Autoregressive Image Editing
upvoted a paper 28 days ago
Is Chain-of-Thought Reasoning of LLMs a Mirage? A Data Distribution Lens