Xiangzhong Luo
stepbuystep
AI & ML interests
None yet
Recent Activity
upvoted a paper about 1 month ago
On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification
upvoted a paper about 2 months ago
LayerCake: Token-Aware Contrastive Decoding within Large Language Model Layers
upvoted a paper 6 months ago
LMM-R1: Empowering 3B LMMs with Strong Reasoning Abilities Through Two-Stage Rule-Based RL
Organizations
None yet