long
kevinlong
AI & ML interests
None yet
Recent Activity
upvoted a paper 3 days ago
A Survey of Reinforcement Learning for Large Reasoning Models
commented on a paper about 2 months ago
Group Sequence Policy Optimization
upvoted a paper about 2 months ago
RiemannLoRA: A Unified Riemannian Framework for Ambiguity-Free LoRA Optimization
Organizations
None yet