Kaiyan Zhang's picture

Kaiyan Zhang

iseesaw

·

https://iseesaw.github.io/

AI & ML interests

Large Reasoning Models, Reinforcement Learning, Agent

Recent Activity

authored a paper about 2 months ago

FlowRL: Matching Reward Distributions for LLM Reasoning

upvoted a paper about 2 months ago

FlowRL: Matching Reward Distributions for LLM Reasoning

upvoted a collection about 2 months ago

View all activity

Organizations

iseesaw 's models

None public yet