Kaiyan Zhang
iseesaw
AI & ML interests
Large Reasoning Models, Reinforcement Learning, Agent
Recent Activity
authored a paper about 2 months ago
FlowRL: Matching Reward Distributions for LLM Reasoning
upvoted a paper about 2 months ago
FlowRL: Matching Reward Distributions for LLM Reasoning
upvoted a collection about 2 months ago
DeepSeek-V3.2