sophia peng
sophiapeng
ยท
AI & ML interests
None yet
Recent Activity
liked
a model
about 2 months ago
lixiaoxi45/WebThinker-QwQ-32B
upvoted
a
paper
about 2 months ago
WebSailor: Navigating Super-human Reasoning for Web Agent
upvoted
a
paper
6 months ago
Harnessing Negative Signals: Reinforcement Distillation from Teacher
Data for LLM Reasoning