Trained models as described in the paper "ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning"
Agent-RL
agentrl
AI & ML interests
None yet
Recent Activity
liked
a model
25 days ago
baichuan-inc/Baichuan-M2-32B
upvoted
an
article
3 months ago
The 4 Things Qwen-3's Chat Template Teaches Us
Organizations
None yet