Matthieu Zimmer
MatthieuZ
AI & ML interests
None yet
Recent Activity
authored
a paper
about 1 month ago
Tree-OPO: Off-policy Monte Carlo Tree-Guided Advantage Optimization for
Multistep Reasoning
upvoted
a
paper
about 1 month ago
Tree-OPO: Off-policy Monte Carlo Tree-Guided Advantage Optimization for
Multistep Reasoning
authored
a paper
about 1 month ago
Pangu-Agent: A Fine-Tunable Generalist Agent with Structured Reasoning