arxiv:2501.00192
Ligong Han
ligongh
AI & ML interests
Generative Models
Recent Activity
upvoted
a
paper
3 days ago
M3-Bench: Multi-Modal, Multi-Hop, Multi-Threaded Tool-Using MLLM Agent Benchmark
liked
a model
about 1 month ago
OpenTO/NFAE
upvoted
a
paper
about 2 months ago
EPO: Entropy-regularized Policy Optimization for LLM Agents
Reinforcement Learning