VerlTool: Towards Holistic Agentic Reinforcement Learning with Tool Use Paper • 2509.01055 • Published 3 days ago • 44
Autoregressive Adversarial Post-Training for Real-Time Interactive Video Generation Paper • 2506.09350 • Published Jun 11 • 48
Chouoftears/Agent2Agent-Negotiation-in-Consumer-Setting-Dataset Viewer • Updated Jun 11 • 150 • 10 • 1
AutoTrust: Benchmarking Trustworthiness in Large Vision Language Models for Autonomous Driving Paper • 2412.15206 • Published Dec 19, 2024
The Automated but Risky Game: Modeling Agent-to-Agent Negotiations and Transactions in Consumer Markets Paper • 2506.00073 • Published May 29 • 2
DINO-R1: Incentivizing Reasoning Capability in Vision Foundation Models Paper • 2505.24025 • Published May 29 • 27
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper • 2501.12948 • Published Jan 22 • 418
JailDAM: Jailbreak Detection with Adaptive Memory for Vision-Language Model Paper • 2504.03770 • Published Apr 3 • 3
Fraud-R1: A Multi-Round Benchmark for Assessing the Robustness of LLM Against Augmented Fraud and Phishing Inducements Paper • 2502.12904 • Published Feb 18 • 2