MLGym: A New Framework and Benchmark for Advancing AI Research Agents Paper • 2502.14499 • Published 21 days ago • 179
Source2Synth: Synthetic Data Generation and Curation Grounded in Real Data Sources Paper • 2409.08239 • Published Sep 12, 2024 • 20
Teaching Large Language Models to Reason with Reinforcement Learning Paper • 2403.04642 • Published Mar 7, 2024 • 48
Rainbow Teaming: Open-Ended Generation of Diverse Adversarial Prompts Paper • 2402.16822 • Published Feb 26, 2024 • 18
Chain-of-Verification Reduces Hallucination in Large Language Models Paper • 2309.11495 • Published Sep 20, 2023 • 38
Challenges and Applications of Large Language Models Paper • 2307.10169 • Published Jul 19, 2023 • 48