WritingBench: A Comprehensive Benchmark for Generative Writing Paper • 2503.05244 • Published 5 days ago • 14
NesTools: A Dataset for Evaluating Nested Tool Learning Abilities of Large Language Models Paper • 2410.11805 • Published Oct 15, 2024 • 13