Is ChatGPT a Financial Expert? Evaluating Language Models on Financial Natural Language Processing Paper • 2310.12664 • Published Oct 19, 2023
Can LLMs Augment Low-Resource Reading Comprehension Datasets? Opportunities and Challenges Paper • 2309.12426 • Published Sep 21, 2023
FinSearchComp: Towards a Realistic, Expert-Level Evaluation of Financial Search and Reasoning Paper • 2509.13160 • Published Sep 16, 2025 • 29
FinAgentBench: A Benchmark Dataset for Agentic Retrieval in Financial Question Answering Paper • 2508.14052 • Published Aug 7, 2025
FinGAIA: A Chinese Benchmark for AI Agents in Real-World Financial Domain Paper • 2507.17186 • Published Jul 23, 2025 • 1