Beyond Memorization: A Multi-Modal Ordinal Regression Benchmark to Expose Popularity Bias in Vision-Language Models • Paper • 2512.21337 • Published 8 days ago • 26
SCOPE: Prompt Evolution for Enhancing Agent Effectiveness • Paper • 2512.15374 • Published 15 days ago • 5