X-RiSAWOZ: High-Quality End-to-End Multilingual Dialogue Datasets and Few-shot Agents Paper • 2306.17674 • Published Jun 30, 2023
Representation Surgery: Theory and Practice of Affine Steering Paper • 2402.09631 • Published Feb 15, 2024
The WMDP Benchmark: Measuring and Reducing Malicious Use With Unlearning Paper • 2403.03218 • Published Mar 5, 2024
Wu's Method can Boost Symbolic AI to Rival Silver Medalists and AlphaGeometry to Outperform Gold Medalists at IMO Geometry Paper • 2404.06405 • Published Apr 9, 2024
Benchmarks Underestimate the Readiness of Multi-lingual Dialogue Agents Paper • 2405.17840 • Published May 28, 2024
Counter Turing Test ($CT^2$): Investigating AI-Generated Text Detection for Hindi -- Ranking LLMs based on Hindi AI Detectability Index ($ADI_{hi}$) Paper • 2407.15694 • Published Jul 22, 2024
Exploring the Abilities of Large Language Models to Solve Proportional Analogies via Knowledge-Enhanced Prompting Paper • 2412.00869 • Published Dec 1, 2024
Rethinking Thinking Tokens: Understanding Why They Underperform in Practice Paper • 2411.11371 • Published Nov 18, 2024
Great Models Think Alike and this Undermines AI Oversight Paper • 2502.04313 • Published 9 days ago