QE4PE: Word-level Quality Estimation for Human Post-Editing Paper • 2503.03044 • Published 9 days ago • 6
view article Article What We Learned About LLM/VLMs in Healthcare AI Evaluation: By shanchen • Nov 8, 2024 • 12
A Primer on the Inner Workings of Transformer-based Language Models Paper • 2405.00208 • Published Apr 30, 2024 • 10
🔍 Interpretability & Analysis of LMs Collection Outstanding research in LM interpretability and evaluation, summarized • 105 items • Updated about 16 hours ago • 97