Hearing to Translate: The Effectiveness of Speech Modality Integration into LLMs Paper • 2512.16378 • Published 19 days ago • 7
Translation Difficulty Estimators Collection This collection hosts the two Translation Difficulty estimators studied in https://arxiv.org/abs/2508.10175. • 3 items • Updated Sep 17, 2025 • 3
Can Large Language Models Capture Human Annotator Disagreements? Paper • 2506.19467 • Published Jun 24, 2025 • 18
MT Sentinel Metrics Collection Machine Translation (MT) metrics designed explicitly to scrutinize the MT meta-evaluation process’s accuracy, robustness, and fairness. • 7 items • Updated Dec 4, 2024 • 7
✍️ QE4PE & GroTE Collection Materials for "QE4PE: Word-level Quality Estimation for Human Post-Editing" • 3 items • Updated Mar 6, 2025 • 1
QE4PE: Word-level Quality Estimation for Human Post-Editing Paper • 2503.03044 • Published Mar 4, 2025 • 6
COMET-early-exit Collection Models introduced in the paper Early-Exit and Instant Confidence Translation Quality Estimation https://github.com/zouharvi/COMET-early-exit • 4 items • Updated Feb 21, 2025 • 2
We Can't Understand AI Using our Existing Vocabulary Paper • 2502.07586 • Published Feb 11, 2025 • 10
Early-Exit and Instant Confidence Translation Quality Estimation Paper • 2502.14429 • Published Feb 20, 2025 • 4
PreCOMET Collection COMET-like models for MT evaluation that predict some scores given only the source segment. https://github.com/zouharvi/subset2evaluate • 8 items • Updated Feb 25, 2025 • 2
How to Select Datapoints for Efficient Human Evaluation of NLG Models? Paper • 2501.18251 • Published Jan 30, 2025 • 2