Rethinking Multilingual Continual Pretraining: Data Mixing for Adapting LLMs Across Languages and Resources
Zihao Li
Zihao-Li



·
AI & ML interests
Multilingual NLP
Recent Activity
liked
a model
2 days ago
tencent/Hunyuan-MT-7B
authored
a paper
6 days ago
EMMA-500: Enhancing Massively Multilingual Adaptation of Large Language
Models
authored
a paper
6 days ago
GlotEval: A Test Suite for Massively Multilingual Evaluation of Large
Language Models