Hierarchical Autoregressive Transformers: Combining Byte- and Word-Level Processing for Robust, Adaptable Language Models Paper • 2501.10322 • Published 21 days ago • 1 • 2
Towards Best Practices for Open Datasets for LLM Training Paper • 2501.08365 • Published 24 days ago • 53 • 3
AfriHate: A Multilingual Collection of Hate Speech and Abusive Language Datasets for African Languages Paper • 2501.08284 • Published 24 days ago • 6 • 2
Building Foundations for Natural Language Processing of Historical Turkish: Resources and Models Paper • 2501.04828 • Published 30 days ago • 11 • 3
Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference Paper • 2412.13663 • Published Dec 18, 2024 • 126 • 10
OpenNER 1.0: Standardized Open-Access Named Entity Recognition Datasets in 50+ Languages Paper • 2412.09587 • Published Dec 12, 2024 • 3 • 2