Ndamulelo Nemakhavhani committed
Commit 51774af · 1 Parent(s): c624f82
Update README.md
README.md CHANGED
@@ -15,7 +15,7 @@ tags:
 - tshivenda
 ---
 
-# Zabantu -
+# Zabantu - Exploring Multilingual Language Model training for South African Bantu Languages
 
 > Zabantu( "Za" for South Africa, "bantu" for Bantu languages) is a collection of masked language models that have been trained from scratch using a compact dataset comprising various subsets of Bantu languages spoken in South Africa. These models are inspired by the work done on AfriBERTa, which demonstrated the effectiveness of training on XLM-R architecture using a smaller dataset. The focus of this work was to use LLMs to advance NLP applications in Tshivenda and also to serve as a benchmark for future works covering Bantu languages.
 