Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
mehdie
/
BPE-tokenizer
like
0
Follow
MEHDIE
4
Model card
Files
Files and versions
Community
fac3fd5
BPE-tokenizer
/
README.md
morten-j
Create README.md
fac3fd5
verified
11 months ago
preview
code
|
raw
Copy download link
history
blame
112 Bytes
BPE based tokenizer used for the MEHDIE project and the training of a bilingual BERT model.
Vocab size of 52000.