zenz-v1-checkpoints / README.md
Miwa-Keita's picture
Update README.md
8dc4420 verified
---
license: cc-by-sa-4.0
language:
- ja
tags:
- japanese input
- kana kanji conversion
---
# zenz-v1 Checkpoints
<!-- Provide a quick summary of what the model is/does. -->
[zenz-v1](https://huggingface.co/Miwa-Keita/zenz-v1) is a language model specialized for kana-kanji conversion tasks based on the GPT-2 architecture. It is intended for use in the neural kana-kanji conversion system "Zenzai."
This repository publishes the checkpoints for zenz-v1.
* 90M parameters
* Character-level + byte-level BPE tokenizer
* High performance in kana-kanji conversion tasks using greedy decoding
## Model Details
### Model Description
<!-- Provide a longer summary of what this model is. -->
The base model used is [ku-nlp/gpt2-small-japanese-char](https://huggingface.co/ku-nlp/gpt2-small-japanese-char) provided under [CC-BY-SA 4.0](https://creativecommons.org/licenses/by-sa/4.0/deed.ja).
This model is provided under [CC-BY-SA 4.0](https://creativecommons.org/licenses/by-sa/4.0/deed.ja).
- **Developed by:** Keita Miwa ([๐•](https://twitter.com/miwa_ensan))
- **Model type:** GPT-2
- **Language(s) (NLP):** Japanese
- **License:** CC-BY-SA 4.0
- **Finetuned from model:** [ku-nlp/gpt2-small-japanese-char](https://huggingface.co/ku-nlp/gpt2-small-japanese-char)
### Model Sources
<!-- Provide the basic links for the model. -->
This model is intended for use with Zenzai (AzooKeyKanaKanjiConverter).
- **Repository:** https://github.com/ensan-hcl/AzooKeyKanaKanjiConverter
## Acknowledgements
The following libraries, tools, and language resources were utilized in constructing this model.
* MeCab (https://taku910.github.io/mecab/)
* ipadic-NEologd (https://github.com/neologd/mecab-ipadic-neologd)
* torch (https://pypi.org/project/torch/)
* transformers (https://pypi.org/project/transformers/)
* datasets (https://pypi.org/project/datasets/)
* jaconv (https://pypi.org/project/jaconv/)
* llama.cpp (https://github.com/ggerganov/llama.cpp)