Add instructions
README.md
ADDED
@@ -0,0 +1,13 @@
## Common Voice release generator

1. Copy the latest release id from the `RELEASES` dict in https://github.com/common-voice/common-voice/blob/main/web/src/components/pages/datasets/releases.ts
   to the `VERSIONS` variable in `generate_datasets.py`.
2. Copy the languages from https://github.com/common-voice/common-voice/blob/release-v1.78.0/web/locales/en/messages.ftl
   (replacing `release-v1.78.0` with the latest version tag) to the `languages.ftl` file.
3. Run `python generate_datasets.py` to generate the dataset repos.
4. `cd ..`
5. `huggingface-cli repo create --type dataset --organization mozilla-foundation common_voice_11_0`
6. `git clone https://huggingface.co/datasets/mozilla-foundation/common_voice_11_0`
7. `cd common_voice_11_0`
8. `cp ../common_voice_generator/common_voice_11_0/* ./`
9. `git add . && git commit -m "Release" && git push`
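
Steps 4 to 9 hard-code the release name `common_voice_11_0` in several places. The sketch below is one possible way to generate the same commands for any release name so that they do not have to be edited by hand. The helper `release_commands` and its parameters `org` and `generator_dir` are hypothetical names introduced here for illustration; they are not part of `generate_datasets.py`. The script only prints the commands and does not run them.

```python
def release_commands(release, org="mozilla-foundation",
                     generator_dir="common_voice_generator"):
    """Return the shell commands of steps 4-9 for one release name.

    `release_commands`, `org` and `generator_dir` are hypothetical names
    used for this sketch; the defaults mirror the values in the steps above.
    """
    repo_url = f"https://huggingface.co/datasets/{org}/{release}"
    return [
        # Step 4: leave the generator directory.
        "cd ..",
        # Step 5: create the dataset repo under the organization.
        f"huggingface-cli repo create --type dataset --organization {org} {release}",
        # Step 6: clone the newly created repo.
        f"git clone {repo_url}",
        # Step 7: enter the clone.
        f"cd {release}",
        # Step 8: copy the generated files into the clone.
        f"cp ../{generator_dir}/{release}/* ./",
        # Step 9: commit and push.
        'git add . && git commit -m "Release" && git push',
    ]


if __name__ == "__main__":
    # Print the commands for review; nothing is executed.
    for command in release_commands("common_voice_11_0"):
        print(command)
```

Called with `"common_voice_11_0"`, the helper yields the commands listed in steps 4 to 9; passing a different release name substitutes it everywhere it appears.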