Commit 68830cf by anton-l (HF Staff) · 1 parent: 568b7da

Add instructions

Files changed (1)
  1. README.md +13 -0
README.md ADDED
@@ -0,0 +1,13 @@
+ ## Common Voice release generator
+
+ 1. Copy the latest release id from the `RELEASES` dict in https://github.com/common-voice/common-voice/blob/main/web/src/components/pages/datasets/releases.ts
+ to the `VERSIONS` variable in `generate_datasets.py`.
+ 2. Copy the languages from https://github.com/common-voice/common-voice/blob/release-v1.78.0/web/locales/en/messages.ftl
+ (replacing `release-v1.78.0` with the latest version tag) to the `languages.ftl` file.
+ 3. Run `python generate_datasets.py` to generate the dataset repos.
+ 4. `cd ..`
+ 5. `huggingface-cli repo create --type dataset --organization mozilla-foundation common_voice_11_0`
+ 6. `git clone https://huggingface.co/datasets/mozilla-foundation/common_voice_11_0`
+ 7. `cd common_voice_11_0`
+ 8. `cp ../common_voice_generator/common_voice_11_0/* ./`
+ 9. `git add . && git commit -m "Release" && git push`
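
Steps 3 through 9 above could be collected into a single script. The following is a minimal sketch, not part of the commit. The `DATASET`, `ORG`, and `DRY_RUN` variables and the `run` and `release_steps` helpers are hypothetical names introduced here for illustration. It assumes the script is started from inside the generator directory, which step 8 refers to as `common_voice_generator`. By default it performs a dry run that only prints each command.

```shell
#!/bin/sh
# Sketch of README steps 3-9. Assumption: started from inside the generator
# directory (named common_voice_generator in step 8).
set -eu

# Hypothetical knobs, not defined anywhere in the original README.
DATASET="${DATASET:-common_voice_11_0}"  # repo name used in steps 5-8
ORG="${ORG:-mozilla-foundation}"         # organization used in steps 5-6
DRY_RUN="${DRY_RUN:-1}"                  # 1 = print commands, 0 = execute them

# Print the command in dry-run mode, otherwise execute it in this shell
# (so that `cd` changes the directory for the steps that follow).
run() {
  if [ "$DRY_RUN" = "1" ]; then
    printf '+ %s\n' "$*"
  else
    "$@"
  fi
}

release_steps() {
  run python generate_datasets.py                                   # step 3
  run cd ..                                                         # step 4
  run huggingface-cli repo create --type dataset \
      --organization "$ORG" "$DATASET"                              # step 5
  run git clone "https://huggingface.co/datasets/$ORG/$DATASET"     # step 6
  run cd "$DATASET"                                                 # step 7
  # Step 8: the glob is handed to a child shell so that it is expanded
  # only when the command actually runs, after the `cd` above.
  run sh -c "cp ../common_voice_generator/$DATASET/* ./"
  # Step 9: with `set -e`, each command must succeed before the next runs,
  # which mirrors the `&&` chain in the README.
  run git add .
  run git commit -m "Release"
  run git push
}

release_steps
```

Running the script as is prints the nine commands without touching anything. Setting `DRY_RUN=0` executes them, which presumably requires being logged in to the Hugging Face Hub with write access to the target organization.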