IbrahimSalah commited on
Commit
88b71d4
·
verified ·
1 Parent(s): ff87f60

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +33 -30
README.md CHANGED
@@ -1,48 +1,51 @@
1
- ---
2
- license: cc-by-nc-4.0
3
- base_model:
4
- - SWivid/F5-TTS
5
- ---
6
  Overview
7
- The F5-TTS model has been fine-tuned specifically for Arabic language speech synthesis. This project aims to deliver high-quality, regionally diverse speech synthesis capabilities for Arabic speakers. The fine-tuning process is ongoing, and temporary checkpoints are being shared as progress is made. Once better results are obtained, updated checkpoints will be uploaded.
8
 
9
  License
10
- This model is released under the CC BY-NC 4.0 license, which allows for free usage, modification, and distribution for non-commercial purposes.
11
 
12
  Datasets
13
- The following dataset was used for training:
14
-
15
- Common Voice Arabic Dataset : A crowdsourced dataset containing diverse Arabic accents and dialects.
16
- Additional datasets may be incorporated in future iterations to improve the model's performance across various Arabic dialects.
17
 
18
  Model Information
19
  Base Model: SWivid/F5-TTS
20
- Current Training Status: Ongoing (Temporary Checkpoint Shared)
21
  Training Configuration:
22
-
23
- Batch Size: To Be Determined (TBD)
24
  Max Samples: TBD
25
  Training Steps: TBD
26
- Note: Detailed training parameters will be updated once the fine-tuning process is complete.
27
-
28
  Usage Instructions
29
- To use this Arabic fine-tuned version of F5-TTS, follow these steps:
30
 
31
  Method 1: Manual Model Replacement
32
- Run the F5-TTS Application: Start the F5-TTS application and note the model file path displayed in the terminal. It should look similar to:
 
 
33
  Copy
34
- 1
35
- model : C:\Users\thega\.cache\huggingface\hub\models--SWivid--F5-TTS\snapshots\995ff41929c08ff968786b448a384330438b5cb6\F5TTS_Base\model_1200000.safetensors
36
- Replace the Model File:
37
  Navigate to the displayed file location.
38
- Rename the existing model file to model_1200000.safetensors.bak.
39
- Download the Arabic checkpoint and vocabulary files from this repository and save them to the same location.
40
- Restart the Application: Relaunch the F5-TTS application to load the updated Arabic model.
 
 
 
 
41
  Alternative Methods
42
- GitHub Repository: Refer to the official F5-TTS repository for setup instructions but use the Arabic checkpoint and vocab files provided here.
43
- Collaboration Welcome: Contributions and collaborations are encouraged to further enhance the model's performance. Feel free to reach out with suggestions or improvements.
44
- Contributions and Recommendations
45
- This model is still in development, and contributions from the community are highly encouraged to refine its performance across different Arabic dialects. For optimal output quality, consider preprocessing the reference audio by removing background noise, balancing audio levels, and enhancing clarity.
46
-
47
- If you have any questions or need further clarification, feel free to get in touch!
 
 
 
48
 
 
1
+ F5-TTS: Fine-Tuned Arabic Speech Synthesis Model
2
+ License: CC BY-NC 4.0
3
+ Base Model: SWivid/F5-TTS
4
+
 
5
  Overview
6
+ This project fine-tunes the F5-TTS model for high-quality Arabic speech synthesis, incorporating regional diversity in pronunciation and accents. The fine-tuning process is ongoing, and temporary checkpoints are provided as progress updates. Future iterations will include improved models with enhanced accuracy and naturalness.
7
 
8
  License
9
+ This model is released under the CC BY-NC 4.0 license, which allows free usage, modification, and distribution for non-commercial purposes.
10
 
11
  Datasets
12
+ Training is based on the Common Voice Arabic Dataset, a crowdsourced dataset featuring diverse Arabic accents and dialects. Additional datasets may be incorporated in future updates to improve dialectal coverage and pronunciation accuracy.
 
 
 
13
 
14
  Model Information
15
  Base Model: SWivid/F5-TTS
16
+ Current Status: Ongoing fine-tuning (Temporary Checkpoints Available)
17
  Training Configuration:
18
+ Batch Size: TBD
 
19
  Max Samples: TBD
20
  Training Steps: TBD
21
+ (Final training parameters will be updated upon completion of fine-tuning.)
 
22
  Usage Instructions
23
+ To use the fine-tuned Arabic model, follow these steps:
24
 
25
  Method 1: Manual Model Replacement
26
+ Run the F5-TTS Application
27
+ Start the application and locate the model file path displayed in the terminal. Example:
28
+ less
29
  Copy
30
+ Edit
31
+ model : C:\Users\yourname\.cache\huggingface\hub\models--SWivid--F5-TTS\snapshots\995ff41929c08ff968786b448a384330438b5cb6\F5TTS_Base\model_1200000.safetensors
32
+ Replace the Model File
33
  Navigate to the displayed file location.
34
+ Rename the existing model file:
35
+ Copy
36
+ Edit
37
+ model_1200000.safetensors → model_1200000.safetensors.bak
38
+ Download the Arabic checkpoint and vocabulary files from this repository and place them in the same directory.
39
+ Restart the Application
40
+ Relaunch the F5-TTS application to load the Arabic fine-tuned model.
41
  Alternative Methods
42
+ GitHub Repository: Follow the F5-TTS setup instructions, but replace the default model with the Arabic checkpoint and vocabulary files provided here.
43
+ Contributions & Collaboration
44
+ This model is a work in progress, and community contributions are highly encouraged! Suggestions, improvements, and dataset contributions are welcome to refine its performance across different Arabic dialects.
45
+
46
+ Recommendations for Better Results
47
+ Use clear reference audio with minimal background noise.
48
+ Ensure balanced audio levels for improved synthesis quality.
49
+ Contributions in dataset expansion and model evaluation are highly valuable.
50
+ If you have any questions or suggestions, feel free to reach out! ��
51