Update README.md
README.md
CHANGED
@@ -2,4 +2,47 @@
 license: cc-by-nc-4.0
 base_model:
 - SWivid/F5-TTS
----
+---
+Overview
+This model is a fine-tune of F5-TTS for Arabic speech synthesis. The project aims to deliver high-quality synthesis that covers regional varieties of Arabic. Fine-tuning is ongoing: the checkpoints shared here are temporary, and updated checkpoints will be uploaded as results improve.
+
+License
+This model is released under the CC BY-NC 4.0 license, which permits use, modification, and distribution for non-commercial purposes, provided attribution is given.
+
+Datasets
+The following dataset was used for training:
+
+Common Voice Arabic Dataset: A crowdsourced dataset containing diverse Arabic accents and dialects.
+Additional datasets may be incorporated in future iterations to improve the model's performance across Arabic dialects.
+
+Model Information
+Base Model: SWivid/F5-TTS
+Current Training Status: Ongoing (Temporary Checkpoint Shared)
+Training Configuration:
+
+Batch Size: To Be Determined (TBD)
+Max Samples: TBD
+Training Steps: TBD
+Note: Detailed training parameters will be updated once the fine-tuning process is complete.
+
+Usage Instructions
+To use this Arabic fine-tuned version of F5-TTS, follow these steps:
+
+Method 1: Manual Model Replacement
+Run the F5-TTS Application: Start the F5-TTS application and note the model file path printed in the terminal. It should look similar to:
+model : C:\Users\thega\.cache\huggingface\hub\models--SWivid--F5-TTS\snapshots\995ff41929c08ff968786b448a384330438b5cb6\F5TTS_Base\model_1200000.safetensors
+Replace the Model File:
+Navigate to the displayed file location.
+Rename the existing model file to model_1200000.safetensors.bak.
+Download the Arabic checkpoint and vocabulary files from this repository and save them to the same location. If the downloaded checkpoint has a different file name, it presumably needs to be renamed to model_1200000.safetensors so that the application loads it in place of the original.
+Restart the Application: Relaunch the F5-TTS application to load the updated Arabic model.
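The manual replacement in Method 1 can also be scripted. The following is a minimal sketch and not part of the F5-TTS tooling: the function name `swap_checkpoint` and its parameters are hypothetical, the Arabic checkpoint and vocabulary files are assumed to be downloaded already, and the idea that the new checkpoint must take over the original file name is an inference from the steps above rather than something the README states.

```python
import shutil
from pathlib import Path


def swap_checkpoint(original_model, new_checkpoint, new_vocab):
    """Back up the original model file and put the new files in its folder.

    Hypothetical helper: all three paths are supplied by the caller.
    Returns the path of the backup file.
    """
    original_model = Path(original_model)
    backup = original_model.with_name(original_model.name + ".bak")
    if backup.exists():
        raise FileExistsError(f"backup already exists: {backup}")

    # Step 1: keep the original weights so the change can be undone.
    original_model.rename(backup)

    # Step 2: copy the new checkpoint under the original file name
    # (assumption: the application loads the model by that name).
    shutil.copy2(new_checkpoint, original_model)

    # Step 3: place the vocabulary file next to the checkpoint.
    shutil.copy2(new_vocab, original_model.with_name(Path(new_vocab).name))
    return backup
```

To undo the change, delete the copied checkpoint and rename the `.bak` file back to its original name.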
+Alternative Methods
+GitHub Repository: Follow the setup instructions in the official F5-TTS repository, but use the Arabic checkpoint and vocab files provided here.
+Collaboration Welcome: Contributions and collaborations are encouraged to further enhance the model's performance. Feel free to reach out with suggestions or improvements.
+Contributions and Recommendations
+This model is still in development, and community contributions are encouraged to refine its performance across Arabic dialects. For best output quality, consider preprocessing the reference audio: remove background noise, balance audio levels, and enhance clarity.
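One of the preprocessing steps mentioned above, balancing audio levels, can be sketched as peak normalization. This is a minimal illustration under stated assumptions: the reference audio is a 16-bit PCM WAV file, the function names and the 0.9 target peak are hypothetical choices, and noise removal is not covered.

```python
import sys
import wave
from array import array


def normalize_peak(samples, target_peak=0.9):
    """Scale 16-bit samples so the loudest one reaches target_peak of full scale."""
    peak = max((abs(s) for s in samples), default=0)
    if peak == 0:
        return list(samples)  # silence: nothing to scale
    gain = target_peak * 32767 / peak
    # Clamp to the 16-bit range to guard against rounding overflow.
    return [max(-32768, min(32767, round(s * gain))) for s in samples]


def normalize_wav(src_path, dst_path, target_peak=0.9):
    """Read a 16-bit PCM WAV file, peak-normalize it, and write the result."""
    with wave.open(src_path, "rb") as src:
        params = src.getparams()
        if params.sampwidth != 2:
            raise ValueError("only 16-bit PCM WAV files are supported")
        samples = array("h")
        samples.frombytes(src.readframes(params.nframes))
    # WAV data is little-endian; swap bytes on big-endian machines.
    if sys.byteorder == "big":
        samples.byteswap()
    normalized = array("h", normalize_peak(samples, target_peak))
    if sys.byteorder == "big":
        normalized.byteswap()
    with wave.open(dst_path, "wb") as dst:
        dst.setparams(params)
        dst.writeframes(normalized.tobytes())
```

Scaling every sample by the same gain preserves the waveform shape, so the only effect is on loudness; channels in a stereo file are scaled together.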
+
+If you have any questions or need further clarification, feel free to get in touch!
+