Update README.md
Browse files
README.md
CHANGED
@@ -1,48 +1,51 @@
|
|
1 |
-
|
2 |
-
|
3 |
-
|
4 |
-
|
5 |
-
---
|
6 |
Overview
|
7 |
-
|
8 |
|
9 |
License
|
10 |
-
This model is released under the CC BY-NC 4.0 license, which allows
|
11 |
|
12 |
Datasets
|
13 |
-
|
14 |
-
|
15 |
-
Common Voice Arabic Dataset : A crowdsourced dataset containing diverse Arabic accents and dialects.
|
16 |
-
Additional datasets may be incorporated in future iterations to improve the model's performance across various Arabic dialects.
|
17 |
|
18 |
Model Information
|
19 |
Base Model: SWivid/F5-TTS
|
20 |
-
Current
|
21 |
Training Configuration:
|
22 |
-
|
23 |
-
Batch Size: To Be Determined (TBD)
|
24 |
Max Samples: TBD
|
25 |
Training Steps: TBD
|
26 |
-
|
27 |
-
|
28 |
Usage Instructions
|
29 |
-
To use
|
30 |
|
31 |
Method 1: Manual Model Replacement
|
32 |
-
Run the F5-TTS Application
|
|
|
|
|
33 |
Copy
|
34 |
-
|
35 |
-
model : C:\Users\
|
36 |
-
Replace the Model File
|
37 |
Navigate to the displayed file location.
|
38 |
-
Rename the existing model file
|
39 |
-
|
40 |
-
|
|
|
|
|
|
|
|
|
41 |
Alternative Methods
|
42 |
-
GitHub Repository:
|
43 |
-
|
44 |
-
|
45 |
-
|
46 |
-
|
47 |
-
|
|
|
|
|
|
|
48 |
|
|
|
1 |
+
F5-TTS: Fine-Tuned Arabic Speech Synthesis Model
|
2 |
+
License: CC BY-NC 4.0
|
3 |
+
Base Model: SWivid/F5-TTS
|
4 |
+
|
|
|
5 |
Overview
|
6 |
+
This project fine-tunes the F5-TTS model for high-quality Arabic speech synthesis, incorporating regional diversity in pronunciation and accents. The fine-tuning process is ongoing, and temporary checkpoints are provided as progress updates. Future iterations will include improved models with enhanced accuracy and naturalness.
|
7 |
|
8 |
License
|
9 |
+
This model is released under the CC BY-NC 4.0 license, which allows free usage, modification, and distribution for non-commercial purposes.
|
10 |
|
11 |
Datasets
|
12 |
+
Training is based on the Common Voice Arabic Dataset, a crowdsourced dataset featuring diverse Arabic accents and dialects. Additional datasets may be incorporated in future updates to improve dialectal coverage and pronunciation accuracy.
|
|
|
|
|
|
|
13 |
|
14 |
Model Information
|
15 |
Base Model: SWivid/F5-TTS
|
16 |
+
Current Status: Ongoing fine-tuning (Temporary Checkpoints Available)
|
17 |
Training Configuration:
|
18 |
+
Batch Size: TBD
|
|
|
19 |
Max Samples: TBD
|
20 |
Training Steps: TBD
|
21 |
+
(Final training parameters will be updated upon completion of fine-tuning.)
|
|
|
22 |
Usage Instructions
|
23 |
+
To use the fine-tuned Arabic model, follow these steps:
|
24 |
|
25 |
Method 1: Manual Model Replacement
|
26 |
+
Run the F5-TTS Application
|
27 |
+
Start the application and locate the model file path displayed in the terminal. Example:
|
28 |
+
less
|
29 |
Copy
|
30 |
+
Edit
|
31 |
+
model : C:\Users\yourname\.cache\huggingface\hub\models--SWivid--F5-TTS\snapshots\995ff41929c08ff968786b448a384330438b5cb6\F5TTS_Base\model_1200000.safetensors
|
32 |
+
Replace the Model File
|
33 |
Navigate to the displayed file location.
|
34 |
+
Rename the existing model file:
|
35 |
+
Copy
|
36 |
+
Edit
|
37 |
+
model_1200000.safetensors → model_1200000.safetensors.bak
|
38 |
+
Download the Arabic checkpoint and vocabulary files from this repository and place them in the same directory.
|
39 |
+
Restart the Application
|
40 |
+
Relaunch the F5-TTS application to load the Arabic fine-tuned model.
|
41 |
Alternative Methods
|
42 |
+
GitHub Repository: Follow the F5-TTS setup instructions, but replace the default model with the Arabic checkpoint and vocabulary files provided here.
|
43 |
+
Contributions & Collaboration
|
44 |
+
This model is a work in progress, and community contributions are highly encouraged! Suggestions, improvements, and dataset contributions are welcome to refine its performance across different Arabic dialects.
|
45 |
+
|
46 |
+
Recommendations for Better Results
|
47 |
+
Use clear reference audio with minimal background noise.
|
48 |
+
Ensure balanced audio levels for improved synthesis quality.
|
49 |
+
Contributions in dataset expansion and model evaluation are highly valuable.
|
50 |
+
If you have any questions or suggestions, feel free to reach out! ��
|
51 |
|