---
language:
- ar
base_model:
- SWivid/F5-TTS
pipeline_tag: text-to-speech
tags:
- speech
- f5-tts
- arabic
- text-to-speech
- tts
datasets:
- MBZUAI/ClArTTS
- mozilla-foundation/common_voice_17_0
---
# F5-TTS: Fine-Tuned Arabic Speech Synthesis Model

## Overview
This project fine-tunes the F5-TTS model for high-quality Arabic speech synthesis, incorporating regional diversity in pronunciation and accents. The fine-tuning process is ongoing, and temporary checkpoints are provided as progress updates. Future iterations will include improved models with enhanced accuracy and naturalness.

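## Current Samples

1- "لكن على ما يبدو ان هناك تصاعد غير مسبوق للاحداث."
(English: "But it seems there is an unprecedented escalation of events.")

2- "لذلك يجب علينا الإتحاد فى وجه كل الصدامات التى قد تؤثر علينا."
(English: "Therefore, we must unite in the face of all the clashes that may affect us.")

3- "كان هناك الكثير من التحديات للوصول إلى الدقه المطلوبة."
(English: "There were many challenges in reaching the required accuracy.")
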
1- 

<audio controls src="https://cdn-uploads.huggingface.co/production/uploads/645098004f731658826cfe57/Co1vv5UnOffDEyPGY47li.wav"></audio>


2- 

<audio controls src="https://cdn-uploads.huggingface.co/production/uploads/645098004f731658826cfe57/jeKaMPd7f9P11aPCe5Y_0.wav"></audio>

3-

<audio controls src="https://cdn-uploads.huggingface.co/production/uploads/645098004f731658826cfe57/-c4gemoEcNX53CA21IheJ.wav"></audio>

## License
This model is released under the **CC BY-NC 4.0** license, which permits use, modification, and distribution for **non-commercial** purposes, provided attribution is given.

## Datasets
Training is based on the **MBZUAI/ClArTTS** dataset, so the model currently supports Modern Standard Arabic (MSA).

## Model Information
- **Base Model:** SWivid/F5-TTS  
- **Current Status:** Ongoing fine-tuning (Temporary Checkpoints Available)  
- *(Final training parameters will be updated upon completion of fine-tuning.)*

## Usage Instructions
To use the fine-tuned Arabic model, follow the [F5-TTS setup instructions](https://github.com/SWivid/F5-TTS) in the GitHub repository, but replace the default model with the Arabic checkpoint and vocabulary files provided here.
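As a rough illustration of swapping in the Arabic checkpoint, the sketch below assembles an inference command for the upstream `f5-tts_infer-cli` tool. The flag names follow the upstream F5-TTS CLI as far as we know, but they may differ between versions, so check `f5-tts_infer-cli --help` first. The file names (`model_arabic.pt`, `vocab.txt`, `reference.wav`), the reference transcript placeholder, and the `F5TTS_Base` model name are assumptions for illustration, not files or settings confirmed by this repository.

```python
import shlex


def build_infer_command(ckpt_file, vocab_file, ref_audio, ref_text, gen_text,
                        model="F5TTS_Base"):
    """Assemble an argument list for the F5-TTS inference CLI.

    The model name and all paths are assumptions; adjust them to the
    files you actually downloaded and the F5-TTS version you installed.
    """
    return [
        "f5-tts_infer-cli",
        "--model", model,            # base architecture config (assumed)
        "--ckpt_file", ckpt_file,    # Arabic fine-tuned checkpoint
        "--vocab_file", vocab_file,  # Arabic vocabulary file
        "--ref_audio", ref_audio,    # short, clean reference recording
        "--ref_text", ref_text,      # transcript of the reference audio
        "--gen_text", gen_text,      # Arabic text to synthesize
    ]


command = build_infer_command(
    ckpt_file="model_arabic.pt",   # hypothetical file name
    vocab_file="vocab.txt",        # hypothetical file name
    ref_audio="reference.wav",     # hypothetical file name
    ref_text="<transcript of reference.wav>",
    gen_text="كان هناك الكثير من التحديات للوصول إلى الدقه المطلوبة.",
)

# Print a shell-safe command line; run it with subprocess.run(command) if desired.
print(shlex.join(command))
```

Building the command as a list rather than a single string avoids shell-quoting problems with Arabic text and spaces in paths.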

## Contributions & Collaboration
This model is a **work in progress**, and community contributions are highly encouraged! Suggestions, improvements, and dataset contributions are welcome to refine its performance across different Arabic dialects.

### Recommendations for Better Results
- Use **clear reference audio** with minimal background noise.  
- Ensure **balanced audio levels** for improved synthesis quality.  
- Contributions in **dataset expansion** and **model evaluation** are highly valuable.
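One possible way to check the level of a reference clip before using it is sketched below, using only the Python standard library. It reports peak and RMS level in dBFS for a 16-bit PCM WAV file. The function names and the `reference.wav` path are hypothetical, and this card does not specify any target levels, so the numbers are only a rough guide for comparing clips.

```python
import array
import math
import sys
import wave


def levels_dbfs(samples):
    """Return (peak_dbfs, rms_dbfs) for a sequence of 16-bit PCM samples."""
    if len(samples) == 0:
        raise ValueError("no samples to analyze")
    full_scale = 32768.0
    peak = max(abs(s) for s in samples) / full_scale
    rms = math.sqrt(sum(s * s for s in samples) / len(samples)) / full_scale

    def to_db(x):
        # Silence has no finite level on a logarithmic scale.
        return 20 * math.log10(x) if x > 0 else float("-inf")

    return to_db(peak), to_db(rms)


def read_wav_samples(path):
    """Read all samples from a 16-bit PCM WAV file (channels interleaved)."""
    with wave.open(path, "rb") as wav:
        if wav.getsampwidth() != 2:
            raise ValueError("this sketch only handles 16-bit PCM WAV files")
        frames = wav.readframes(wav.getnframes())
    samples = array.array("h")
    samples.frombytes(frames)
    if sys.byteorder == "big":
        samples.byteswap()  # WAV data is little-endian
    return samples


# Example usage (hypothetical path):
# peak, rms = levels_dbfs(read_wav_samples("reference.wav"))
# print(f"peak {peak:.1f} dBFS, RMS {rms:.1f} dBFS")
```

A peak very close to 0 dBFS suggests possible clipping, while a very low RMS suggests a quiet recording; either may be worth re-recording or normalizing.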

### Acknowledgment
- This work was carried out on a machine provided by **Zewail City of Science and Technology**.

  
If you have any questions or suggestions, feel free to reach out! 🚀