rmtariq committed
Commit f3c39fa · verified · 1 Parent(s): 4f8e78b

Create README.md

Files changed (1)
  1. README.md +47 -0
README.md ADDED
@@ -0,0 +1,47 @@
+ ---
+ license: apache-2.0
+ tags:
+ - whisper
+ - fine-tuned
+ - malay
+ - speech-to-text
+ datasets:
+ - custom-dataset
+ model-index:
+ - name: whisper-RMTfinetuned
+   results:
+   - task:
+       type: automatic-speech-recognition
+     dataset:
+       name: Malay Audio Datasets
+       type: custom
+     metrics:
+     - type: wer
+       value: 5.6 # (Replace with actual WER score)
+ ---
+
+ # Whisper-RMTfinetuned
+
+ This model is a fine-tuned version of OpenAI's Whisper model for **Malay speech-to-text transcription**.
+
+ ## **Model Description**
+ - **Base Model**: OpenAI Whisper-Small
+ - **Fine-Tuned on**: Malay language dataset
+ - **Intended Use**: Speech recognition for Malay audio
+
+ ## **Usage**
+ ```python
+ from transformers import WhisperProcessor, WhisperForConditionalGeneration
+ import librosa  # assumed here for audio loading; any loader that yields a 16 kHz mono float array works
+ import torch
+
+ model = WhisperForConditionalGeneration.from_pretrained("rmtariq/whisper-RMTfinetuned")
+ processor = WhisperProcessor.from_pretrained("rmtariq/whisper-RMTfinetuned")
+
+ # The processor expects a raw waveform, not a file path, so load and resample to 16 kHz first.
+ audio, _ = librosa.load("/path/to/audio.wav", sr=16000)
+ input_features = processor(audio, sampling_rate=16000, return_tensors="pt").input_features
+
+ with torch.no_grad():
+     predicted_ids = model.generate(input_features)
+
+ transcription = processor.batch_decode(predicted_ids, skip_special_tokens=True)[0]
+ print(transcription)
+ ```
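
The metadata in this README lists a `wer` (word error rate) metric whose value is marked as a placeholder to be replaced. As a minimal sketch of how such a score could be computed, the function below divides the word-level edit distance between a reference transcript and a model transcript by the number of reference words. The function name `word_error_rate` and the Malay sample sentences are illustrative assumptions and are not part of the original commit. A real evaluation would likely also normalize casing and punctuation before scoring.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Return WER = (substitutions + deletions + insertions) / reference word count."""
    ref_words = reference.split()
    hyp_words = hypothesis.split()
    if not ref_words:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference prefix
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp_words) + 1))
    for i, ref_word in enumerate(ref_words, start=1):
        curr = [i]  # turning i reference words into an empty hypothesis costs i deletions
        for j, hyp_word in enumerate(hyp_words, start=1):
            substitution_cost = 0 if ref_word == hyp_word else 1
            curr.append(
                min(
                    prev[j] + 1,                      # deletion of a reference word
                    curr[j - 1] + 1,                  # insertion of a hypothesis word
                    prev[j - 1] + substitution_cost,  # match or substitution
                )
            )
        prev = curr

    return prev[-1] / len(ref_words)


# Hypothetical sample pair: the hypothesis drops one of five reference words.
wer = word_error_rate("saya suka makan nasi lemak", "saya suka makan nasi")
print(f"WER: {100 * wer:.1f}%")  # prints "WER: 20.0%"
```

Averaging over a test set is usually done by summing the edit distances and reference lengths across all utterances before dividing, rather than by averaging per-utterance scores.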