Whisper-RMTfinetuned

This model is a fine-tuned version of OpenAI's Whisper model for Malay speech-to-text transcription.

Model Description

  • Base Model: OpenAI Whisper-Small
  • Fine-Tuned on: Malay language dataset
  • Intended Use: Speech recognition for Malay audio

Usage

from transformers import WhisperProcessor, WhisperForConditionalGeneration
import torch

model = WhisperForConditionalGeneration.from_pretrained("rmtariq/whisper-RMTfinetuned")
processor = WhisperProcessor.from_pretrained("rmtariq/whisper-RMTfinetuned")

audio = "/path/to/audio.wav"
input_features = processor(audio, sampling_rate=16000, return_tensors="pt").input_features

with torch.no_grad():
    predicted_ids = model.generate(input_features)

transcription = processor.batch_decode(predicted_ids, skip_special_tokens=True)[0]
print(transcription)
Downloads last month
63
Safetensors
Model size
242M params
Tensor type
F32
·
Inference Providers NEW
This model is not currently available via any of the supported Inference Providers.
The model cannot be deployed to the HF Inference API: The model has no library tag.

Evaluation results