metadata

title: MyIVR
emoji: 🎧
colorFrom: indigo
colorTo: blue
sdk: gradio
sdk_version: 5.12.0
app_file: app.py
pinned: false

MyIVR: Audio Transcription and Summarization

This Space provides a simple web-based interface to upload audio files, transcribe them into text, and generate a summarized version of the transcription.

How It Works

Upload an audio file in formats like .wav or .mp3.
The app uses:
- Whisper for audio transcription.
- Hugging Face Transformers for text summarization.
Get the transcription and summary displayed on the interface.

Features

Audio Transcription: Convert speech to text.
Text Summarization: Generate concise summaries of the transcription.

Models Used

Whisper (OpenAI): Handles the audio-to-text conversion.
T5-Small (Hugging Face): Summarizes the transcription.

Requirements

Dependencies are listed in the requirements.txt file. The app uses:

gradio
openai-whisper
transformers
torch

Usage

Visit the deployed app at the following link:
MyIVR on Hugging Face Spaces

Upload an audio file, and the app will process it to provide a transcription and summary.