A newer version of the Gradio SDK is available:
5.21.0
metadata
title: MyIVR
emoji: 🎧
colorFrom: indigo
colorTo: blue
sdk: gradio
sdk_version: 5.12.0
app_file: app.py
pinned: false
MyIVR: Audio Transcription and Summarization
This Space provides a simple web-based interface to upload audio files, transcribe them into text, and generate a summarized version of the transcription.
How It Works
- Upload an audio file in formats like
.wav
or.mp3
. - The app uses:
- Whisper for audio transcription.
- Hugging Face Transformers for text summarization.
- Get the transcription and summary displayed on the interface.
Features
- Audio Transcription: Convert speech to text.
- Text Summarization: Generate concise summaries of the transcription.
Models Used
- Whisper (OpenAI): Handles the audio-to-text conversion.
- T5-Small (Hugging Face): Summarizes the transcription.
Requirements
Dependencies are listed in the requirements.txt
file. The app uses:
gradio
openai-whisper
transformers
torch
Usage
Visit the deployed app at the following link:
MyIVR on Hugging Face Spaces
Upload an audio file, and the app will process it to provide a transcription and summary.