MyIVR / README.md
JabriA's picture
Update README.md
105a56b verified

A newer version of the Gradio SDK is available: 5.21.0

Upgrade
metadata
title: MyIVR
emoji: 🎧
colorFrom: indigo
colorTo: blue
sdk: gradio
sdk_version: 5.12.0
app_file: app.py
pinned: false

MyIVR: Audio Transcription and Summarization

This Space provides a simple web-based interface to upload audio files, transcribe them into text, and generate a summarized version of the transcription.

How It Works

  1. Upload an audio file in formats like .wav or .mp3.
  2. The app uses:
  3. Get the transcription and summary displayed on the interface.

Features

  • Audio Transcription: Convert speech to text.
  • Text Summarization: Generate concise summaries of the transcription.

Models Used

  • Whisper (OpenAI): Handles the audio-to-text conversion.
  • T5-Small (Hugging Face): Summarizes the transcription.

Requirements

Dependencies are listed in the requirements.txt file. The app uses:

  • gradio
  • openai-whisper
  • transformers
  • torch

Usage

Visit the deployed app at the following link:
MyIVR on Hugging Face Spaces

Upload an audio file, and the app will process it to provide a transcription and summary.