---
title: MyIVR
emoji: 🎧
colorFrom: indigo
colorTo: blue
sdk: gradio
sdk_version: 5.12.0
app_file: app.py
pinned: false
---

# MyIVR: Audio Transcription and Summarization

This Space provides a simple web-based interface to upload audio files, transcribe them into text, and generate a summarized version of the transcription.

## How It Works

1. Upload an audio file in formats like `.wav` or `.mp3`.
2. The app uses:
   - [Whisper](https://github.com/openai/whisper) for audio transcription.
   - [Hugging Face Transformers](https://huggingface.co/docs/transformers/index) for text summarization.
3. Get the transcription and summary displayed on the interface.

## Features

- **Audio Transcription**: Convert speech to text.
- **Text Summarization**: Generate concise summaries of the transcription.

## Models Used

- **Whisper** (OpenAI): Handles the audio-to-text conversion.
- **T5-Small** (Hugging Face): Summarizes the transcription.

## Requirements

Dependencies are listed in the `requirements.txt` file. The app uses:

- `gradio`
- `openai-whisper`
- `transformers`
- `torch`

## Usage

Visit the deployed app at the following link: [MyIVR on Hugging Face Spaces](https://huggingface.co/spaces/JabriA/MyIVR)

Upload an audio file, and the app will process it to provide a transcription and summary.
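
The flow described under "How It Works" could be wired together roughly as in the sketch below. This is not the Space's actual `app.py`; it is a minimal illustration under stated assumptions. The function names (`make_whisper_transcriber`, `make_t5_summarizer`, `process_audio`, `build_demo`), the Whisper model size (`base`), and the summary length limits are all hypothetical choices made for the example.

```python
from typing import Callable, Optional, Tuple


def make_whisper_transcriber(model_name: str = "base") -> Callable[[str], str]:
    """Return a function mapping an audio file path to transcribed text.

    The "base" model size is an assumption; the README does not name one.
    """
    import whisper  # provided by the openai-whisper package

    model = whisper.load_model(model_name)
    return lambda path: model.transcribe(path)["text"].strip()


def make_t5_summarizer(model_name: str = "t5-small") -> Callable[[str], str]:
    """Return a function mapping text to a short summary using T5-Small."""
    from transformers import pipeline

    summarizer = pipeline("summarization", model=model_name)

    def summarize(text: str) -> str:
        # Length limits are illustrative; truncation guards against long inputs.
        result = summarizer(text, max_length=60, min_length=10, truncation=True)
        return result[0]["summary_text"]

    return summarize


def process_audio(
    audio_path: Optional[str],
    transcribe: Callable[[str], str],
    summarize: Callable[[str], str],
) -> Tuple[str, str]:
    """Transcribe the file, then summarize the transcription."""
    if not audio_path:
        return "", ""  # nothing uploaded
    transcription = transcribe(audio_path)
    summary = summarize(transcription) if transcription else ""
    return transcription, summary


def build_demo(transcribe: Callable[[str], str], summarize: Callable[[str], str]):
    """Build a Gradio interface with one audio input and two text outputs."""
    import gradio as gr

    return gr.Interface(
        fn=lambda path: process_audio(path, transcribe, summarize),
        inputs=gr.Audio(type="filepath", label="Audio file"),
        outputs=[gr.Textbox(label="Transcription"), gr.Textbox(label="Summary")],
        title="MyIVR",
    )


# To serve the interface locally (downloads both models on first run):
# build_demo(make_whisper_transcriber(), make_t5_summarizer()).launch()
```

Passing the transcriber and summarizer in as plain callables keeps `process_audio` independent of the heavy model libraries, so the glue logic can be exercised with simple stand-in functions before any model is downloaded.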