Spaces:
Running
Running
Update README.md
Browse files
README.md
CHANGED
@@ -15,7 +15,7 @@ short_description: smolagent tool to transcribe audio & video files
|
|
15 |
|
16 |
## Overview
|
17 |
|
18 |
-
`TranscriptTool` is a smolagent tool designed to transcribe audio and video files into text.
|
19 |
|
20 |
The repository contains three main components:
|
21 |
- **`transcription_tool.py`**: The core smolagent tool for transcription.
|
|
|
15 |
|
16 |
## Overview
|
17 |
|
18 |
+
`TranscriptTool` is a smolagent tool designed to transcribe audio and video files into text. Implementing OpenAI's Whisper and `ffmpeg`, this tool enables agents to process multimedia inputs efficiently. It supports robust file handling, including format conversion to WAV, dynamic device selection (CPU or GPU), and easy use within smolagents via the Hugging Face API.
|
19 |
|
20 |
The repository contains three main components:
|
21 |
- **`transcription_tool.py`**: The core smolagent tool for transcription.
|