Spaces:

inwneon
/

project-voice-diarzation

Paused

project-voice-diarzation / README.md

sivakorn-su

feat: add voice diarization project

78dde53 5 months ago

1.92 kB

metadata

title: WhisperPyanoteLLM
emoji: 📉
colorFrom: indigo
colorTo: green
sdk: docker
pinned: false
license: apache-2.0

WhisperPyanoteLLM

A FastAPI-based app for speaker diarization and transcription using Whisper and PyAnnote, with LLM-powered summarization.

Clone the repository:

git clone <your-repo-url>
cd WhisperPyanoteLLM

Create a .env file:

HF_TOKEN=your_huggingface_token
TOGETHER_API_KEY=your_together_api_key
NGROK_AUTH_TOKEN=your_ngrok_token

Install dependencies:
```
pip install -r requirements.txt
```
Run the app:
```
uvicorn app:app --reload --port 8300
```
Access the API:
- Health check: http://localhost:8300/health
- Upload endpoint: /upload_video/

Create a .env.prod file:

HF_TOKEN=your_huggingface_token
TOGETHER_API_KEY=your_together_api_key
NGROK_AUTH_TOKEN=your_ngrok_token

Run the Docker container:

docker run --env-file .env.prod -p 8300:8300 whisperpyanote

Access the API:
- Health check: http://localhost:8300/health
- Upload endpoint: /upload_video/

Make sure your .env and .env.prod files are not committed to version control.
For best performance, run on a machine with a CUDA-enabled GPU.
For more details, see the code and comments in app.py.

Apache-2.0