111105

snsbhg committed 15 days ago

Commit 50ff874 (verified)

1 Parent(s): 05f1586

Upload 6 files

Files changed (6)

Dockerfile +34 -0
README.md +82 -4
astrbot_plugin_config_example.json +5 -0
download_support_models.py +17 -0
reference_audio/ref_shantianliang_1.wav +0 -0
weights/README_PLACEHOLDER.txt +7 -0

Dockerfile ADDED

	@@ -0,0 +1,34 @@

+# Base image with PyTorch + CUDA 12.1 runtime
+FROM pytorch/pytorch:2.5.1-cuda12.1-cudnn9-runtime
+ENV PYTHONUNBUFFERED=1 \
+    PIP_DISABLE_PIP_VERSION_CHECK=1
+WORKDIR /app
+# System deps
+RUN apt-get update && \
+    apt-get install -y --no-install-recommends ffmpeg libsox-dev git && \
+    rm -rf /var/lib/apt/lists/*
+# Get GPT-SoVITS source
+RUN git clone --depth 1 https://github.com/RVC-Boss/GPT-SoVITS.git /app
+# Python deps (repo's + API server)
+RUN pip install --upgrade pip && \
+    pip install --no-deps --no-cache-dir -r /app/extra-req.txt && \
+    pip install --no-cache-dir -r /app/requirements.txt && \
+    pip install --no-cache-dir fastapi uvicorn soundfile huggingface_hub ffmpeg-python
+# Pre-download essential support models (Chinese frontends & encoders, sv/*)
+COPY download_support_models.py /app/download_support_models.py
+RUN python /app/download_support_models.py || true
+# Copy your weights and reference audio into the image
+COPY weights/ /app/pretrained_models/shantianliang/
+COPY reference_audio/ /app/reference_audio/
+EXPOSE 7860
+# Start REST API v2 (FastAPI)
+CMD ["python", "api_v2.py", "-a", "0.0.0.0", "-p", "7860", "-c", "GPT_SoVITS/configs/tts_infer.yaml"]

README.md CHANGED

@@ -1,10 +1,88 @@
 ---
-title: '1111'
-emoji: 🐨
 colorFrom: yellow
 colorTo: red
 sdk: docker
-pinned: false
 ---
-Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference

 ---
+title: GPT-SoVITS API (v2 ProPlus) for AstrBot
+emoji: 🗣️
 colorFrom: yellow
 colorTo: red
 sdk: docker
+app_port: 7860
+license: mit
 ---
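+# GPT-SoVITS v2 ProPlus: REST API for AstrBot (Docker Space)
+This Space provides a ready-made **REST API server (api_v2.py)** for the **AstrBot** plugin `astrbot_plugin_GPT_SoVITS`.
+Just put **your existing model weights and reference audio** at the paths below, then click **Restart and rebuild**.
+## Where to put your files (rename them first to avoid Chinese characters, spaces, plus signs, and similar characters)
+- **GPT weights (.ckpt)** → `weights/shantianliang_proplus_e32.ckpt`
+  (rename your local `shantianliangPROpius-e32.ckpt` to this name)
+- **SoVITS weights (.pth)** → `weights/shantianliang_proplus_e8_s192.pth`
+  (rename your local `shantianliangPRO+_e8_s192.pth` to this name)
+- **Reference audio (.wav)** → `reference_audio/ref_shantianliang_1.wav`
+  (rename your local `山田凉参考音频1.wav` to this name)
+> The `reference_audio/ref_shantianliang_1.wav` currently in the repository is a **silent placeholder file** that only ensures the service can start. Replace it with your real reference audio.
+## Testing after startup
+- **Switch SoVITS weights**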
+```bash
+curl -G "https://<your-space>.hf.space/set_sovits_weights" --data-urlencode "weights_path=/app/pretrained_models/shantianliang/shantianliang_proplus_e8_s192.pth"
+```
+- **Switch GPT weights**
+```bash
+curl -G "https://<your-space>.hf.space/set_gpt_weights" --data-urlencode "weights_path=/app/pretrained_models/shantianliang/shantianliang_proplus_e32.ckpt"
+```
+- **Synthesize TTS (POST, recommended)**
+```bash
+curl -L "https://<your-space>.hf.space/tts" -H "Content-Type: application/json" -d '{
+    "text": "今天来测试一下山田凉的声音，欢迎收听。",
+    "text_lang": "zh",
+    "ref_audio_path": "/app/reference_audio/ref_shantianliang_1.wav",
+    "prompt_lang": "zh",
+    "prompt_text": "这是山田凉的参考音频",
+    "media_type": "wav",
+    "streaming_mode": false
+  }' -o out.wav
+```
+- **Streaming (play while receiving)**
+```bash
+curl -N -L "https://<your-space>.hf.space/tts?text=流式测试&text_lang=zh&ref_audio_path=/app/reference_audio/ref_shantianliang_1.wav&prompt_lang=zh&prompt_text=参考提示&media_type=wav&streaming_mode=true" -o stream.wav
+```
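+> The endpoints and parameters above come from `api_v2.py`. Remember to replace the Space placeholder in each URL with your actual Space name.
+## How to fill in the AstrBot plugin
+In the configuration of AstrBot's **astrbot_plugin_GPT_SoVITS** plugin:
+- **base_url**: `https://<your-space>.hf.space`
+- **gpt_weights_path**: `/app/pretrained_models/shantianliang/shantianliang_proplus_e32.ckpt`
+- **sovits_weights_path**: `/app/pretrained_models/shantianliang/shantianliang_proplus_e8_s192.pth`
+You can then synthesize speech with the plugin's commands (such as `/说 你好` or `/生气地说 ...`) or through automatic triggering.
+## Directory structure (before upload)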
+```
+/
+├─ Dockerfile
+├─ download_support_models.py
+├─ README.md
+├─ weights/
+│  ├─ shantianliang_proplus_e32.ckpt           # ← your GPT weights (after renaming)
+│  └─ shantianliang_proplus_e8_s192.pth        # ← your SoVITS weights (after renaming)
+└─ reference_audio/
+   └─ ref_shantianliang_1.wav                  # ← your reference audio (after renaming; a silent placeholder is provided)
+```
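+## FAQ
+- If you get a **400** error saying parameters are missing, check the required `/tts` fields: `text`, `text_lang`, `ref_audio_path`, `prompt_lang`.
+- `ref_audio_path` must be a **local path on the server** (for example `/app/reference_audio/...`).
+- The Hugging Face **Docker Space listening port** is `7860` (already set in this repository).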
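
The POST example and the required-field note in the README can be mirrored from Python. The following is a minimal sketch, not part of this commit: `build_tts_request` and `synthesize` are hypothetical helper names, and the defaults for `media_type` and `streaming_mode` simply copy the values used in the README's curl example. It checks the four required fields locally before anything is sent, so a missing field fails early instead of returning a 400.

```python
import json
import urllib.request

# Required /tts fields, as listed in the README's FAQ.
REQUIRED_FIELDS = ("text", "text_lang", "ref_audio_path", "prompt_lang")


def build_tts_request(base_url, **fields):
    """Build a POST request for /tts (hypothetical helper, not from the repo)."""
    missing = [name for name in REQUIRED_FIELDS if not fields.get(name)]
    if missing:
        raise ValueError("missing required fields: " + ", ".join(missing))
    # Defaults copied from the README's curl example; callers may override them.
    payload = {"media_type": "wav", "streaming_mode": False, **fields}
    return urllib.request.Request(
        url=base_url.rstrip("/") + "/tts",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def synthesize(base_url, out_path, **fields):
    """Send the request and write the returned audio bytes to out_path."""
    request = build_tts_request(base_url, **fields)
    with urllib.request.urlopen(request) as response, open(out_path, "wb") as out:
        out.write(response.read())
```

A call would look like `synthesize("https://<your-space>.hf.space", "out.wav", text="...", text_lang="zh", ref_audio_path="/app/reference_audio/ref_shantianliang_1.wav", prompt_lang="zh")`, with the placeholder replaced by the real Space name.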

astrbot_plugin_config_example.json ADDED

	@@ -0,0 +1,5 @@

+{
+  "base_url": "https://<your-space>.hf.space",
+  "gpt_weights_path": "/app/pretrained_models/shantianliang/shantianliang_proplus_e32.ckpt",
+  "sovits_weights_path": "/app/pretrained_models/shantianliang/shantianliang_proplus_e8_s192.pth"
+}
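
To relate this config to the two weight-switch endpoints shown in the README, here is a small sketch that turns the three config values into the corresponding GET URLs. `weight_switch_urls` is a hypothetical helper, and whether the plugin builds its requests this way is not stated in this commit. The query string is percent-encoded, which is what `curl -G --data-urlencode` does in the README examples.

```python
from urllib.parse import urlencode


def weight_switch_urls(config):
    """Return the GPT and SoVITS weight-switch URLs for a plugin config dict."""
    base = config["base_url"].rstrip("/")
    gpt_query = urlencode({"weights_path": config["gpt_weights_path"]})
    sovits_query = urlencode({"weights_path": config["sovits_weights_path"]})
    return [
        f"{base}/set_gpt_weights?{gpt_query}",
        f"{base}/set_sovits_weights?{sovits_query}",
    ]
```

Loading the JSON file with `json.load` and passing the resulting dict to this function gives the URLs to request once after the Space starts.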

download_support_models.py ADDED

	@@ -0,0 +1,17 @@

+from huggingface_hub import snapshot_download
+import os
+target = "pretrained_models"
+os.makedirs(target, exist_ok=True)
+# Download speech encoders and Chinese frontends (kept small; add more as needed)
+try:
+    snapshot_download(
+        repo_id="lj1995/GPT-SoVITS",
+        repo_type="model",
+        local_dir=target,
+        allow_patterns=["sv/*", "chinese*"],
+    )
+    print("Support models downloaded to ./pretrained_models")
+except Exception as e:
+    print("Skipping support model download:", e)
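
Because the script above swallows download errors (and the Dockerfile also appends `|| true`), a failed download does not stop the build. A possible follow-up check, sketched here under assumptions, reports which of the `allow_patterns` matched no file on disk. `missing_patterns` is a hypothetical helper; the directory the API server actually reads its support models from is not stated in this commit, so `local_dir` is left as a parameter.

```python
from fnmatch import fnmatch
from pathlib import Path


def missing_patterns(local_dir, patterns):
    """Return the patterns that match no file under local_dir."""
    root = Path(local_dir)
    if not root.is_dir():
        return list(patterns)
    # Compare patterns against POSIX-style paths relative to local_dir.
    files = [p.relative_to(root).as_posix() for p in root.rglob("*") if p.is_file()]
    return [pat for pat in patterns if not any(fnmatch(f, pat) for f in files)]
```

For example, `missing_patterns("pretrained_models", ["sv/*", "chinese*"])` returns an empty list only when both groups of files are present.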

reference_audio/ref_shantianliang_1.wav ADDED

Binary file (48 kB).

weights/README_PLACEHOLDER.txt ADDED

	@@ -0,0 +1,7 @@

+Put your finetuned model files here and rename them as follows:
+- GPT (.ckpt)  -> shantianliang_proplus_e32.ckpt
+- SoVITS (.pth) -> shantianliang_proplus_e8_s192.pth
+These files will be copied into the Docker image at:
+  /app/pretrained_models/shantianliang/
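
Since the Dockerfile copies `weights/` and `reference_audio/` as they are, a misnamed file only surfaces when the API is called. A pre-upload check might look like the sketch below. `missing_files` is a hypothetical helper, and the expected names are taken from this placeholder file and the README.

```python
from pathlib import Path

# File names expected by the README and the plugin config example.
EXPECTED_FILES = (
    "weights/shantianliang_proplus_e32.ckpt",
    "weights/shantianliang_proplus_e8_s192.pth",
    "reference_audio/ref_shantianliang_1.wav",
)


def missing_files(repo_root, expected=EXPECTED_FILES):
    """Return the expected relative paths that are not regular files."""
    root = Path(repo_root)
    return [rel for rel in expected if not (root / rel).is_file()]
```

Running `missing_files(".")` from the repository root before pushing lists anything still to be added or renamed.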