Speechless TWI: Stage 1 (RVQ for Whisper Encoder)

A trained residual vector quantizer (RVQ) that discretizes Whisper encoder features into semantic tokens for Twi/Akan.
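As a rough illustration of what "discretizes features into tokens" means, here is a toy residual vector quantizer in pure Python. It is a sketch of the general RVQ idea only: the codebooks, the 2-D vectors, and the helper names (nearest, rvq_encode, rvq_decode) are made up for this example and are not taken from rvq_final.pt or rvq_wrapper.py. Each stage picks the nearest codebook entry for whatever the previous stages left unexplained, so one feature vector becomes one index per quantizer.

```python
# Toy residual vector quantization (RVQ) in pure Python.
# Illustrative only: the codebooks below are invented, not the trained ones.

def nearest(codebook, vec):
    """Return the index of the codebook entry closest to vec (squared L2)."""
    def dist(entry):
        return sum((e - v) ** 2 for e, v in zip(entry, vec))
    return min(range(len(codebook)), key=lambda i: dist(codebook[i]))

def rvq_encode(codebooks, vec):
    """Quantize vec with a stack of codebooks; each stage encodes the residual."""
    residual = list(vec)
    indices = []
    for codebook in codebooks:
        idx = nearest(codebook, residual)
        indices.append(idx)
        # Subtract the chosen entry so the next stage sees only the leftover error.
        residual = [r - c for r, c in zip(residual, codebook[idx])]
    return indices

def rvq_decode(codebooks, indices):
    """Reconstruct a vector by summing the selected entry from every stage."""
    out = [0.0] * len(codebooks[0][0])
    for codebook, idx in zip(codebooks, indices):
        out = [o + c for o, c in zip(out, codebook[idx])]
    return out

# Two hypothetical stages with four 2-D entries each (coarse, then fine).
codebooks = [
    [[0.0, 0.0], [1.0, 0.0], [0.0, 1.0], [1.0, 1.0]],
    [[0.0, 0.0], [0.25, 0.0], [0.0, 0.25], [0.25, 0.25]],
]

frame = [1.2, 0.1]  # stands in for one encoder feature vector
tokens = rvq_encode(codebooks, frame)
approx = rvq_decode(codebooks, tokens)
print(tokens, approx)
```

In the real model the vectors have rvq_dim dimensions, there are rvq_num_quantizers stages, and each codebook holds rvq_codebook_size entries, as recorded in config_stage1.json.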

Files

  • rvq_final.pt: checkpoint; the RVQ state dict is stored under the "rvq" key
  • config_stage1.json: training/config params
  • rvq_wrapper.py: tiny module defining RVQWrapper

Usage (example)

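import json

import torch
from huggingface_hub import hf_hub_download
from rvq_wrapper import RVQWrapper  # rvq_wrapper.py from this repo must be on the import path

REPO_ID = "ik/speechless-twi-stage1-rvq-whisper-medium"

# Download the config and checkpoint from the Hub (cached locally after the first call).
with open(hf_hub_download(REPO_ID, "config_stage1.json"), "r") as f:
    cfg = json.load(f)
ckpt = torch.load(hf_hub_download(REPO_ID, "rvq_final.pt"), map_location="cpu")

# Rebuild the quantizer from the saved hyperparameters, then load the trained weights.
rvq = RVQWrapper(cfg["rvq_dim"], cfg["rvq_num_quantizers"], cfg["rvq_codebook_size"])
rvq.load_state_dict(ckpt["rvq"])
rvq.eval()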
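The usage example reads three keys from config_stage1.json. If you want a clearer error than a bare KeyError when a config does not match, a small check along these lines may help. This is only a sketch: check_rvq_config is a hypothetical helper that is not part of this repo, and the numbers in example_cfg are placeholders rather than the values stored in config_stage1.json.

```python
import json

# Keys the usage example reads from config_stage1.json.
REQUIRED_KEYS = ("rvq_dim", "rvq_num_quantizers", "rvq_codebook_size")

def check_rvq_config(cfg):
    """Return the three RVQ hyperparameters, failing early if any is missing or invalid."""
    missing = [k for k in REQUIRED_KEYS if k not in cfg]
    if missing:
        raise KeyError(f"config is missing keys: {missing}")
    values = tuple(cfg[k] for k in REQUIRED_KEYS)
    for key, value in zip(REQUIRED_KEYS, values):
        # bool is a subclass of int, so reject it explicitly.
        if isinstance(value, bool) or not isinstance(value, int) or value <= 0:
            raise ValueError(f"{key} must be a positive integer, got {value!r}")
    return values

# Placeholder values for illustration; the real ones live in config_stage1.json.
example_cfg = json.loads(
    '{"rvq_dim": 1024, "rvq_num_quantizers": 8, "rvq_codebook_size": 2048}'
)
dim, num_quantizers, codebook_size = check_rvq_config(example_cfg)
print(dim, num_quantizers, codebook_size)
```

The returned tuple is in the same order as the positional arguments passed to RVQWrapper in the usage example.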