absl-py accelerate==1.6.0 aiortc av diffusers flash-attn librosa ml-collections numpy scipy soundfile torch tqdm transformers git+https://github.com/microsoft/VibeVoice.git