---
license: cc-by-nc-4.0
tags:
- audio-to-video
library_name: diffusers
---
[R-FLAV](https://arxiv.org/abs/2503.08307) models trained on the Landscape and AIST++ datasets for 400k iterations.

For more information, see the GitHub repository: https://github.com/ErgastiAlex/R-FLAV

To download the checkpoints directly from code:
```python
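from huggingface_hub import hf_hub_download
import torch
from models import FLAV
# Generator is the vocoder class from the R-FLAV repository; import it from
# the module that defines it there (the import is not shown in this card).

# args.model_ckpt: the model checkpoint identifier or path, set by the caller.
model = FLAV.from_pretrained(args.model_ckpt)

# Fetch the vocoder config and weights; both land in the same cache directory.
hf_hub_download(repo_id="MaverickAlex/R-FLAV-B-1-LS", filename="vocoder/config.json")
vocoder_path = hf_hub_download(repo_id="MaverickAlex/R-FLAV-B-1-LS", filename="vocoder/vocoder.pt")

# Strip the file name so that the path points at the vocoder directory.
vocoder_path = vocoder_path.replace("vocoder.pt", "")
vocoder = Generator.from_pretrained(vocoder_path)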
```
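
As an alternative to downloading the two vocoder files one by one, the sketch below fetches everything under `vocoder/` in a single call and derives the directory with `os.path` rather than string replacement. The helper names `vocoder_dir` and `fetch_vocoder_dir` are hypothetical and not part of the R-FLAV code. Note that `os.path.dirname` drops the trailing separator that the `replace` approach keeps, so this assumes `Generator.from_pretrained` accepts a directory path without one.

```python
import os


def vocoder_dir(weights_path: str) -> str:
    """Return the directory that contains the downloaded vocoder weights."""
    return os.path.dirname(weights_path)


def fetch_vocoder_dir(repo_id: str = "MaverickAlex/R-FLAV-B-1-LS") -> str:
    """Download every file under vocoder/ and return the local directory."""
    # Imported lazily so the path helper above works without huggingface_hub.
    from huggingface_hub import snapshot_download

    # allow_patterns restricts the download to the vocoder files only.
    repo_root = snapshot_download(repo_id=repo_id, allow_patterns=["vocoder/*"])
    return os.path.join(repo_root, "vocoder")
```

The result of `fetch_vocoder_dir()` could then be passed to `Generator.from_pretrained` in place of `vocoder_path`, under the assumption stated above.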