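---
license: cc-by-nc-4.0
tags:
- audio-to-video
library_name: diffusers
---

Models of [R-FLAV](https://arxiv.org/abs/2503.08307) trained on Landscape and AIST++ for 400k iterations.

For more information, please refer to the GitHub repository at https://github.com/ErgastiAlex/R-FLAV

To download the checkpoints directly from code:

```python
from huggingface_hub import hf_hub_download
import torch
from models import FLAV  # `models` is a module of the R-FLAV GitHub repository

# `args.model_ckpt` is the model checkpoint to load, taken from your own argument parser
model = FLAV.from_pretrained(args.model_ckpt)

# Fetch the vocoder config and weights; both end up in the same cached folder
hf_hub_download(repo_id="MaverickAlex/R-FLAV-B-1-LS", filename="vocoder/config.json")
vocoder_path = hf_hub_download(repo_id="MaverickAlex/R-FLAV-B-1-LS", filename="vocoder/vocoder.pt")

# Strip the file name so that `vocoder_path` points to the folder containing both files
vocoder_path = vocoder_path.replace("vocoder.pt", "")

# `Generator` is the vocoder class; import it from the R-FLAV repository before this call
vocoder = Generator.from_pretrained(vocoder_path)
```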
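
The vocoder download step in the snippet above can also be wrapped in a small helper. This is only a sketch: the name `fetch_vocoder_dir` and its `download` parameter are hypothetical and not part of the R-FLAV repository. It assumes that `hf_hub_download` places both vocoder files in the same cached folder, and it uses `pathlib` instead of string replacement to obtain that folder.

```python
from pathlib import Path
from typing import Callable, Optional


def fetch_vocoder_dir(
    repo_id: str = "MaverickAlex/R-FLAV-B-1-LS",
    download: Optional[Callable[..., str]] = None,
) -> str:
    """Download the vocoder config and weights; return the folder holding both.

    `fetch_vocoder_dir` and `download` are hypothetical names used for illustration.
    """
    if download is None:
        # Imported lazily so the helper can be exercised without network access.
        from huggingface_hub import hf_hub_download
        download = hf_hub_download

    config_path = Path(download(repo_id=repo_id, filename="vocoder/config.json"))
    weights_path = Path(download(repo_id=repo_id, filename="vocoder/vocoder.pt"))

    # Assumption: both files resolve to the same cached snapshot folder.
    if config_path.parent != weights_path.parent:
        raise RuntimeError("vocoder config and weights landed in different folders")
    return str(weights_path.parent)
```

The returned folder could then be passed to `Generator.from_pretrained(...)` in place of the string-replaced `vocoder_path`.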