# Tune-A-Video
This repository is the official implementation of [Tune-A-Video](https://arxiv.org/abs/2212.11565).
**[Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation](https://arxiv.org/abs/2212.11565)**
[Jay Zhangjie Wu](https://zhangjiewu.github.io/),
[Yixiao Ge](https://geyixiao.com/),
[Xintao Wang](https://xinntao.github.io/),
[Stan Weixian Lei](),
[Yuchao Gu](https://ycgu.site/),
[Yufei Shi](),
[Wynne Hsu](https://www.comp.nus.edu.sg/~whsu/),
[Ying Shan](https://scholar.google.com/citations?user=4oXBp9UAAAAJ&hl=en),
[Xiaohu Qie](https://scholar.google.com/citations?user=mk-F69UAAAAJ&hl=en),
[Mike Zheng Shou](https://sites.google.com/view/showlab)
[](https://tuneavideo.github.io/)
[](https://arxiv.org/abs/2212.11565)
[](https://huggingface.co/spaces/Tune-A-Video-library/Tune-A-Video-Training-UI)
[](https://colab.research.google.com/github/showlab/Tune-A-Video/blob/main/notebooks/Tune-A-Video.ipynb)
Given a video-text pair as input, our method, Tune-A-Video, fine-tunes a pre-trained text-to-image diffusion model for text-to-video generation.
Input Video | Output Video | ||
![]() |
![]() |
![]() |
![]() |
"A man is skiing" | "Spider Man is skiing on the beach, cartoon style” | "Wonder Woman, wearing a cowboy hat, is skiing" | "A man, wearing pink clothes, is skiing at sunset" |
![]() |
![]() |
![]() |
![]() |
"A rabbit is eating a watermelon on the table" | "A rabbit is |
"A cat with sunglasses is eating a watermelon on the beach" | "A puppy is eating a cheeseburger on the table, comic style" |
![]() |
![]() |
![]() |
![]() |
"A jeep car is moving on the road" | "A Porsche car is moving on the beach" | "A car is moving on the road, cartoon style" | "A car is moving on the snow" |
![]() |
![]() |
![]() |
![]() |
"A man is dribbling a basketball" | "James Bond is dribbling a basketball on the beach" | "An astronaut is dribbling a basketball, cartoon style" | "A lego man in a black suit is dribbling a basketball" |
Input Video | Output Video | ||
![]() |
![]() |
![]() |
![]() |
"A bear is playing guitar" | "1girl is playing guitar, white hair, medium hair, cat ears, closed eyes, cute, scarf, jacket, outdoors, streets" | "1boy is playing guitar, bishounen, casual, indoors, sitting, coffee shop, bokeh" | "1girl is playing guitar, red hair, long hair, beautiful eyes, looking at viewer, cute, dress, beach, sea" |
Input Video | Output Video | ||
![]() |
![]() |
![]() |
![]() |
"A bear is playing guitar" | "A rabbit is playing guitar, modern disney style" | "A handsome prince is playing guitar, modern disney style" | "A magic princess with sunglasses is playing guitar on the stage, modern disney style" |
Input Video | Output Video | ||
![]() |
![]() |
![]() |
![]() |
"A bear is playing guitar" | "Mr Potato Head, made of lego, is playing guitar on the snow" | "Mr Potato Head, wearing sunglasses, is playing guitar on the beach" | "Mr Potato Head is playing guitar in the starry night, Van Gogh style" |