--- title: README emoji: 🏃 colorFrom: red colorTo: yellow sdk: static pinned: false ---

Welcome to WARP. This is our little organization for multimodal generative models focussing on the visual domain. We have been working with generative image models a lot and will soon work on video models as well. Our main team consists of: - [Pablo Pernias](https://github.com/pabloppp/) - [Dominic Rampas](https://github.com/dome272) Feel free to join our [Discord](https://discord.gg/BTUAzb8vFY) channel! Models:

Paella

A simple & straightforward text-conditional image generation model that works on quantized latents.
More details can be found in the paper, the blog post and the YouTube video.
Only accessible through GitHub.

Würstchen

An efficient text-to-image model to train and use for inference. Achieves competetive performance to state-of-the-art methods, while needing only a fraction of the compute.
More details can be found in the paper.
Versions:

v1: Only accessible through GitHub.
v2: Accessible through GitHub and Diffusers