README / README.md
dome272's picture
Update README.md
996ed25
|
raw
history blame
1.37 kB
metadata
title: README
emoji: 🏃
colorFrom: red
colorTo: yellow
sdk: static
pinned: false
Welcome to WARP. This is our little organization for multimodal generative models focussing on the visual domain. We have been working with generative image models a lot and will soon work on video models as well. Our main team consists of:

Feel free to join our Discord channel!

Models:

  • Paella:
    • A simple & straightforward text-conditional image generation model that works on quantized latents.
    • More details can be found in the paper, the blog post and the YouTube video.
    • Only accessible through GitHub.
  • Würstchen:
    • An efficient text-to-image model to train and use for inference. Achieves competetive performance to state-of-the-art methods, while needing only a fraction of the compute.

    • More details can be found in the paper.

    • Versions:

      • v1: Only accessible through GitHub.
      • v2: Accessible through diffusers