license: apache-2.0 | |
tags: | |
- text-to-image | |
# AuraFlow v0.3 | |
 | |
AuraFlow v0.3 is the fully open-sourced flow-based text-to-image generation model. The model was trained with more compute compared to the previous version, [AuraFlow-v0.2](https://huggingface.co/fal/AuraFlow-v0.2). | |
Compared to AuraFlow-v0.2, the model is fine-tuned on more aesthetic datasets and now supports various aspect ratio, (now width and height up to 1536 pixels). | |
## Usage | |
```bash | |
$ pip install transformers accelerate protobuf sentencepiece | |
$ pip install git+https://github.com/huggingface/diffusers.git | |
``` | |
```python | |
from diffusers import AuraFlowPipeline | |
import torch | |
pipeline = AuraFlowPipeline.from_pretrained( | |
"terminusresearch/auraflow-v0.3", | |
torch_dtype=torch.float16, | |
variant="fp16", | |
).to("cuda") | |
image = pipeline( | |
prompt="rempage of the iguana character riding F1, fast and furious, cinematic movie poster", | |
width=1536, | |
height=768, | |
num_inference_steps=50, | |
generator=torch.Generator().manual_seed(1), | |
guidance_scale=3.5, | |
).images[0] | |
image.save("output.png") | |
``` |