auraflow-v0.3 / README.md
bghira's picture
Update README.md
453b259 verified
---
license: apache-2.0
tags:
- text-to-image
---
# AuraFlow v0.3
![image/png](/static-proxy?url=https%3A%2F%2Fcdn-uploads.huggingface.co%2Fproduction%2Fuploads%2F6311151c64939fabc00c8436%2FBcH5xyGCGNnkmPC-OPS9z.png%3C%2Fspan%3E)%3C!----%3E%3C%2Ftd%3E%3C%2Ftr%3E%3Ctr id="L10">
AuraFlow v0.3 is the fully open-sourced flow-based text-to-image generation model. The model was trained with more compute compared to the previous version, [AuraFlow-v0.2](https://huggingface.co/fal/AuraFlow-v0.2).
Compared to AuraFlow-v0.2, the model is fine-tuned on more aesthetic datasets and now supports various aspect ratio, (now width and height up to 1536 pixels).
## Usage
```bash
$ pip install transformers accelerate protobuf sentencepiece
$ pip install git+https://github.com/huggingface/diffusers.git
```
```python
from diffusers import AuraFlowPipeline
import torch
pipeline = AuraFlowPipeline.from_pretrained(
"terminusresearch/auraflow-v0.3",
torch_dtype=torch.float16,
variant="fp16",
).to("cuda")
image = pipeline(
prompt="rempage of the iguana character riding F1, fast and furious, cinematic movie poster",
width=1536,
height=768,
num_inference_steps=50,
generator=torch.Generator().manual_seed(1),
guidance_scale=3.5,
).images[0]
image.save("output.png")
```