| --- |
| license: apache-2.0 |
| tags: |
| - text-to-image |
| --- |
| # AuraFlow v0.3 |
|
|
|
|
| %3C!----%3E%3C%2Ftd%3E%3C%2Ftr%3E%3Ctr id="L10"> | |
|
|
|
| AuraFlow v0.3 is the fully open-sourced flow-based text-to-image generation model. The model was trained with more compute compared to the previous version, [AuraFlow-v0.2](https://huggingface.co/fal/AuraFlow-v0.2). |
|
|
| Compared to AuraFlow-v0.2, the model is fine-tuned on more aesthetic datasets and now supports various aspect ratio, (now width and height up to 1536 pixels). |
|
|
| ## Usage |
|
|
| ```bash |
| $ pip install transformers accelerate protobuf sentencepiece |
| $ pip install git+https://github.com/huggingface/diffusers.git |
| ``` |
|
|
| ```python |
| from diffusers import AuraFlowPipeline |
| import torch |
| |
| pipeline = AuraFlowPipeline.from_pretrained( |
| "terminusresearch/auraflow-v0.3", |
| torch_dtype=torch.float16, |
| variant="fp16", |
| ).to("cuda") |
| |
| image = pipeline( |
| prompt="rempage of the iguana character riding F1, fast and furious, cinematic movie poster", |
| width=1536, |
| height=768, |
| num_inference_steps=50, |
| generator=torch.Generator().manual_seed(1), |
| guidance_scale=3.5, |
| ).images[0] |
| |
| image.save("output.png") |
| ``` |