[](#waifu-diffusion-v14---diffusion-for-weebs)waifu-diffusion v1.4 - Diffusion for Weebs
========================================================================================

waifu-diffusion is a latent text-to-image diffusion model that has been conditioned on high-quality anime images through fine-tuning.

[![image](https://user-images.githubusercontent.com/26317155/210155933-db3a5f1a-1ec3-4777-915c-6deff2841ce9.png)](https://user-images.githubusercontent.com/26317155/210155933-db3a5f1a-1ec3-4777-915c-6deff2841ce9.png)

masterpiece, best quality, 1girl, green hair, sweater, looking at viewer, upper body, beanie, outdoors, watercolor, night, turtleneck

[Original Weights](https://huggingface.co/hakurei/waifu-diffusion-v1-4)

[](#gradio--colab)Gradio & Colab
================================

We also support a [Gradio](https://github.com/gradio-app/gradio) Web UI and Colab with Diffusers to run Waifu Diffusion: [![Open In Spaces](https://camo.githubusercontent.com/00380c35e60d6b04be65d3d94a58332be5cc93779f630bcdfc18ab9a3a7d3388/68747470733a2f2f696d672e736869656c64732e696f2f62616467652f25463025394625413425393725323048756767696e67253230466163652d5370616365732d626c7565)](https://huggingface.co/spaces/hakurei/waifu-diffusion-demo) [![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/drive/1_8wPN7dJO746QXsFnB09Uq2VGgSRFuYE#scrollTo=1HaCauSq546O)

[](#model-description)Model Description
---------------------------------------

[See here for a full model overview.](https://gist.github.com/harubaru/f727cedacae336d1f7877c4bbe2196e1)

[](#license)License
-------------------

This model is open access and available to all, with a CreativeML OpenRAIL-M license further specifying rights and usage. The CreativeML OpenRAIL License specifies:

1.  You can't use the model to deliberately produce nor share illegal or harmful outputs or content
2.  The authors claims no rights on the outputs you generate, you are free to use them and are accountable for their use which must not go against the provisions set in the license
3.  You may re-distribute the weights and use the model commercially and/or as a service. If you do, please be aware you have to include the same use restrictions as the ones in the license and share a copy of the CreativeML OpenRAIL-M to all your users (please read the license entirely and carefully) [Please read the full license here](https://huggingface.co/spaces/CompVis/stable-diffusion-license)

[](#downstream-uses)Downstream Uses
-----------------------------------

This model can be used for entertainment purposes and as a generative art assistant.

[](#example-code)Example Code
-----------------------------

    import torch
    from torch import autocast
    from diffusers import StableDiffusionPipeline
    
    pipe = StableDiffusionPipeline.from_pretrained(
        'hakurei/waifu-diffusion',
        torch_dtype=torch.float32
    ).to('cuda')
    
    prompt = "1girl, aqua eyes, baseball cap, blonde hair, closed mouth, earrings, green background, hat, hoop earrings, jewelry, looking at viewer, shirt, short hair, simple background, solo, upper body, yellow shirt"
    with autocast("cuda"):
        image = pipe(prompt, guidance_scale=6)["sample"][0]  
        
    image.save("test.png")
    

[](#team-members-and-acknowledgements)Team Members and Acknowledgements
-----------------------------------------------------------------------

This project would not have been possible without the incredible work by Stability AI and Novel AI.

*   [Haru](https://github.com/harubaru)
*   [Salt](https://github.com/sALTaccount/)
*   [Sta @ Bit192](https://twitter.com/naclbbr)

In order to reach us, you can join our [Discord server](https://discord.gg/touhouai).

[![Discord Server](https://discordapp.com/api/guilds/930499730843250783/widget.png?style=banner2)](https://discord.gg/touhouai)

## Model overview

`waifu-diffusion` is a latent text-to-image diffusion model that has been fine-tuned on high-quality anime images. It was developed by the creator [hakurei](https://aimodels.fyi/creators/huggingFace/hakurei). Similar models include [cog-a1111-ui](https://aimodels.fyi/models/huggingFace/cog-a1111-ui-brewwh), a collection of anime stable diffusion models, [stable-diffusion-inpainting](https://aimodels.fyi/models/huggingFace/stable-diffusion-inpainting-stability-ai) for filling in masked parts of images, and [masactrl-stable-diffusion-v1-4](https://aimodels.fyi/models/huggingFace/masactrl-stable-diffusion-v1-4-adirik) for editing real or generated images.

## Model inputs and outputs

The `waifu-diffusion` model takes textual prompts as input and generates corresponding anime-style images. The input prompts can describe a wide range of subjects, characters, and scenes, and the model will attempt to render them in a unique anime aesthetic.

### Inputs
- Textual prompts describing the desired image

### Outputs
- Generated anime-style images corresponding to the input prompts

## Capabilities

`waifu-diffusion` can generate a variety of anime-inspired images based on text prompts. It is capable of rendering detailed characters, scenes, and environments in a consistent anime art style. The model has been trained on a large dataset of high-quality anime images, allowing it to capture the nuances and visual conventions of the anime genre.

## What can I use it for?

The `waifu-diffusion` model can be used for a variety of creative and entertainment purposes. It can serve as a generative art assistant, allowing users to create unique anime-style illustrations and artworks. The model could also be used in the development of anime-themed games, animations, or other multimedia projects. Additionally, the model could be utilized for personal hobbies or professional creative work involving anime-inspired visual content.

## Things to try

With `waifu-diffusion`, you can experiment with a wide range of text prompts to generate diverse anime-style images. Try mixing and matching different elements like characters, settings, and moods to see the model's versatility. You can also explore the model's capabilities by providing more detailed or specific prompts, such as including references to particular anime tropes or visual styles.