[](#wd-15-beta-2)WD 1.5 Beta 2
==============================

For this release, we release two versions of the model:

*   WD 1.5 Beta 2
*   WD 1.5 Beta 2 Aesthetic

For the aesthetic version, we finetune the attention layer on popular aesthetic images. For training, it is recomended to use the base version.

[](#vae)VAE
===========

WD 1.5 uses the same VAE as WD 1.4, which can be found here [https://huggingface.co/hakurei/waifu-diffusion-v1-4/blob/main/vae/kl-f8-anime2.ckpt](https://huggingface.co/hakurei/waifu-diffusion-v1-4/blob/main/vae/kl-f8-anime2.ckpt)

### [](#release-notes)Release Notes

[https://cafeai.notion.site/WD-1-5-Beta-2-Release-Notes-2852db5a9cdd456ba52fc5730b91acfd](https://cafeai.notion.site/WD-1-5-Beta-2-Release-Notes-2852db5a9cdd456ba52fc5730b91acfd)

### [](#example-images)Example Images

[https://cafeai.notion.site/WD-1-5-Beta-2-Aesthetic-Ver-c44a410fec06478fbf1a08a9890310ff](https://cafeai.notion.site/WD-1-5-Beta-2-Aesthetic-Ver-c44a410fec06478fbf1a08a9890310ff)

[](#license)License
-------------------

WD 1.5 is released under the Fair AI Public License 1.0-SD ([https://freedevproject.org/faipl-1.0-sd/](https://freedevproject.org/faipl-1.0-sd/)). If any derivative of this model is made, please share your changes accordingly. Special thanks to ronsor/undeleted ([https://undeleted.ronsor.com/](https://undeleted.ronsor.com/)) for help with the license.

## Model overview

`wd-1-5-beta2` is a text-to-image diffusion model fine-tuned by the waifu-diffusion team on high-quality anime images. It is an updated version of the Waifu Diffusion v1.4 model, which was conditioned on anime-styled images through fine-tuning the Stable Diffusion 1.4 model. The current model, `wd-1-5-beta2`, has two versions: the base version and an "aesthetic" version that was further fine-tuned on popular aesthetic images. 

The model uses the same VAE (Variational Autoencoder) as the previous Waifu Diffusion v1.4 model, which can be found [here](https://huggingface.co/hakurei/waifu-diffusion-v1-4/blob/main/vae/kl-f8-anime2.ckpt). This VAE was originally trained on anime-style images.

## Model inputs and outputs

### Inputs
- Text prompt describing the desired image

### Outputs
- An image generated based on the input text prompt

## Capabilities

The `wd-1-5-beta2` model is capable of generating high-quality anime-style images based on text prompts. It can create a wide variety of scenes and characters, from portraits to landscapes, with a distinctive anime aesthetic.

## What can I use it for?

The `wd-1-5-beta2` model can be used for creative and entertainment purposes, such as generating anime-inspired artwork, character designs, and concept art. It could be utilized by artists, illustrators, and hobbyists to aid in their creative process or to generate unique and compelling images.

## Things to try

One interesting aspect of the `wd-1-5-beta2` model is the "aesthetic" version, which was further fine-tuned on popular aesthetic images. This version may be able to generate images with a more refined and polished anime style, potentially capturing the look and feel of high-quality anime illustrations. Experimenting with prompts that focus on aesthetic qualities, such as "masterpiece, best quality, highly detailed," could yield visually striking results.

[](#wd-15-beta-3)WD 1.5 Beta 3
==============================

[![WD 1.5 Radiance](https://i.ibb.co/hYjgvGZ/00160-2195473148.png)](https://i.ibb.co/hYjgvGZ/00160-2195473148.png)

For this release, we release five versions of the model:

*   WD 1.5 Beta3 Base
*   WD 1.5 Radiance
*   WD 1.5 Ink
*   WD 1.5 Mofu
*   WD 1.5 Illusion

The WD 1.5 Base model is only intended for training use. For generation, it is recomended to create your own finetunes and loras on top of WD 1.5 Base or use one of the aesthetic models. More information and sample generations for the aesthetic models are in the release notes

### [](#release-notes)Release Notes

[https://saltacc.notion.site/WD-1-5-Beta-3-Release-Notes-1e35a0ed1bb24c5b93ec79c45c217f63](https://saltacc.notion.site/WD-1-5-Beta-3-Release-Notes-1e35a0ed1bb24c5b93ec79c45c217f63)

[](#vae)VAE
===========

WD 1.5 uses the same VAE as WD 1.4, which can be found here [https://huggingface.co/hakurei/waifu-diffusion-v1-4/blob/main/vae/kl-f8-anime2.ckpt](https://huggingface.co/hakurei/waifu-diffusion-v1-4/blob/main/vae/kl-f8-anime2.ckpt)

[](#license)License
-------------------

WD 1.5 is released under the Fair AI Public License 1.0-SD ([https://freedevproject.org/faipl-1.0-sd/](https://freedevproject.org/faipl-1.0-sd/)). If any derivative of this model is made, please share your changes accordingly. Special thanks to ronsor/undeleted ([https://undeleted.ronsor.com/](https://undeleted.ronsor.com/)) for help with the license.

## Model overview

The `wd-1-5-beta3` model, created by the waifu-diffusion team, is a text-to-image diffusion model trained on high-quality anime images. It builds upon the previous [WD 1.5 Beta 2](https://aimodels.fyi/models/huggingFace/wd-1-5-beta2-waifu-diffusion) and [WD 1.5 Beta](https://aimodels.fyi/models/huggingFace/wd-1-5-beta-waifu-diffusion) versions, with five new aesthetic variations - WD 1.5 Radiance, WD 1.5 Ink, WD 1.5 Mofu, and WD 1.5 Illusion. The base WD 1.5 Beta3 model is intended primarily for training use, while the aesthetic variants are recommended for generation.

## Model inputs and outputs

The `wd-1-5-beta3` model takes text prompts as input and generates corresponding anime-style images as output. The text prompts can describe a wide variety of anime-themed subjects, characters, and scenes. 

### Inputs
- **Text prompts:** Short to medium-length descriptions of the desired anime-style image

### Outputs
- **Generated images:** Anime-style images that match the input text prompts

## Capabilities

The `wd-1-5-beta3` model is capable of generating a wide range of high-quality anime-style images from text prompts. It can create illustrations of characters, scenes, and more, with a distinct anime aesthetic. The five aesthetic variations offer different stylistic approaches, allowing users to explore diverse artistic interpretations.

## What can I use it for?

The `wd-1-5-beta3` model can be used for various creative and entertainment purposes, such as:

- Generating concept art and illustrations for anime-inspired projects
- Creating custom anime-style avatars or character designs
- Producing unique and personalized anime-themed artwork

The model's versatility allows users to explore their creativity and potentially monetize their work, for example, by selling generated images or offering custom illustration services.

## Things to try

One interesting aspect of the `wd-1-5-beta3` model is the ability to fine-tune it or create custom variations using the provided [WD 1.5 Base](https://aimodels.fyi/models/huggingFace/wd-1-5-beta-waifu-diffusion) model. This allows users to further customize the model's outputs to their specific needs or preferences. Experimenting with prompt engineering and the aesthetic variants can also lead to unique and unexpected results.

IMPORTANT: this is a BETA MODEL! It is not done!

[](#release-notes)Release Notes
===============================

[https://cafeai.notion.site/WD-1-5-Beta-Release-Notes-967d3a5ece054d07bb02cba02e8199b7](https://cafeai.notion.site/WD-1-5-Beta-Release-Notes-967d3a5ece054d07bb02cba02e8199b7)

[](#checkpoints)Checkpoints
===========================

Checkpoints are located in the "checkpoints" folder, under the files tab

[](#vae)VAE
===========

WD 1.5 uses the same VAE as WD 1.4, which can be found here [https://huggingface.co/hakurei/waifu-diffusion-v1-4/blob/main/vae/kl-f8-anime2.ckpt](https://huggingface.co/hakurei/waifu-diffusion-v1-4/blob/main/vae/kl-f8-anime2.ckpt)

[](#aesthetic-embeddings)Aesthetic Embeddings
=============================================

I've included a "wdgoodprompt" and "wdbadprompt" embedding in the embeddings folder to help make generation easier. With in progress models, its common to have to use long prompts for good results. Using these embeddings helps alleviate some of that.

[](#generation)Generation
=========================

With Waifu Diffusion 1.5, best results are generated from generating at a resolution of somewhere between 500 and 1000 and then using 2x latent upscale hiresfix

Here are some prompting examples: [https://cafeai.notion.site/WD-1-5-Beta-Examples-d9417e2f1f064437996b581f361e7ef3](https://cafeai.notion.site/WD-1-5-Beta-Examples-d9417e2f1f064437996b581f361e7ef3)

[](#license)License
-------------------

WD 1.5 is released under the Fair AI Public License 1.0-SD ([https://freedevproject.org/faipl-1.0-sd/](https://freedevproject.org/faipl-1.0-sd/)). If any derivative of this model is made, please share your changes accordingly. Special thanks to ronsor/undeleted ([https://undeleted.ronsor.com/](https://undeleted.ronsor.com/)) for help with the license.

## Model overview

`wd-1-5-beta` is a beta version of the Waifu Diffusion model, which is a latent text-to-image diffusion model fine-tuned on high-quality anime images. It builds upon the [Waifu Diffusion v1.3](https://aimodels.fyi/models/huggingFace/waifu-diffusion-v1-3-hakurei) and [Waifu Diffusion v1.4](https://aimodels.fyi/models/huggingFace/waifu-diffusion-hakurei) models, with further improvements and enhancements. This beta model is not yet finalized, but provides a preview of the upcoming Waifu Diffusion 1.5 release.

## Model inputs and outputs

`wd-1-5-beta` is a text-to-image generation model, taking in text prompts and outputting corresponding images. The model leverages the same VAE as Waifu Diffusion v1.4, which can be found at [https://huggingface.co/hakurei/waifu-diffusion-v1-4/blob/main/vae/kl-f8-anime2.ckpt](https://huggingface.co/hakurei/waifu-diffusion-v1-4/blob/main/vae/kl-f8-anime2.ckpt).

### Inputs
- Text prompt describing the desired image

### Outputs
- Generated image corresponding to the input text prompt

## Capabilities

The `wd-1-5-beta` model is capable of generating high-quality anime-style images from text prompts. It includes aesthetic embeddings to help improve the quality and consistency of the generated images. The model performs best when generating images at resolutions between 500 and 1000 pixels, and then using a 2x latent upscale hiresfix.

## What can I use it for?

`wd-1-5-beta` can be used for a variety of creative and entertainment purposes, such as generating anime-style artwork, character designs, and illustrations. The model is released under the Fair AI Public License 1.0-SD, which allows for commercial use and distribution of derivative works, as long as the license terms are followed.

## Things to try

With the `wd-1-5-beta` model, it's recommended to experiment with different prompting techniques and use the provided aesthetic embeddings to help improve the quality of the generated images. The model's capabilities are still in development, so users should expect some variability in the results, but the overall quality and consistency of the outputs is quite impressive.