cool-japan-diffusion-2-1-0

Maintainer: aipicasso

Total Score

65

Last updated 5/17/2024

Model Link: View on HuggingFace
API Spec: View on HuggingFace
Github Link: No Github link provided
Paper Link: No paper link provided

Model overview

The cool-japan-diffusion-2-1-0 model is a text-to-image diffusion model developed by aipicasso that is fine-tuned from the Stable Diffusion v2-1 model. This model aims to generate images with a focus on Japanese aesthetic and cultural elements, building upon the strong capabilities of the Stable Diffusion framework.

Model inputs and outputs

The cool-japan-diffusion-2-1-0 model takes text prompts as input and generates corresponding images as output. The text prompts can describe a wide range of concepts, from characters and scenes to abstract ideas, and the model will attempt to render these as visually compelling images.

Inputs

  • Text prompt: A natural language description of the desired image, which can include details about the subject, style, and various other attributes.

Outputs

  • Generated image: The model outputs a high-resolution image that visually represents the provided text prompt, with a focus on Japanese-inspired aesthetics and elements.
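
This text-to-image interface maps naturally onto the Hugging Face diffusers library. Below is a minimal sketch, assuming the repository id aipicasso/cool-japan-diffusion-2-1-0 (taken from the model link above) and a CUDA-capable GPU; treat it as a starting point rather than the official usage snippet:

```python
# Minimal sketch using diffusers; the repository id is assumed from the
# HuggingFace model link above and may differ from the actual id.
import torch
from diffusers import StableDiffusionPipeline

model_id = "aipicasso/cool-japan-diffusion-2-1-0"  # assumed repository id

pipe = StableDiffusionPipeline.from_pretrained(model_id, torch_dtype=torch.float16)
pipe = pipe.to("cuda")

prompt = "anime style, a girl in a kimono standing under cherry blossoms"
image = pipe(prompt).images[0]  # returns a PIL image
image.save("cool_japan_sample.png")
```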

Capabilities

The cool-japan-diffusion-2-1-0 model is capable of generating a diverse array of images inspired by Japanese art, culture, and design. This includes portraits of anime-style characters, detailed illustrations of traditional Japanese landscapes and architecture, and imaginative scenes blending modern and historical elements. The model's attention to visual detail and ability to capture the essence of Japanese aesthetics make it a powerful tool for creative endeavors.

What can I use it for?

The cool-japan-diffusion-2-1-0 model can be utilized for a variety of applications, such as:

  • Artistic creation: Generate unique, Japanese-inspired artwork and illustrations for personal or commercial use, including book covers, poster designs, and digital art.
  • Character design: Create detailed character designs for anime, manga, or other Japanese-influenced media, with a focus on accurate facial features, clothing, and expressions.
  • Scene visualization: Render immersive scenes of traditional Japanese landscapes, cityscapes, and architectural elements to assist with worldbuilding or visual storytelling.
  • Conceptual ideation: Explore and visualize abstract ideas or themes through the lens of Japanese culture and aesthetics, opening up new creative possibilities.

Things to try

One interesting aspect of the cool-japan-diffusion-2-1-0 model is its ability to capture the intricate details and refined sensibilities associated with Japanese art and design. Try experimenting with prompts that incorporate specific elements, such as:

  • Traditional Japanese art styles (e.g., ukiyo-e, sumi-e, Japanese calligraphy)
  • Iconic Japanese landmarks or architectural features (e.g., torii gates, pagodas, Shinto shrines)
  • Japanese cultural motifs (e.g., cherry blossoms, koi fish, Mount Fuji)
  • Anime and manga-inspired character designs

By focusing on these distinctive Japanese themes and aesthetics, you can unlock the model's full potential and create truly captivating, culturally immersive images.
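
As a rough sketch of this kind of experimentation, you can hold the seed fixed while varying the prompt, so differences come from the prompt rather than the random initialization. This reuses the pipe object from the earlier sketch and assumes the standard diffusers call signature; the prompts are just illustrative:

```python
import torch

# Prompts drawn from the themes listed above; adjust freely.
prompts = [
    "ukiyo-e woodblock print of Mount Fuji at dawn",
    "a vermilion torii gate in the snow, cinematic lighting",
    "koi fish swimming beneath cherry blossoms, sumi-e ink style",
]
negative_prompt = "low quality, blurry, distorted"

for i, prompt in enumerate(prompts):
    # Re-seed each run so the prompts are compared on equal footing.
    generator = torch.Generator(device="cuda").manual_seed(42)
    image = pipe(
        prompt,
        negative_prompt=negative_prompt,
        guidance_scale=7.5,
        num_inference_steps=30,
        generator=generator,
    ).images[0]
    image.save(f"cool_japan_try_{i}.png")
```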



This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Models

stable-diffusion

Maintainer: stability-ai

Total Score

107.9K

Stable Diffusion is a latent text-to-image diffusion model capable of generating photo-realistic images given any text input. Developed by Stability AI, it can create detailed visuals from simple text prompts. The model has several versions, with each newer version trained for longer and producing higher-quality images than the previous ones. Its main advantage is the ability to generate highly detailed and realistic images from a wide range of textual descriptions, making it a powerful tool for creative applications. The model has been trained on a large and diverse dataset, enabling it to handle a broad spectrum of subjects and styles.

Model inputs and outputs

Inputs

  • Prompt: The text prompt that describes the desired image. This can be a simple description or a more detailed, creative prompt.
  • Seed: An optional random seed value to control the randomness of the image generation process.
  • Width and Height: The desired dimensions of the generated image, which must be multiples of 64.
  • Scheduler: The algorithm used to generate the image, with options like DPMSolverMultistep.
  • Num Outputs: The number of images to generate (up to 4).
  • Guidance Scale: The scale for classifier-free guidance, which controls the trade-off between image quality and faithfulness to the input prompt.
  • Negative Prompt: Text that specifies things the model should avoid including in the generated image.
  • Num Inference Steps: The number of denoising steps to perform during the image generation process.

Outputs

  • Array of image URLs: The generated images are returned as an array of URLs pointing to the created images.

Capabilities

Stable Diffusion can generate a wide variety of photorealistic images from text prompts. It can create images of people, animals, landscapes, architecture, and more, with a high level of detail and accuracy, and it is particularly skilled at rendering complex scenes and capturing the essence of the input prompt. One of its key strengths is its ability to handle diverse prompts, from simple descriptions to more creative and imaginative ideas, including fantastical creatures, surreal landscapes, and abstract concepts.

What can I use it for?

Stable Diffusion can be used for a variety of creative applications, such as:

  • Visualizing ideas and concepts for art, design, or storytelling
  • Generating images for use in marketing, advertising, or social media
  • Aiding in the development of games, movies, or other visual media
  • Exploring and experimenting with new ideas and artistic styles

The model's versatility and high-quality output make it a valuable tool for anyone looking to bring their ideas to life through visual art.

Things to try

Experiment with prompts that combine specific elements, such as "a steam-powered robot exploring a lush, alien jungle," to see how the model handles complex and imaginative scenes. The model's support for different image sizes and resolutions also lets you explore its limits: by generating images at various scales, you can see how it handles the level of detail and complexity required for different use cases, from high-resolution artwork to smaller social media graphics.
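
The inputs listed above correspond to a hosted, Replicate-style API for this model. A hedged sketch with the Replicate Python client might look like the following; the exact model slug, version pinning, and accepted fields can vary, so check the model page before relying on it:

```python
# Sketch only: the model slug and input fields follow the description above, but the
# hosted API may require pinning a version, e.g. "stability-ai/stable-diffusion:<version>".
import replicate

output = replicate.run(
    "stability-ai/stable-diffusion",
    input={
        "prompt": "a steam-powered robot exploring a lush, alien jungle",
        "width": 768,                    # must be a multiple of 64
        "height": 768,
        "num_outputs": 1,
        "scheduler": "DPMSolverMultistep",
        "guidance_scale": 7.5,
        "negative_prompt": "blurry, low quality",
        "num_inference_steps": 50,
    },
)
print(output)  # an array of URLs pointing to the generated images
```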

stable-diffusion-2-1

Maintainer: stabilityai

Total Score

3.7K

The stable-diffusion-2-1 model is a text-to-image generation model developed by Stability AI. It is a fine-tuned version of the stable-diffusion-2 model, trained for an additional 55k steps on the same dataset and then a further 155k steps with adjusted "unsafety" filtering settings. Similar models include stable-diffusion-2-1-base, which fine-tunes the stable-diffusion-2-base model.

Model inputs and outputs

The stable-diffusion-2-1 model is a diffusion-based text-to-image generation model that takes text prompts as input and generates corresponding images as output. The text prompts are encoded using a fixed, pre-trained text encoder, and the generated images are 768x768 pixels in size.

Inputs

  • Text prompt: A natural language description of the desired image.

Outputs

  • Image: A 768x768 pixel image generated based on the input text prompt.

Capabilities

The stable-diffusion-2-1 model can generate a wide variety of images based on text prompts, from realistic scenes to fantastical creations. It performs well at producing detailed and complex images, rendering different styles and artistic mediums, and combining diverse visual elements. However, the model still has limitations in generating fully photorealistic images, rendering legible text, and handling more complex compositional tasks.

What can I use it for?

The stable-diffusion-2-1 model is intended for research purposes only. Possible use cases include generating artworks and designs, creating educational or creative tools, and probing the limitations and biases of generative models. The model should not be used to intentionally create or disseminate images that could be harmful, offensive, or propagate stereotypes.

Things to try

One interesting aspect of the stable-diffusion-2-1 model is its ability to generate images in different styles and artistic mediums based on the text prompt. For example, you could try prompts that combine realistic elements with more fantastical or stylized components, or experiment with prompts that evoke specific artistic movements or genres. The model's performance may also vary depending on the language and cultural context of the prompt, so exploring prompts in different languages could yield interesting results.
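
This checkpoint can be loaded locally with diffusers. The sketch below uses the stabilityai/stable-diffusion-2-1 repository and swaps in the DPMSolverMultistep scheduler, which is commonly recommended for this checkpoint; the prompt is only an example:

```python
import torch
from diffusers import StableDiffusionPipeline, DPMSolverMultistepScheduler

model_id = "stabilityai/stable-diffusion-2-1"

pipe = StableDiffusionPipeline.from_pretrained(model_id, torch_dtype=torch.float16)
# Replace the default scheduler with DPM-Solver++ for faster, high-quality sampling.
pipe.scheduler = DPMSolverMultistepScheduler.from_config(pipe.scheduler.config)
pipe = pipe.to("cuda")

image = pipe("a traditional Japanese garden in autumn, photorealistic").images[0]
image.save("sd21_sample.png")  # 768x768 by default for this checkpoint
```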

japanese-stable-diffusion

Maintainer: rinna

Total Score

171

The japanese-stable-diffusion model is a Japanese-specific latent text-to-image diffusion model developed by rinna. It is based on Stable Diffusion and is capable of generating photo-realistic images from Japanese text input, which makes it useful for anime, manga, and other Japanese-themed content creation.

Model inputs and outputs

The japanese-stable-diffusion model takes Japanese text prompts as input and generates corresponding photo-realistic images as output. The text prompts can describe a wide range of scenes, objects, and concepts, and the model will attempt to render them visually.

Inputs

  • Text prompts: Japanese-language text that describes the desired image to generate.

Outputs

  • Images: Photo-realistic images generated based on the input text prompt.

Capabilities

The japanese-stable-diffusion model can generate a wide variety of Japanese-themed images, from anime characters to real-world scenes. It can capture details like facial features, clothing, and background elements with a high level of realism. The model has been trained on a large dataset of Japanese-language text and images, allowing it to understand and generate content that is culturally relevant and accurate.

What can I use it for?

The japanese-stable-diffusion model can be used for a variety of creative and artistic applications, such as:

  • Generating illustrations, concept art, or other visual assets for anime, manga, or Japanese-themed video games and media.
  • Creating promotional or marketing materials with Japanese-language text and visuals.
  • Assisting with Japanese language learning by generating images to accompany vocabulary or grammar lessons.
  • Exploring Japanese culture and aesthetics through the generation of unique and visually engaging images.

Things to try

One interesting thing to try with the japanese-stable-diffusion model is to experiment with different guidance scale values when generating images. The guidance scale determines how closely the generated image follows the input text prompt; adjusting it lets you trade off between more literal and more loosely interpreted results, depending on your preferences and use case. Another idea is to combine the japanese-stable-diffusion model with other AI-powered tools, such as text-to-speech or natural language processing models, to create more interactive and multimodal experiences. For example, you could generate Japanese-language images and then have them narrated or described by a text-to-speech system.
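
The guidance-scale experiment above translates directly into code. The sketch below assumes a pipeline object jp_pipe has already been loaded following the rinna model card (the model uses its own Japanese tokenizer and text encoder, so the exact loading code differs from the standard Stable Diffusion setup) and that it exposes the usual diffusers-style call signature:

```python
import torch

# Assumes `jp_pipe` is a loaded, diffusers-style pipeline for rinna/japanese-stable-diffusion;
# see the model card for the exact loading code.
prompt = "桜の木の下に立つ着物姿の少女"  # "a girl in a kimono standing under a cherry tree"

for scale in (3.0, 7.5, 12.0):
    # Fixed seed so the only thing changing between runs is the guidance scale.
    generator = torch.Generator(device="cuda").manual_seed(0)
    image = jp_pipe(prompt, guidance_scale=scale, generator=generator).images[0]
    image.save(f"jsd_guidance_{scale}.png")
```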

stable-diffusion-2

Maintainer: stabilityai

Total Score

1.8K

The stable-diffusion-2 model is a diffusion-based text-to-image generation model developed by Stability AI. It is an improved version of the original Stable Diffusion model, trained for 150k steps using a v-objective on the same dataset as the base model. The model generates high-resolution images (768x768) from text prompts and can be used with the stablediffusion repository or the diffusers library. Similar models include SDXL-Turbo and Stable Cascade, also developed by Stability AI: SDXL-Turbo is a distilled version of the SDXL 1.0 model optimized for real-time synthesis, while Stable Cascade uses a novel multi-stage architecture to achieve high-quality image generation with a smaller latent space.

Model inputs and outputs

Inputs

  • Text prompt: A text description of the desired image, which the model uses to generate the corresponding image.

Outputs

  • Image: The generated image based on the input text prompt, with a resolution of 768x768 pixels.

Capabilities

The stable-diffusion-2 model can generate a wide variety of images from text prompts, including photorealistic scenes, imaginative concepts, and abstract compositions. The model has been trained on a large and diverse dataset, allowing it to handle a broad range of subject matter and styles. Some example use cases include:

  • Creating original artwork and illustrations
  • Generating concept art for games, films, or other media
  • Experimenting with different visual styles and aesthetics
  • Assisting with visual brainstorming and ideation

What can I use it for?

The stable-diffusion-2 model is intended for both non-commercial and commercial usage. For non-commercial or research purposes, you can use the model under the CreativeML Open RAIL++-M License. Possible research areas and tasks include:

  • Research on generative models
  • Research on the impact of real-time generative models
  • Probing and understanding the limitations and biases of generative models
  • Generation of artworks and use in design and other artistic processes
  • Applications in educational or creative tools

For commercial use, please refer to https://stability.ai/membership.

Things to try

One interesting aspect of the stable-diffusion-2 model is its ability to generate highly detailed and photorealistic images, even for complex scenes and concepts. Try experimenting with detailed prompts that describe intricate settings, characters, or objects, and see how well the model brings those visions to life. You can also explore the model's versatility by generating images in a variety of styles, from realism to surrealism and impressionism to expressionism, and see how it interprets and renders them.
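
A minimal diffusers sketch for this checkpoint, following the common pattern of loading the scheduler shipped with the stabilityai/stable-diffusion-2 repository, could look like this; the prompt is only illustrative:

```python
import torch
from diffusers import StableDiffusionPipeline, EulerDiscreteScheduler

model_id = "stabilityai/stable-diffusion-2"

# Load the scheduler stored in the repository, then build the pipeline around it.
scheduler = EulerDiscreteScheduler.from_pretrained(model_id, subfolder="scheduler")
pipe = StableDiffusionPipeline.from_pretrained(model_id, scheduler=scheduler, torch_dtype=torch.float16)
pipe = pipe.to("cuda")

image = pipe("a photograph of an astronaut riding a horse", num_inference_steps=25).images[0]
image.save("sd2_sample.png")  # 768x768 output
```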
