Maintainer: nitrosocke

Total Score


Last updated 5/28/2024


Model LinkView on HuggingFace
API SpecView on HuggingFace
Github LinkNo Github link provided
Paper LinkNo paper link provided

Create account to get full access


If you already have an account, we'll log you in

Model overview

The disco-elysium model is a fine-tuned Stable Diffusion model trained on the character portraits from the game Disco Elysium. By incorporating the discoelysium style tokens in your prompts, you can generate images with a distinct visual style inspired by the game. This model is similar to other Stable Diffusion fine-tuned models, such as the disco-diffusion-style model, which applies the Disco Diffusion style to Stable Diffusion using Dreambooth, and the elden-ring-diffusion model, which is trained on art from the Elden Ring game.

Model inputs and outputs

The disco-elysium model is a text-to-image AI model, meaning it takes a text prompt as input and generates a corresponding image as output. The model can create a wide variety of images, from character portraits to landscapes, as long as the prompt is related to the Disco Elysium game world and art style.


  • Text prompt: A natural language description of the desired image, including the discoelysium style token to invoke the specific visual style.


  • Generated image: A visually striking, game-inspired image that matches the provided text prompt.


The disco-elysium model excels at generating high-quality images with a distinct visual flair inspired by the Disco Elysium game. The model can create detailed character portraits, imaginative landscapes, and other visuals that capture the unique aesthetic of the game. By using the discoelysium style token, you can ensure that the generated images maintain the characteristic look and feel of Disco Elysium.

What can I use it for?

The disco-elysium model can be a valuable tool for various creative projects and applications. Artists and designers can use it to quickly generate concept art, character designs, or illustrations with a Disco Elysium-inspired style. Writers and worldbuilders can leverage the model to visualize scenes and characters from their Disco Elysium-inspired stories or campaigns. The model can also be used for commercial purposes, such as generating promotional materials or artwork for Disco Elysium-themed products and merchandise.

Things to try

Experiment with different prompts that incorporate the discoelysium style token, and see how the model's output varies in terms of subject matter, composition, and overall aesthetic. Try combining the discoelysium style with other descriptors, such as specific character types, emotions, or narrative elements, to see how the model blends these elements. Additionally, consider using the disco-elysium model in conjunction with other Stable Diffusion fine-tuned models, such as the elden-ring-diffusion or mo-di-diffusion models, to create unique and visually striking hybrid styles.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Models




Total Score


The disco-diffusion-style model is a Stable Diffusion model that has been fine-tuned to produce images in the distinctive Disco Diffusion style. This model was created by the sd-dreambooth-library team and can be used to generate images with a similar aesthetic to the popular Disco Diffusion tool, characterized by vibrant colors, surreal elements, and dreamlike compositions. Similar models include the midjourney-style concept, which applies a Midjourney-inspired style to Stable Diffusion, and the mo-di-diffusion model, which was fine-tuned on screenshots from a popular animation studio to produce images in a modern Disney art style. Model inputs and outputs Inputs Instance prompt**: A text prompt that describes the desired image, such as "a photo of ddfusion style" Outputs Generated image**: A 512x512 pixel image that reflects the provided prompt in the Disco Diffusion style Capabilities The disco-diffusion-style model can generate unique, imaginative images that capture the vibrant and surreal aesthetic of the Disco Diffusion tool. The model is particularly adept at producing dreamlike scenes, abstract compositions, and visually striking artwork. By incorporating the Disco Diffusion style, this model can help users create striking and memorable images without the need for extensive prompt engineering. What can I use it for? The disco-diffusion-style model can be a valuable tool for creative professionals, digital artists, and anyone looking to experiment with AI-generated imagery. The Disco Diffusion style lends itself well to conceptual art, album covers, promotional materials, and other applications where a visually striking and unconventional aesthetic is desired. Additionally, the model can be used as a starting point for further image editing and refinement, allowing users to build upon the unique qualities of the generated images. The Colab Notebook for Inference provided by the maintainers can help users get started with generating and working with images produced by this model. Things to try One interesting aspect of the disco-diffusion-style model is its ability to capture the dynamic and surreal qualities of the Disco Diffusion aesthetic. Users may want to experiment with prompts that incorporate abstract concepts, fantastical elements, or unconventional compositions to fully embrace the model's capabilities. Additionally, the model's performance may be enhanced by combining it with other techniques, such as prompt engineering or further fine-tuning. By exploring the limits of the model and experimenting with different approaches, users can unlock new and unexpected creative possibilities.

Read more

Updated Invalid Date



Total Score


The elden-ring-diffusion model is a fine-tuned Stable Diffusion model trained on game art from the popular video game Elden Ring. This allows the model to generate images in the distinct style of the game's visuals. Similar models created by the same maintainer, nitrosocke, include Arcane Diffusion, Ghibli Diffusion, and Nitro Diffusion, each trained on different artistic styles. Model inputs and outputs The elden-ring-diffusion model takes text prompts as input and generates corresponding images in the style of Elden Ring. Users can influence the output by including the token elden ring style in their prompts. Inputs Text prompts**: Descriptive text that the model uses to generate images, e.g. "a magical princess with golden hair, elden ring style" Outputs Images**: The generated images based on the provided text prompts, in the distinct visual style of Elden Ring. Capabilities The elden-ring-diffusion model can generate a wide variety of images, including portraits, landscapes, and fantastical scenes, all with the signature look and feel of the Elden Ring game world. The model is particularly adept at capturing the atmospheric, somber, and ominous tone that permeates the Elden Ring aesthetic. What can I use it for? The elden-ring-diffusion model can be a powerful tool for artists, designers, and content creators who want to incorporate the Elden Ring visual style into their projects. This could include creating concept art, promotional materials, fan art, and more. The model's ability to generate images quickly and with high fidelity makes it a valuable asset for those working in the fantasy and gaming spaces. Things to try One interesting aspect of the elden-ring-diffusion model is its ability to blend the Elden Ring style with other artistic influences. By combining the elden ring style token with other keywords, users can experiment with mixing the game's visuals with other aesthetic elements, such as different character archetypes or environmental settings. This can lead to the creation of unique and unexpected imagery that captures the essence of Elden Ring while introducing new creative twists.

Read more

Updated Invalid Date




Total Score


Future-Diffusion is a fine-tuned version of the Stable Diffusion 2.0 base model, trained by nitrosocke on high-quality 3D images with a futuristic sci-fi theme. This model allows users to generate images with a distinct "future style" by incorporating the future style token into their prompts. Compared to similar models like redshift-diffusion-768, Future-Diffusion has a 512x512 resolution, while the redshift model has a higher 768x768 resolution. The Ghibli-Diffusion and Arcane-Diffusion models, on the other hand, are fine-tuned on anime and Arcane-themed images respectively, producing outputs with those distinct visual styles. Model inputs and outputs Future-Diffusion is a text-to-image model, taking text prompts as input and generating corresponding images as output. The model was trained using the diffusers-based dreambooth training approach with prior-preservation loss and the train-text-encoder flag. Inputs Text prompts**: Users provide text descriptions to guide the image generation, such as future style [subject] Negative Prompt: duplicate heads bad anatomy for character generation or future style city market street level at night Negative Prompt: blurry fog soft for landscapes. Outputs Images**: The model generates 512x512 or 1024x576 pixel images based on the provided text prompts, with a futuristic sci-fi style. Capabilities Future-Diffusion can generate a wide range of images with a distinct futuristic aesthetic, including human characters, animals, vehicles, and landscapes. The model's ability to capture this specific style sets it apart from more generic text-to-image models. What can I use it for? The Future-Diffusion model can be useful for various creative and commercial applications, such as: Generating concept art for science fiction stories, games, or films Designing futuristic product visuals or packaging Creating promotional materials or marketing assets with a futuristic flair Exploring and experimenting with novel visual styles and aesthetics Things to try One interesting aspect of Future-Diffusion is the ability to combine the "future style" token with other style tokens, such as those from the Ghibli-Diffusion or Arcane-Diffusion models. This can result in unique and unexpected hybrid styles, allowing users to expand their creative possibilities.

Read more

Updated Invalid Date

AI model preview image



Total Score


Stable Diffusion is a latent text-to-image diffusion model capable of generating photo-realistic images given any text input. Developed by Stability AI, it is an impressive AI model that can create stunning visuals from simple text prompts. The model has several versions, with each newer version being trained for longer and producing higher-quality images than the previous ones. The main advantage of Stable Diffusion is its ability to generate highly detailed and realistic images from a wide range of textual descriptions. This makes it a powerful tool for creative applications, allowing users to visualize their ideas and concepts in a photorealistic way. The model has been trained on a large and diverse dataset, enabling it to handle a broad spectrum of subjects and styles. Model inputs and outputs Inputs Prompt**: The text prompt that describes the desired image. This can be a simple description or a more detailed, creative prompt. Seed**: An optional random seed value to control the randomness of the image generation process. Width and Height**: The desired dimensions of the generated image, which must be multiples of 64. Scheduler**: The algorithm used to generate the image, with options like DPMSolverMultistep. Num Outputs**: The number of images to generate (up to 4). Guidance Scale**: The scale for classifier-free guidance, which controls the trade-off between image quality and faithfulness to the input prompt. Negative Prompt**: Text that specifies things the model should avoid including in the generated image. Num Inference Steps**: The number of denoising steps to perform during the image generation process. Outputs Array of image URLs**: The generated images are returned as an array of URLs pointing to the created images. Capabilities Stable Diffusion is capable of generating a wide variety of photorealistic images from text prompts. It can create images of people, animals, landscapes, architecture, and more, with a high level of detail and accuracy. The model is particularly skilled at rendering complex scenes and capturing the essence of the input prompt. One of the key strengths of Stable Diffusion is its ability to handle diverse prompts, from simple descriptions to more creative and imaginative ideas. The model can generate images of fantastical creatures, surreal landscapes, and even abstract concepts with impressive results. What can I use it for? Stable Diffusion can be used for a variety of creative applications, such as: Visualizing ideas and concepts for art, design, or storytelling Generating images for use in marketing, advertising, or social media Aiding in the development of games, movies, or other visual media Exploring and experimenting with new ideas and artistic styles The model's versatility and high-quality output make it a valuable tool for anyone looking to bring their ideas to life through visual art. By combining the power of AI with human creativity, Stable Diffusion opens up new possibilities for visual expression and innovation. Things to try One interesting aspect of Stable Diffusion is its ability to generate images with a high level of detail and realism. Users can experiment with prompts that combine specific elements, such as "a steam-powered robot exploring a lush, alien jungle," to see how the model handles complex and imaginative scenes. Additionally, the model's support for different image sizes and resolutions allows users to explore the limits of its capabilities. By generating images at various scales, users can see how the model handles the level of detail and complexity required for different use cases, such as high-resolution artwork or smaller social media graphics. Overall, Stable Diffusion is a powerful and versatile AI model that offers endless possibilities for creative expression and exploration. By experimenting with different prompts, settings, and output formats, users can unlock the full potential of this cutting-edge text-to-image technology.

Read more

Updated Invalid Date