Create variations of an image while preserving shape and depth.

## Model overview

`stable-diffusion-depth2img` is a Cog implementation of the Diffusers Stable Diffusion v2 model, which is capable of generating variations of an image while preserving its shape and depth. This model builds upon the [Stable Diffusion](https://aimodels.fyi/models/replicate/stable-diffusion-stability-ai) model, which is a powerful latent text-to-image diffusion model that can generate photo-realistic images from any text input. The `stable-diffusion-depth2img` model adds the ability to create variations of an existing image, while maintaining the overall structure and depth information.

## Model inputs and outputs

The `stable-diffusion-depth2img` model takes a variety of inputs to control the image generation process, including a prompt, an existing image, and various parameters to fine-tune the output. The model then generates one or more new images based on these inputs.

### Inputs
- **Prompt**: The text prompt that guides the image generation process.
- **Image**: The existing image that will be used as the starting point for the process.
- **Seed**: An optional random seed value to control the image generation.
- **Scheduler**: The type of scheduler to use for the diffusion process.
- **Num Outputs**: The number of images to generate (up to 8).
- **Guidance Scale**: The scale for classifier-free guidance, which controls the balance between the text prompt and the input image.
- **Negative Prompt**: An optional prompt that specifies what the model should not generate.
- **Prompt Strength**: The strength of the text prompt relative to the input image.
- **Num Inference Steps**: The number of denoising steps to perform during the image generation process.

### Outputs
- **Images**: One or more new images generated based on the provided inputs.

## Capabilities

The `stable-diffusion-depth2img` model can be used to generate a wide variety of image variations based on an existing image. By preserving the shape and depth information from the input image, the model can create new images that maintain the overall structure and composition, while introducing new elements and variations based on the provided text prompt. This can be useful for tasks such as art generation, product design, and architectural visualization.

## What can I use it for?

The `stable-diffusion-depth2img` model can be used for a variety of creative and design-related projects. For example, you could use it to generate concept art for a fantasy landscape, create variations of a product design, or explore different architectural styles for a building. The ability to preserve the shape and depth information of the input image can be particularly useful for these types of applications, as it allows you to maintain the overall structure and composition while introducing new elements and variations.

## Things to try

One interesting thing to try with the `stable-diffusion-depth2img` model is to experiment with different prompts and input images to see how the model generates new variations. Try using a variety of input images, from landscapes to still lifes to abstract art, and see how the model responds to different types of visual information. You can also play with the various parameters, such as guidance scale and prompt strength, to fine-tune the output and explore the limits of the model's capabilities.