glid-3-xl

Maintainer: afiaka87 - Last updated 12/13/2024

Model overview

The glid-3-xl model is a text-to-image diffusion model maintained on Replicate by afiaka87. It is a finetuned version of the CompVis latent-diffusion model, with improvements for inpainting tasks. Compared to similar models like stable-diffusion, inkpunk-diffusion, and inpainting-xl, glid-3-xl focuses specifically on high-quality inpainting.

Model inputs and outputs

The glid-3-xl model takes a text prompt, an optional initial image, and an optional mask as inputs. It then generates a new image that matches the text prompt, while preserving the content of the initial image where the mask specifies. The outputs are one or more high-resolution images.

Inputs

  • Prompt: The text prompt describing the desired image
  • Init Image: An optional initial image to use as a starting point
  • Mask: An optional mask image specifying which parts of the initial image to keep

Outputs

  • Generated Images: One or more high-resolution images matching the text prompt, with the initial image content preserved where specified by the mask
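
As a rough sketch of how these inputs fit together, the model could be called through the Replicate Python client along the following lines. The model identifier, input field names (prompt, init_image, mask), and file names are assumptions based on the inputs listed above rather than the model's confirmed schema; a pinned version hash may also be required.

```python
# Minimal sketch using the Replicate Python client (pip install replicate,
# REPLICATE_API_TOKEN set in the environment). Field names and file names
# are assumptions based on the inputs listed above.
import replicate

output = replicate.run(
    "afiaka87/glid-3-xl",  # a pinned version may be needed: "afiaka87/glid-3-xl:<version>"
    input={
        "prompt": "a cozy reading nook with a large window, golden hour light",
        # Optional inpainting inputs: content of the init image is preserved
        # outside the masked region, and the masked area is regenerated
        # from the prompt. File names here are placeholders.
        "init_image": open("room.png", "rb"),
        "mask": open("room_mask.png", "rb"),
    },
)

# The output is typically a list of generated images (URLs or file-like
# objects, depending on the client version).
for i, image in enumerate(output):
    print(i, image)
```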

Capabilities

The glid-3-xl model excels at generating high-quality images that match text prompts, while also allowing for inpainting of existing images. It can produce detailed, photorealistic illustrations as well as more stylized artwork. The inpainting capabilities make it useful for tasks like editing and modifying existing images.

What can I use it for?

The glid-3-xl model is well-suited for a variety of creative and generative tasks. You could use it to create custom illustrations, concept art, or product designs based on textual descriptions. The inpainting functionality also makes it useful for tasks like photo editing, object removal, and image manipulation. Businesses could leverage the model to generate visuals for marketing, product design, or even custom content creation.

Things to try

Try experimenting with different types of prompts to see the range of images the glid-3-xl model can generate. You can also play with the inpainting capabilities by providing an initial image and mask to see how the model can modify and enhance existing visuals. Additionally, try adjusting the various input parameters like guidance scale and aesthetic weight to see how they impact the output.
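
For instance, a simple way to see the effect of one parameter is to hold the prompt and seed fixed and sweep the value. The following is a hypothetical sketch using the Replicate Python client; the guidance_scale and seed field names are assumptions based on the parameters mentioned above.

```python
# Hypothetical parameter sweep: generate the same prompt at several guidance
# scales to compare how strongly the output follows the text. Field names
# ("guidance_scale", "seed") are assumptions; consult the model's input schema.
import replicate

prompt = "an oil painting of a lighthouse in a storm"
for scale in (3.0, 5.0, 7.5, 10.0):
    output = replicate.run(
        "afiaka87/glid-3-xl",
        input={"prompt": prompt, "guidance_scale": scale, "seed": 42},
    )
    print(f"guidance_scale={scale}: {list(output)}")
```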



Total Score: 7

Related Models

pyglide

Total Score: 18

Maintainer: afiaka87

pyglide is a text-to-image generation model built on OpenAI's GLIDE (Guided Language-to-Image Diffusion for Generation and Editing), the predecessor to the popular DALL-E 2 model. It adds faster pseudo Runge-Kutta (PRK) and pseudo linear multistep (PLMS) sampling. The model was developed by afiaka87, who has also created other AI models like stable-diffusion, stable-diffusion-speed-lab, and open-dalle-1.1-lora.

Model inputs and outputs

pyglide takes in a text prompt and generates a corresponding image. The model supports various input parameters such as seed, side dimensions, batch size, guidance scale, and more. The output is an array of image URLs, with each URL representing a generated image.

Inputs

  • Prompt: The text prompt to use for image generation
  • Seed: A seed value for reproducibility
  • Side X: The width of the image (must be a multiple of 8)
  • Side Y: The height of the image (must be a multiple of 8)
  • Batch Size: The number of images to generate (between 1 and 8)
  • Upsample Temperature: The temperature to use for the upsampling stage
  • Guidance Scale: The classifier-free guidance scale (between 4 and 16)
  • Upsample Stage: Whether to use both the base and upsample models
  • Timestep Respacing: The number of timesteps to use for base model sampling
  • SR Timestep Respacing: The number of timesteps to use for upsample model sampling

Outputs

  • Array of Image URLs: The generated images as a list of URLs

Capabilities

pyglide is capable of generating photorealistic images from text prompts. Like other text-to-image models, it can create a wide variety of images, from realistic scenes to abstract concepts. Its fast sampling and the option to run both the base and upsample models make it a powerful tool for quick image generation.

What can I use it for?

You can use pyglide for a variety of applications, such as creating illustrations, generating product images, designing book covers, or producing concept art for games and movies. The model's speed and flexibility make it a valuable tool for creative professionals and hobbyists alike.

Things to try

One interesting thing to try with pyglide is experimenting with the guidance scale parameter. Adjusting it can significantly change the generated images, letting you move between more photorealistic and more abstract or stylized outputs. You can also enable the upsample stage to compare the quality and detail of the base and upsampled outputs.
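
As a hedged illustration of the size constraint above, the snippet below rounds the requested image dimensions to a multiple of 8 before calling the model via the Replicate Python client. The model identifier and field names (side_x, side_y, batch_size, guidance_scale, upsample_stage) are assumptions derived from the input list, not a confirmed schema.

```python
# Sketch: snap requested dimensions to a multiple of 8 (as the inputs above
# require), then run the base + upsample pipeline. Field names are assumptions.
import replicate


def snap_to_multiple_of_8(n: int) -> int:
    """Round a requested dimension down to the nearest multiple of 8 (minimum 8)."""
    return max(8, (n // 8) * 8)


output = replicate.run(
    "afiaka87/pyglide",  # a pinned version hash may be required
    input={
        "prompt": "a watercolor sketch of a lighthouse",
        "side_x": snap_to_multiple_of_8(300),  # 300 -> 296
        "side_y": snap_to_multiple_of_8(300),  # 300 -> 296
        "batch_size": 4,          # described range: 1 to 8
        "guidance_scale": 8.0,    # described range: 4 to 16
        "upsample_stage": True,   # run both the base and upsample models
    },
)
print(list(output))
```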

Updated 12/13/2024

Text-to-Image

laionide-v4

Total Score: 9

Maintainer: afiaka87

The laionide-v4 model is a finetuned version of the GLIDE text-to-image model, developed by afiaka87. It builds upon the capabilities of GLIDE by incorporating additional training on curated datasets, allowing it to generate images with human subjects and experimental artistic styles. Similar models include laionide-v3, which is also a GLIDE model finetuned on the LAION5B dataset and curated datasets, as well as the erlich and ongo models from LAION-AI, which generate logos and paintings from text, respectively.

Model inputs and outputs

The laionide-v4 model takes a text prompt as input and generates a corresponding image as output. The model can handle a wide variety of prompts, from simple scenes to more abstract and artistic concepts.

Inputs

  • Prompt: A text description of the desired image to generate.
  • Style Prompt: An optional text prompt to guide the generation of the image in a particular artistic style.
  • Batch Size: The number of images to generate at once, up to 8.
  • Guidance Scale: A parameter that controls how closely the generated image matches the text prompt, with higher values producing images that are more faithful to the text.
  • Style Guidance Scale: A similar parameter that controls the influence of the style prompt.
  • Timestep Respacing: The number of timesteps to use during the diffusion process, with higher values generally producing higher-quality images.

Outputs

  • Image(s): The generated image(s) corresponding to the input prompt and style prompt (if provided).

Capabilities

The laionide-v4 model can generate a wide range of images, from realistic scenes to more abstract and stylized artwork. It is particularly adept at incorporating human subjects and experimental artistic styles into the generated images, making it a versatile tool for creative applications.

What can I use it for?

The laionide-v4 model can be used for a variety of applications, such as:

  • Content creation: Generate unique images for blog posts, social media, or other creative projects.
  • Design and prototyping: Quickly explore visual ideas and concepts before investing time in more detailed design work.
  • Artistic experimentation: Experiment with different artistic styles and techniques by incorporating style prompts into the text-to-image generation process.

Things to try

Some interesting things to try with the laionide-v4 model include:

  • Exploring the effects of different style prompts on the generated images.
  • Trying out prompts that combine realistic elements with more abstract or surreal components, to see how the model handles such hybrid concepts.
  • Experimenting with different values for the guidance scale and timestep respacing parameters to find the best balance between image quality and generation time.

Updated 12/13/2024

Text-to-Image

clip-guided-diffusion

Total Score: 43

Maintainer: afiaka87

clip-guided-diffusion is an AI model that can generate images from text prompts. It works by using a CLIP (Contrastive Language-Image Pre-training) model to guide a denoising diffusion model during the image generation process, which allows it to produce images that are semantically aligned with the input text. The model was created by afiaka87, who has also developed similar text-to-image models like sd-aesthetic-guidance and retrieval-augmented-diffusion.

Model inputs and outputs

clip-guided-diffusion takes text prompts as input and generates corresponding images as output. The model can also accept an initial image to blend with the generated output. The main input parameters include the text prompt, the image size, the number of diffusion steps, and the clip guidance scale.

Inputs

  • Prompts: The text prompt(s) to use for image generation, with optional weights.
  • Image Size: The size of the generated image, which can be 64, 128, 256, or 512 pixels.
  • Timestep Respacing: The number of diffusion steps to use, which affects the speed and quality of the generated images.
  • Clip Guidance Scale: The scale for the CLIP spherical distance loss, which controls how closely the generated image matches the text prompt.

Outputs

  • Generated Images: The model outputs one or more images that match the input text prompt.

Capabilities

clip-guided-diffusion can generate a wide variety of images from text prompts, including scenes, objects, and abstract concepts. The model is particularly skilled at capturing the semantic meaning of the text and producing visually coherent and plausible images. However, the generation process can be relatively slow compared to other text-to-image models.

What can I use it for?

clip-guided-diffusion can be used for a variety of creative and practical applications, such as:

  • Generating custom artwork and illustrations for personal or commercial use
  • Prototyping and visualizing ideas before implementing them
  • Enhancing existing images by blending them with text-guided generations
  • Exploring and experimenting with different artistic styles and visual concepts

Things to try

One interesting aspect of clip-guided-diffusion is the ability to control the generated images through the use of weights in the text prompts. By assigning positive or negative weights to different components of the prompt, you can influence the model to emphasize or de-emphasize certain aspects of the output. This can be particularly useful for fine-tuning the generated images to match your specific preferences or requirements. Another useful feature is the ability to blend an existing image with the text-guided diffusion process. This can help incorporate specific visual elements or styles into the generated output, or refine and improve upon existing images.
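
As a loosely hedged sketch of the weighted-prompt idea described above, the snippet below passes multiple prompt components with positive and negative weights through the Replicate Python client. The "|"-separated prompt:weight syntax, field names, and values are assumptions modeled on common CLIP-guidance tools, not the model's confirmed input format; check the model's documentation before relying on them.

```python
# Hypothetical sketch of weighted prompts with clip-guided-diffusion.
# The "prompt:weight" pairs joined by "|" are an assumed syntax, and the
# field names/values below are guesses -- verify against the model docs.
import replicate

output = replicate.run(
    "afiaka87/clip-guided-diffusion",  # a pinned version hash may be required
    input={
        # Emphasize the forest; de-emphasize buildings with a negative weight.
        "prompts": "a misty pine forest at dawn:1.0|buildings:-0.5",
        "image_size": 256,            # one of the documented sizes: 64/128/256/512
        "clip_guidance_scale": 1000,  # assumed magnitude; tune per the model docs
        "timestep_respacing": "250",  # fewer steps is faster, more is higher quality
    },
)
print(list(output))
```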

Updated 12/13/2024

Text-to-Image

glid-3-xl

Total Score: 45

Maintainer: jack000

glid-3-xl is a 1.4B parameter text-to-image model developed by CompVis and fine-tuned by jack000. It is a back-ported version of CompVis' latent diffusion model to the guided diffusion codebase. Unlike the original stable-diffusion model, glid-3-xl has been split into three checkpoints, allowing for fine-tuning on new datasets and additional tasks like inpainting and super-resolution.

Model inputs and outputs

The glid-3-xl model takes in a text prompt, an optional init image, and various parameters to control the image generation process. It outputs one or more generated images that match the given text prompt.

Inputs

  • Prompt: Your text prompt describing the image you want to generate.
  • Negative Prompt: (Optional) Text to negatively influence the model's prediction.
  • Init Image: (Optional) An initial image to use as a starting point for the generation.
  • Seed: (Optional) A seed value for the random number generator.
  • Steps: The number of diffusion steps to run, controlling the quality and detail of the output.
  • Guidance Scale: A value controlling the trade-off between faithfulness to the prompt and sample diversity.
  • Width/Height: The target size of the generated image.
  • Batch Size: The number of images to generate at once.

Outputs

  • Image(s): One or more generated images that match the given text prompt.

Capabilities

glid-3-xl is capable of generating high-quality, photorealistic images from text prompts. It can handle a wide range of subjects and styles, from realistic scenes to abstract and surreal compositions. The model has also been fine-tuned for inpainting, allowing you to edit and modify existing images.

What can I use it for?

You can use glid-3-xl to generate custom images for a variety of applications, such as:

  • Illustration and concept art
  • Product visualizations
  • Social media content
  • Advertising and marketing materials
  • Educational resources
  • Personal creative projects

The ability to fine-tune the model on new datasets also opens up possibilities for domain-specific applications, such as generating medical illustrations or architectural visualizations.

Things to try

One interesting aspect of glid-3-xl is the ability to use an init image and apply human-guided diffusion to iteratively refine the generation. This allows you to start with a basic image and progressively edit it to better match your desired prompt. You can also experiment with the various sampling techniques, such as PLMS and classifier-free guidance, to find the approach that works best for your use case.

Updated 12/13/2024

Text-to-Image