GLIDE from OpenAI finetuned on roughly 30M more samples. See `laionide-v3` for the latest.

## Model overview

`laionide-v2` is a text-to-image model from LAION-AI, a prominent AI research collective. It is a fine-tuned version of the GLIDE model from OpenAI, trained on an additional 30 million samples. This model can generate photorealistic images from text prompts. Compared to similar models like [`laionide-v3`](https://aimodels.fyi/models/replicate/laionide-v3-laion-ai), `laionide-v2` has a slightly smaller training dataset but may produce images with fewer artifacts. Other related models from LAION-AI include [`ongo`](https://aimodels.fyi/models/replicate/ongo-laion-ai), [`erlich`](https://aimodels.fyi/models/replicate/erlich-laion-ai), and [`puck`](https://aimodels.fyi/models/replicate/puck-laion-ai), which specialize in generating paintings, logos, and retro game art respectively.

## Model inputs and outputs

`laionide-v2` takes a text prompt as input and generates a corresponding image. The model can output images at a range of resolutions, with the ability to generate upscaled versions of the base image. Key input parameters include the text prompt, image dimensions, and various hyperparameters that control the sampling process.

### Inputs
- **Prompt**: The text prompt to use for generating the image
- **Side X**: The width of the generated image in pixels (multiple of 8, up to 128)
- **Side Y**: The height of the generated image in pixels (multiple of 8, up to 128)
- **Batch Size**: The number of images to generate simultaneously (1-6)
- **Upsample Stage**: Whether to perform prompt-aware upsampling to increase the image resolution by 4x
- **Timestep Respacing**: The number of timesteps to use for the base model (5-150)
- **SR Timestep Respacing**: The number of timesteps to use for the upsampling model (5-40)
- **Seed**: A seed value for reproducibility

### Outputs
- **Image**: The generated image file
- **Text**: The prompt used to generate the image

## Capabilities

`laionide-v2` can generate a wide variety of photorealistic images from text prompts, including landscapes, portraits, and abstract scenes. The model is particularly adept at capturing realistic textures, lighting, and details. While it may produce some artifacts or inconsistencies in complex or unusual prompts, the overall quality of the generated images is high.

## What can I use it for?

`laionide-v2` can be a powerful tool for a range of applications, from creative content generation to visual prototyping and illustration. Artists and designers can use the model to quickly explore ideas and concepts, while businesses can leverage it for product visualizations, marketing materials, and more. The model's ability to generate high-quality images from text also makes it suitable for media production, educational resources, and other visual-centric use cases.

## Things to try

Experiment with the model's various input parameters to see how they affect the generated images. Try prompts that combine specific details with more abstract or emotive language to see the model's ability to interpret and translate complex concepts into visuals. You can also explore the model's limitations by providing prompts that are particularly challenging or outside its training distribution.