kandinskyvideo

Maintainer: cjwbw

Total Score: 1

Last updated: 5/21/2024


  • Model Link: View on Replicate
  • API Spec: View on Replicate
  • Github Link: View on Github
  • Paper Link: View on Arxiv


Model overview

kandinskyvideo is a text-to-video generation model from the Kandinsky team (ai-forever), made available on Replicate by cjwbw. It is based on the FusionFrames architecture, which consists of two main stages: keyframe generation and interpolation. This two-stage approach to temporal conditioning lets the model generate videos with high-quality appearance, smoothness, and dynamics, and it is presented as state-of-the-art among open-source text-to-video generation solutions.

Model inputs and outputs

kandinskyvideo takes a text prompt as input and generates a corresponding video as output. The model uses a text encoder, a latent diffusion U-Net3D, and a MoVQ encoder/decoder to transform the text prompt into a high-quality video.

Inputs

  • Prompt: A text description of the desired video content.
  • Width: The desired width of the output video (default is 640).
  • Height: The desired height of the output video (default is 384).
  • FPS: The frames per second of the output video (default is 10).
  • Guidance Scale: The scale for classifier-free guidance (default is 5).
  • Negative Prompt: A text description of content to avoid in the output video.
  • Num Inference Steps: The number of denoising steps (default is 50).
  • Interpolation Level: The quality level of the interpolation between keyframes (low, medium, or high).
  • Interpolation Guidance Scale: The scale for interpolation guidance (default is 0.25).

Outputs

  • Video: The generated video corresponding to the input prompt.
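
To make these parameters concrete, here is a minimal sketch of calling the model through the Replicate Python client. The model reference `cjwbw/kandinskyvideo` and the snake_case input names are assumptions inferred from the lists above; confirm both against the API Spec link before relying on them, and append a version hash if the API requires one.

```python
import replicate  # pip install replicate; requires REPLICATE_API_TOKEN in the environment

# Hedged example: the model reference and input keys are assumptions, not verified API.
output = replicate.run(
    "cjwbw/kandinskyvideo",  # append ":<version-hash>" to pin a specific version
    input={
        "prompt": "a red sports car drifting through a neon-lit city at night",
        "negative_prompt": "low quality, blurry, watermark",
        "width": 640,
        "height": 384,
        "fps": 10,
        "guidance_scale": 5,
        "num_inference_steps": 50,
        "interpolation_level": "medium",
        "interpolation_guidance_scale": 0.25,
    },
)

# The client typically returns a URL (or file-like object) pointing to the generated video.
print(output)
```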

Capabilities

kandinskyvideo is capable of generating a wide variety of videos from text prompts, including scenes of cars drifting, chemical explosions, erupting volcanoes, luminescent jellyfish, and more. The model is able to produce high-quality, dynamic videos with smooth transitions and realistic details.

What can I use it for?

You can use kandinskyvideo to generate videos for a variety of applications, such as creative content, visual effects, and entertainment. For example, you could use it to create video assets for social media, film productions, or immersive experiences. The model's ability to generate unique video content from text prompts makes it a valuable tool for content creators and visual artists.

Things to try

Some interesting things to try with kandinskyvideo include generating videos with specific moods or emotions, experimenting with different levels of detail and realism, and exploring the model's capabilities for generating more abstract or fantastical video content. You can also try using the model in combination with other tools, such as VideoCrafter2 or TokenFlow, to create even more complex and compelling video experiences.



This summary was produced with help from an AI and may contain inaccuracies; check the links above to read the original source documents.

Related Models


controlvideo

Maintainer: cjwbw

Total Score: 1

ControlVideo is a text-to-video generation model developed by cjwbw that can generate high-quality and consistent videos without any finetuning. It adapts the successful ControlNet framework to the video domain, allowing users to generate videos conditioned on various control signals such as depth maps, canny edges, and human poses. This makes ControlVideo a versatile tool for creating dynamic, controllable video content from text prompts. The model shares similarities with other text-to-video generation models like VideoCrafter2, KandinskyVideo, and TokenFlow developed by the same maintainer. However, ControlVideo stands out by directly inheriting the high-quality and consistent generation capabilities of ControlNet without any finetuning.

Model inputs and outputs

ControlVideo takes in a text prompt describing the desired video, a reference video, and a control signal (such as depth maps, canny edges, or human poses) to guide the video generation process. The model then outputs a synthesized video that matches the text prompt and control signal.

Inputs

  • Prompt: A text description of the desired video (e.g., "A striking mallard floats effortlessly on the sparkling pond.").
  • Video Path: A reference video that provides additional context for the generation.
  • Condition: The type of control signal to use, such as depth maps, canny edges, or human poses.
  • Video Length: The desired length of the generated video.
  • Is Long Video: A flag to enable efficient long-video synthesis.
  • Guidance Scale: The scale for classifier-free guidance during the generation process.
  • Smoother Steps: The timesteps at which to apply an interleaved-frame smoother.
  • Num Inference Steps: The number of denoising steps to perform during the generation process.

Outputs

  • Output: A synthesized video that matches the input prompt and control signal.

Capabilities

ControlVideo can generate high-quality, consistent, and controllable videos from text prompts. The model's ability to leverage various control signals, such as depth maps, canny edges, and human poses, allows for a wide range of video generation possibilities. Users can create dynamic, visually appealing videos depicting a variety of scenes and subjects, from natural landscapes to abstract animations.

What can I use it for?

With ControlVideo, you can generate video content for a wide range of applications, such as:

  • Creative visual content: Create eye-catching videos for social media, marketing, or artistic expression.
  • Educational and instructional videos: Generate videos to visually explain complex concepts or demonstrate procedures.
  • Video game and animation prototyping: Use the model to quickly create video assets for game development or animated productions.
  • Video editing and enhancement: Leverage the model's capabilities to enhance or modify existing video footage.

Things to try

One interesting aspect of ControlVideo is its ability to generate long-form videos efficiently. By enabling the "Is Long Video" flag, users can produce extended video sequences that maintain the model's characteristic high quality and consistency. This feature opens up opportunities for creating immersive, continuous video experiences. Another intriguing aspect is the model's versatility in generating videos across different styles and genres, from realistic natural scenes to cartoon-like animations. Experimenting with various control signals and text prompts can lead to the creation of unique and visually compelling video content.
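
To ground the inputs listed above, here is a hedged sketch of a ControlVideo call via the Replicate client. The `cjwbw/controlvideo` reference and every input key below (especially `condition`, `video_path`, and the format of `smoother_steps`) are assumptions to verify against the model's API page; the numeric values are only illustrative.

```python
import replicate

# Hedged sketch: model reference and input keys are assumptions, not verified API.
output = replicate.run(
    "cjwbw/controlvideo",
    input={
        "prompt": "A striking mallard floats effortlessly on the sparkling pond.",
        "video_path": open("reference.mp4", "rb"),  # reference clip supplying structure/motion
        "condition": "depth",          # e.g. "depth", "canny", or "pose" control signals
        "video_length": 15,            # illustrative value
        "is_long_video": False,
        "guidance_scale": 12.5,        # illustrative value
        "smoother_steps": "19,20",     # timesteps for the interleaved-frame smoother (format assumed)
        "num_inference_steps": 50,
    },
)
print(output)
```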



text2video-zero

Maintainer: cjwbw

Total Score: 40

The text2video-zero model, developed by Picsart AI Research and maintained on Replicate by cjwbw, leverages the power of existing text-to-image synthesis methods, like Stable Diffusion, to enable zero-shot video generation. This means the model can generate videos directly from text prompts without any additional training or fine-tuning. The model is capable of producing temporally consistent videos that closely follow the provided textual guidance. The text2video-zero model is related to other text-guided diffusion models like Clip-Guided Diffusion and TextDiffuser, which explore various techniques for using diffusion models as text-to-image and text-to-video generators.

Model inputs and outputs

Inputs

  • Prompt: The textual description of the desired video content.
  • Model Name: The Stable Diffusion model to use as the base for video generation.
  • Timestep T0 and T1: The range of DDPM steps to perform, controlling the level of variance between frames.
  • Motion Field Strength X and Y: Parameters that control the amount of motion applied to the generated frames.
  • Video Length: The desired duration of the output video.
  • Seed: An optional random seed to ensure reproducibility.

Outputs

  • Video: The generated video file based on the provided prompt and parameters.

Capabilities

The text2video-zero model can generate a wide variety of videos from text prompts, including scenes with animals, people, and fantastical elements. For example, it can produce videos of "a horse galloping on a street", "a panda surfing on a wakeboard", or "an astronaut dancing in outer space". The model is able to capture the movement and dynamics of the described scenes, resulting in temporally consistent and visually compelling videos.

What can I use it for?

The text2video-zero model can be useful for a variety of applications, such as:

  • Generating video content for social media, marketing, or entertainment purposes.
  • Prototyping and visualizing ideas or concepts that can be described in text form.
  • Experimenting with creative video generation and exploring the boundaries of what is possible with AI-powered video synthesis.

Things to try

One interesting aspect of the text2video-zero model is its ability to incorporate additional guidance, such as poses or edges, to further influence the generated video. By providing a reference video or image with canny edges, the model can generate videos that closely follow the visual structure of the guidance, while still adhering to the textual prompt. Another intriguing feature is the model's support for Dreambooth specialization, which allows you to fine-tune the model on a specific visual style or character. This can be used to generate videos that have a distinct artistic or stylistic flair, such as "an astronaut dancing in the style of Van Gogh's Starry Night".
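
A hedged sketch of how such a call might look through the Replicate client follows; the model reference and all input key names (e.g. `t0`, `t1`, `motion_field_strength_x`) are assumptions to check against the model's API page, and the base checkpoint name is only illustrative.

```python
import replicate

# Hedged sketch: model reference, input keys, and values are assumptions.
output = replicate.run(
    "cjwbw/text2video-zero",
    input={
        "prompt": "a panda surfing on a wakeboard",
        "model_name": "dreamlike-art/dreamlike-photoreal-2.0",  # illustrative base Stable Diffusion checkpoint
        "t0": 44,                          # start of the DDPM timestep range
        "t1": 47,                          # end of the DDPM timestep range
        "motion_field_strength_x": 12,
        "motion_field_strength_y": 12,
        "video_length": 8,                 # illustrative number of frames
        "seed": 42,
    },
)
print(output)
```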



kandinsky-2.2

Maintainer: ai-forever

Total Score: 9.1K

kandinsky-2.2 is a multilingual text-to-image latent diffusion model created by ai-forever. It is an update to the previous kandinsky-2 model, which was trained on the LAION HighRes dataset and fine-tuned on internal datasets. kandinsky-2.2 builds upon this foundation to generate a wide range of images based on text prompts.

Model inputs and outputs

kandinsky-2.2 takes text prompts as input and generates corresponding images as output. The model supports several customization options, including the ability to specify the image size, number of output images, and output format.

Inputs

  • Prompt: The text prompt that describes the desired image.
  • Negative Prompt: Text describing elements that should not be present in the output image.
  • Seed: A random seed value to control the image generation process.
  • Width/Height: The desired dimensions of the output image.
  • Num Outputs: The number of images to generate (up to 4).
  • Num Inference Steps: The number of denoising steps during image generation.
  • Num Inference Steps Prior: The number of denoising steps for the priors.

Outputs

  • Image(s): One or more images generated based on the input prompt.

Capabilities

kandinsky-2.2 is capable of generating a wide variety of photorealistic and imaginative images based on text prompts. The model can create images depicting scenes, objects, and even abstract concepts. It performs well across multiple languages, making it a versatile tool for global audiences.

What can I use it for?

kandinsky-2.2 can be used for a range of creative and practical applications, such as:

  • Generating custom artwork and illustrations for digital content
  • Visualizing ideas and concepts for product design or marketing
  • Creating unique images for social media, blogs, and other online platforms
  • Exploring creative ideas and experimenting with different artistic styles

Things to try

With kandinsky-2.2, you can experiment with different prompts to see the variety of images the model can generate. Try prompts that combine specific elements, such as "a moss covered astronaut with a black background," or more abstract concepts like "the essence of poetry." Adjust the various input parameters to see how they affect the output.
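
To illustrate the inputs above, here is a hedged sketch of a kandinsky-2.2 call via the Replicate client; the `ai-forever/kandinsky-2.2` reference and the parameter names are assumptions to confirm against the model's API page.

```python
import replicate

# Hedged sketch: model reference and parameter names are assumptions.
images = replicate.run(
    "ai-forever/kandinsky-2.2",
    input={
        "prompt": "a moss covered astronaut with a black background",
        "negative_prompt": "low quality, deformed",
        "width": 1024,
        "height": 1024,
        "num_outputs": 2,                 # up to 4 per the list above
        "num_inference_steps": 75,
        "num_inference_steps_prior": 25,
        "seed": 1234,
    },
)

# A text-to-image model typically returns one URL per generated image.
for url in images:
    print(url)
```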



stable-diffusion

Maintainer: stability-ai

Total Score: 107.9K

Stable Diffusion is a latent text-to-image diffusion model capable of generating photo-realistic images given any text input. Developed by Stability AI, it is an impressive AI model that can create stunning visuals from simple text prompts. The model has several versions, with each newer version being trained for longer and producing higher-quality images than the previous ones. The main advantage of Stable Diffusion is its ability to generate highly detailed and realistic images from a wide range of textual descriptions. This makes it a powerful tool for creative applications, allowing users to visualize their ideas and concepts in a photorealistic way. The model has been trained on a large and diverse dataset, enabling it to handle a broad spectrum of subjects and styles.

Model inputs and outputs

Inputs

  • Prompt: The text prompt that describes the desired image. This can be a simple description or a more detailed, creative prompt.
  • Seed: An optional random seed value to control the randomness of the image generation process.
  • Width and Height: The desired dimensions of the generated image, which must be multiples of 64.
  • Scheduler: The algorithm used to generate the image, with options like DPMSolverMultistep.
  • Num Outputs: The number of images to generate (up to 4).
  • Guidance Scale: The scale for classifier-free guidance, which controls the trade-off between image quality and faithfulness to the input prompt.
  • Negative Prompt: Text that specifies things the model should avoid including in the generated image.
  • Num Inference Steps: The number of denoising steps to perform during the image generation process.

Outputs

  • Array of image URLs: The generated images are returned as an array of URLs pointing to the created images.

Capabilities

Stable Diffusion is capable of generating a wide variety of photorealistic images from text prompts. It can create images of people, animals, landscapes, architecture, and more, with a high level of detail and accuracy. The model is particularly skilled at rendering complex scenes and capturing the essence of the input prompt. One of the key strengths of Stable Diffusion is its ability to handle diverse prompts, from simple descriptions to more creative and imaginative ideas. The model can generate images of fantastical creatures, surreal landscapes, and even abstract concepts with impressive results.

What can I use it for?

Stable Diffusion can be used for a variety of creative applications, such as:

  • Visualizing ideas and concepts for art, design, or storytelling
  • Generating images for use in marketing, advertising, or social media
  • Aiding in the development of games, movies, or other visual media
  • Exploring and experimenting with new ideas and artistic styles

The model's versatility and high-quality output make it a valuable tool for anyone looking to bring their ideas to life through visual art. By combining the power of AI with human creativity, Stable Diffusion opens up new possibilities for visual expression and innovation.

Things to try

One interesting aspect of Stable Diffusion is its ability to generate images with a high level of detail and realism. Users can experiment with prompts that combine specific elements, such as "a steam-powered robot exploring a lush, alien jungle," to see how the model handles complex and imaginative scenes. Additionally, the model's support for different image sizes and resolutions allows users to explore the limits of its capabilities. By generating images at various scales, users can see how the model handles the level of detail and complexity required for different use cases, such as high-resolution artwork or smaller social media graphics. Overall, Stable Diffusion is a powerful and versatile AI model that offers endless possibilities for creative expression and exploration. By experimenting with different prompts, settings, and output formats, users can unlock the full potential of this cutting-edge text-to-image technology.
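
To tie the inputs above to actual usage, here is a hedged sketch of a call to `stability-ai/stable-diffusion` through the Replicate client; scheduler names, defaults, and the exact parameter set should be confirmed on the model's API page.

```python
import replicate

# Hedged sketch: parameter names mirror the list above but are not verified API.
images = replicate.run(
    "stability-ai/stable-diffusion",
    input={
        "prompt": "a steam-powered robot exploring a lush, alien jungle",
        "negative_prompt": "blurry, low quality",
        "width": 768,                      # must be a multiple of 64
        "height": 512,                     # must be a multiple of 64
        "scheduler": "DPMSolverMultistep",
        "num_outputs": 1,
        "guidance_scale": 7.5,
        "num_inference_steps": 50,
        "seed": 0,
    },
)

# The model returns an array of URLs, one per generated image.
for url in images:
    print(url)
```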
