## Model overview

`Mustango` is an exciting addition to the world of Multimodal Large Language Models designed for controlled music generation. Developed by the declare-lab team, `Mustango` leverages Latent Diffusion Model (LDM), Flan-T5, and musical features to generate music from text prompts. It builds upon the work of similar models like [MusicGen](https://aimodels.fyi/models/replicate/musicgen-meta) and [MusicGen Remixer](https://aimodels.fyi/models/replicate/musicgen-remixer-sakemin), but with a focus on more fine-grained control and improved overall music quality.

## Model inputs and outputs

`Mustango` takes in a text prompt describing the desired music and generates an audio file in response. The model can be used to create a wide range of musical styles, from ambient to pop, by crafting the right prompts.

### Inputs
- **Prompt**: A text description of the desired music, including details about the instrumentation, genre, tempo, and mood.

### Outputs
- **Audio file**: A generated audio file containing the music based on the input prompt.

## Capabilities

`Mustango` demonstrates impressive capabilities in generating music that closely matches the provided text prompt. The model is able to capture details like instrumentation, rhythm, and mood, and translate them into coherent musical compositions. Compared to earlier text-to-music models, `Mustango` shows significant improvements in terms of overall musical quality and coherence.

## What can I use it for?

`Mustango` opens up a world of possibilities for content creators, musicians, and hobbyists alike. The model can be used to generate custom background music for videos, podcasts, or video games. Composers could leverage `Mustango` to quickly prototype musical ideas or explore new creative directions. Advertisers and marketers may find the model useful for generating jingles or soundtracks for their campaigns.

## Things to try

One interesting aspect of `Mustango` is its ability to generate music in a variety of styles based on the input prompt. Try experimenting with different genres, moods, and levels of detail in your prompts to see the diverse range of musical compositions the model can produce. Additionally, the team has released several pre-trained models, including a [Mustango Pretrained](https://huggingface.co/declare-lab/mustango-pretrained) version, which may be worth exploring for specific use cases.