Zero_shot_audio_source_separation

retrocirce

The Zero Shot Audio Source Separation model separates individual sound sources from an audio mixture, including sources it was never explicitly trained on. Rather than relying on a fixed set of source classes, it uses a query sample supplied by the user to decide which sound to extract. This makes separation more flexible: targeting a new sound requires only a new query sample, not extensive additional training.
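To make the idea of query-driven separation concrete, here is a deliberately simplified toy sketch. It is not the model's method: the model is a learned system whose internals this page does not describe, while the sketch swaps in a fixed spectral mask derived from the query. All names (`separate_by_query`, `threshold`, the synthetic tones) are hypothetical and chosen for illustration only.

```python
import cmath
import math


def dft(signal):
    """Naive discrete Fourier transform (fine for short toy signals)."""
    n = len(signal)
    return [
        sum(signal[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
        for k in range(n)
    ]


def idft(spectrum):
    """Naive inverse DFT, returning the real part of each sample."""
    n = len(spectrum)
    return [
        sum(spectrum[k] * cmath.exp(2j * math.pi * k * t / n) for k in range(n)).real / n
        for t in range(n)
    ]


def separate_by_query(mixture, query, threshold=0.5):
    """Keep only the mixture's frequency bins in which the query is strong.

    Toy stand-in for query-conditioned separation: the query decides
    which part of the mixture is extracted. Both inputs must have the
    same length.
    """
    query_mag = [abs(v) for v in dft(query)]
    peak = max(query_mag)
    mask = [1.0 if m >= threshold * peak else 0.0 for m in query_mag]
    return idft([m * s for m, s in zip(mask, dft(mixture))])


# Synthetic example: two pure tones mixed together.
n = 64
tone_a = [math.sin(2 * math.pi * 4 * t / n) for t in range(n)]
tone_b = [0.8 * math.sin(2 * math.pi * 11 * t / n) for t in range(n)]
mixture = [a + b for a, b in zip(tone_a, tone_b)]

# The query resembles tone_a in pitch but differs in loudness and phase.
query = [0.3 * math.sin(2 * math.pi * 4 * t / n + 1.0) for t in range(n)]

estimate = separate_by_query(mixture, query)
max_error = max(abs(e - a) for e, a in zip(estimate, tone_a))
```

In this toy setting the estimate matches `tone_a` up to floating-point error, because the two tones occupy different frequency bins. Real recordings overlap in frequency, which is why a learned model is used instead of a fixed mask.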

Use cases

The Zero Shot Audio Source Separation model has potential use cases in several industries. In music production, it can isolate individual instruments or vocals from a mixed recording so that specific elements of a song can be remixed or enhanced. In audio transcription and speech recognition, it can help separate speakers or remove background noise, which may improve the accuracy of transcriptions and voice commands. In surveillance and security applications, it can extract specific sounds from recordings for analysis, such as gunshots or breaking glass. In games, it can separate audio sources in immersive environments to make the virtual world feel more realistic.

Audio-to-Audio

Pricing

Cost per run: $0.0165 USD
Avg run time: 30 seconds
Prediction hardware: Nvidia T4 GPU

Creator Models

Model | Cost | Runs
Zero_shot_audio_source_separation | $? | 0

Overview

Summary of this model and related resources.

Property | Value
Creator | retrocirce
Model Name | Zero_shot_audio_source_separation
Description | Zero-shot sound separation by arbitrary query samples
Tags | Audio-to-Audio
Model Link | View on Replicate
API Spec | View on Replicate
GitHub Link | View on GitHub
Paper Link | View on arXiv

Popularity

How popular is this model, by number of runs? How popular is the creator, by the sum of all their runs?

Property | Value
Runs | 8,997
Model Rank |
Creator Rank |

Cost

How much does it cost to run this model? How long, on average, does it take to complete a run?

Property | Value
Cost per Run | $0.0165
Prediction Hardware | Nvidia T4 GPU
Average Completion Time | 30 seconds