Whisper Subtitles


whisper-subtitles uses OpenAI's Whisper automatic speech recognition (ASR) model to generate subtitles from an audio file. It takes audio as input and transcribes the speech into text, which can then be used as subtitles or captions for videos, podcasts, and other spoken content.
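To make the audio-to-subtitle step concrete, here is a minimal sketch of how timed transcript segments could be turned into SRT subtitle text. It assumes each segment is a dict with "start" and "end" times in seconds and a "text" string, which is the shape Whisper's transcription segments use; the helper names format_timestamp and segments_to_srt are hypothetical, and this page does not say which subtitle format whisper-subtitles itself emits.

```python
def format_timestamp(seconds: float) -> str:
    """Render a time in seconds as an SRT timestamp (HH:MM:SS,mmm)."""
    millis = round(seconds * 1000)
    hours, millis = divmod(millis, 3_600_000)
    minutes, millis = divmod(millis, 60_000)
    secs, millis = divmod(millis, 1_000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{millis:03d}"


def segments_to_srt(segments) -> str:
    """Build SRT text from segments shaped like {"start", "end", "text"}.

    The segment shape is an assumption for this sketch, not a documented
    output of the whisper-subtitles model.
    """
    blocks = []
    for index, seg in enumerate(segments, start=1):
        start = format_timestamp(seg["start"])
        end = format_timestamp(seg["end"])
        text = seg["text"].strip()
        # Each SRT cue: sequence number, time range, text, then a blank line.
        blocks.append(f"{index}\n{start} --> {end}\n{text}\n")
    return "\n".join(blocks)


if __name__ == "__main__":
    demo = [
        {"start": 0.0, "end": 2.5, "text": " Hello world."},
        {"start": 2.5, "end": 5.0, "text": " This is a test."},
    ]
    print(segments_to_srt(demo))
```

The resulting string can be written to a .srt file and loaded by most video players alongside the original media.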

Use cases

The whisper-subtitles model is useful wherever spoken audio needs to become text. In media and entertainment, it can generate subtitles for movies, TV shows, and online videos, which improves accessibility for deaf and hard-of-hearing viewers and supports broader international distribution. In education, it can caption online courses, making the material more accessible to students. In transcription work, it can automate the conversion of audio recordings into text, saving transcriptionists time and effort. It can also be integrated into voice assistants to transcribe spoken commands or queries. More generally, the model can serve as a building block for products that depend on converting audio to text.




Creator Models

Model                    Cost per run   Runs
Hello World Rust         $?             31
Emoji Diffusion          $0.0115        2,437
Safe Latent Diffusion    $0.3381        1,829
Ghibli Diffusion         $0.0138        1,107
Stable Diffusion Rs      $?             55


Try it!

You can use this area to play around with demo applications that incorporate the Whisper Subtitles model. These demos are maintained and hosted externally by third-party creators. If you see an error, message me on Twitter.

Currently, there are no demos available for this model.


Summary of this model and related resources.

Model Name: Whisper Subtitles
Description: Generate subtitles from an audio file, using OpenAI's Whisper model.
Model Link: View on Replicate
API Spec: View on Replicate
Github Link: View on Github
Paper Link: View on Arxiv




How much does it cost to run this model? How long, on average, does it take to complete a run?

Cost per Run: $0.00825
Prediction Hardware: Nvidia T4 GPU
Average Completion Time: 15 seconds
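The figures above are enough for a rough budget estimate. The sketch below derives the per-second rate implied by the listed cost and average run time, then scales both to a batch of runs. It assumes every run matches the listed averages and that runs execute one after another; actual cost and duration will vary with audio length. The function names are hypothetical.

```python
from decimal import Decimal

# Figures taken from the listing above.
COST_PER_RUN = Decimal("0.00825")  # US dollars per run
AVG_SECONDS_PER_RUN = 15           # average completion time in seconds


def implied_rate_per_second() -> Decimal:
    """Per-second price implied by the listed cost and average run time."""
    return COST_PER_RUN / AVG_SECONDS_PER_RUN


def estimate_batch(num_runs: int):
    """Return (total cost in dollars, total seconds) for num_runs average runs.

    Assumes sequential runs that each match the listed averages.
    """
    return COST_PER_RUN * num_runs, AVG_SECONDS_PER_RUN * num_runs


if __name__ == "__main__":
    cost, seconds = estimate_batch(1000)
    print(f"Implied rate: ${implied_rate_per_second()} per second")
    print(f"1,000 runs: about ${cost:.2f} over {seconds} seconds")
```

Under these assumptions, 1,000 runs come to about $8.25 and 15,000 seconds of compute, at an implied rate of $0.00055 per second.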