Sadtalker

cjwbw

sadtalker

SadTalker is a model developed to generate realistic and expressive talking face animations from a single image. It combines audio and image inputs to synthesize facial expressions, head tilts, and mouth movements that match the given audio input. The model has been trained on a large dataset of talking face videos with different emotional expressions, allowing it to generate animations with diverse emotional responses.

Use cases

SadTalker has a wide range of potential use cases in various industries. In entertainment, it can be used to create realistic and expressive animated characters for movies and video games. This model could also be utilized in virtual reality and augmented reality applications, providing more immersive and engaging experiences by enabling realistic and interactive avatars. In the field of education, SadTalker could be used to enhance online learning platforms by creating animated instructors that can deliver lectures with realistic expressions and gestures. Additionally, this model could have applications in the advertising industry, where it could be used to create animated spokespersons that deliver persuasive messages with more realism and emotional impact. Overall, SadTalker opens up possibilities for creating innovative products and practical uses that require realistic and expressive talking face animations.

Image-to-Image

Pricing

Cost per run
$0.2346
USD
Avg run time
102
Seconds
Hardware
Nvidia A100 (40GB) GPU
Prediction

Creator Models

ModelCostRuns
Pix2pix Zero$?4,206
Night Enhancement$0.0104520,721
Mindall E$?1,645
Compositional Vsual Generation With Composable Diffusion Models Pytorch$0.01155774
Idefics$?538

Similar Models

Try it!

You can use this area to play around with demo applications that incorporate the Sadtalker model. These demos are maintained and hosted externally by third-party creators. If you see an error, message me on Twitter.

Currently, there are no demos available for this model.

Overview

Summary of this model and related resources.

PropertyValue
Creatorcjwbw
Model NameSadtalker
Description
Stylized Audio-Driven Single Image Talking Face Animation
TagsImage-to-Image
Model LinkView on Replicate
API SpecView on Replicate
Github LinkView on Github
Paper LinkView on Arxiv

Popularity

How popular is this model, by number of runs? How popular is the creator, by the sum of all their runs?

PropertyValue
Runs35,547
Model Rank
Creator Rank

Cost

How much does it cost to run this model? How long, on average, does it take to complete a run?

PropertyValue
Cost per Run$0.2346
Prediction HardwareNvidia A100 (40GB) GPU
Average Completion Time102 seconds