Get a weekly rundown of the latest AI models and research... subscribe! https://aimodels.substack.com/

Wav2vec2 Large Robust 12 Ft Emotion Msp Dim

audeering

๐Ÿ

The wav2vec2-large-robust-12-ft-emotion-msp-dim model is a model for Dimensional Speech Emotion Recognition based on Wav2vec 2.0. It takes a raw audio signal as input and predicts the arousal, dominance, and valence dimensions of speech emotion in a range of 0 to 1. The model was created by fine-tuning the Wav2Vec2-Large-Robust model on the MSP-Podcast dataset and is pruned to 12 transformer layers. It also provides the pooled states of the last transformer layer. The model can be used for emotion recognition in speech applications.

Use cases

This AI model has various potential use cases for audio classification and speech emotion recognition. It can be utilized in applications such as automatic speech recognition (ASR) systems, sentiment analysis in call centers or customer service, emotion detection in voice assistants or chatbots, analyzing emotional content in podcasts or audio recordings, and enhancing human-computer interaction by detecting the emotional state of the speaker. Potential products or practical uses of this model include speech emotion analysis APIs, emotion-aware voice assistants or chatbots, sentiment analysis tools for analyzing audio data, and emotion detection systems for call centers or customer service. This model provides a powerful tool for understanding and analyzing the emotional dimensions of speech, enabling the development of more emotionally intelligent AI systems.

audio-classification

Pricing

Cost per run
$-
USD
Avg run time
-
Seconds
Hardware
-
Prediction

Creator Models

ModelCostRuns
No other models by this creator

Similar Models

Try it!

You can use this area to play around with demo applications that incorporate the Wav2vec2 Large Robust 12 Ft Emotion Msp Dim model. These demos are maintained and hosted externally by third-party creators. If you see an error, message me on Twitter.

Overview

Summary of this model and related resources.

PropertyValue
Creatoraudeering
Model NameWav2vec2 Large Robust 12 Ft Emotion Msp Dim
Description

Model for Dimensional Speech Emotion Recognition based on Wav2v...

Read more ยป
Tagsaudio-classification
Model LinkView on HuggingFace
API SpecView on HuggingFace
Github LinkNo Github link provided
Paper LinkNo paper link provided

Popularity

How popular is this model, by number of runs? How popular is the creator, by the sum of all their runs?

PropertyValue
Runs22,049
Model Rank
Creator Rank

Cost

How much does it cost to run this model? How long, on average, does it take to complete a run?

PropertyValue
Cost per Run$-
Prediction Hardware-
Average Completion Time-