Paper | Code | Accuracy (%) | F1 Score | Precision | Recall | F1 | Model Name | Release Date |
---|---|---|---|---|---|---|---|---|
A vector quantized masked autoencoder for speech emotion recognition | ✓ Link | 84.1 | 0.844 | | | | VQ-MAE-S-12 (Frame) + Query2Emo | 2023-04-21 |
Shallow over Deep Neural Networks: A empirical analysis for human emotion classification using audio data | | 82.99 | 0.82 | 0.82 | 0.82 | | CNN-X (Shallow CNN) | 2020-07-03 |
A proposal for Multimodal Emotion Recognition using aural transformers and Action Units on RAVDESS dataset | ✓ Link | 81.82 | | | | | xlsr-Wav2Vec2.0(FineTuning) | 2021-12-30 |
Multimodal Emotion Recognition on RAVDESS Dataset Using Transfer Learning | | 76.58 | | | | | CNN-14 (Fine-Tuning) | 2021-11-18 |
Multimodal Emotion Recognition on RAVDESS Dataset Using Transfer Learning | | 61.67 | | | | | AlexNet (FineTuning) | 2021-11-18 |