Efficient Long-distance Latent Relation-aware Graph Neural Network for Multi-modal Emotion Recognition in Conversations | | 69.9 | 68.7 | | | ELR-GNN | 2024-06-27 |
BiosERC: Integrating Biography Speakers Supported by LLMs for ERC Tasks | ✓ Link | 69.83 | | | | BiosERC | 2024-07-05 |
CKERC : Joint Large Language Models with Commonsense Knowledge for Emotion Recognition in Conversation | | 69.27 | | | | CKERC | 2024-03-12 |
InstructERC: Reforming Emotion Recognition in Conversation with Multi-task Retrieval-Augmented Large Language Models | ✓ Link | 69.15 | | | | InstructERC | 2023-09-21 |
Revisiting Multimodal Emotion Recognition in Conversation from the Perspective of Graph Spectrum | | 69.0 | 68.1 | | | GS-MCC | 2024-04-27 |
Beyond Silent Letters: Amplifying LLMs in Emotion Recognition with Vocal Nuances | ✓ Link | 67.604 | | | | SpeechCueLLM | 2024-07-31 |
Revisiting Multi-modal Emotion Learning with Broad State Space Models and Probability-guidance Fusion | | 67.6 | 68.0 | | | Mamba-like Model | 2024-04-27 |
TelME: Teacher-leading Multimodal Fusion Network for Emotion Recognition in Conversation | ✓ Link | 67.37 | | | | TelME | 2024-01-16 |
Supervised Prototypical Contrastive Learning for Emotion Recognition in Conversation | ✓ Link | 67.25 | | | | SPCL-CL-ERC | 2022-10-17 |
Emotion-Anchored Contrastive Learning Framework for Emotion Recognition in Conversation | ✓ Link | 67.12 | | | | EACL | 2024-03-29 |
Revisiting Disentanglement and Fusion on Modality and Context in Conversational Multimodal Emotion Recognition | | 67.03 | 68.28 | | | DF-ERC | 2023-08-08 |
Hierarchical Dialogue Understanding with Special Tokens and Turn-level Attention | ✓ Link | 66.96 | | | | HiDialog | 2023-04-29 |
Supervised Adversarial Contrastive Learning for Emotion Recognition in Conversations | ✓ Link | 66.86 | 67.89 | | | SACL-LSTM (one seed) | 2023-06-02 |
A Facial Expression-Aware Multimodal Multi-task Learning Framework for Emotion Recognition in Multi-party Conversations | ✓ Link | 66.73 | | | | FacialMMT | 2023-07-01 |
M2FNet: Multi-modal Fusion Network for Emotion Recognition in Conversation | | 66.71 | 67.85 | | | M2FNet | 2022-06-05 |
Tracing Intricate Cues in Dialogue: Joint Graph Structure and Sentiment Dynamics for Multimodal Emotion Recognition | ✓ Link | 66.71 | 67.70 | | | GraphSmile | 2024-07-31 |
CFN-ESA: A Cross-Modal Fusion Network with Emotion-Shift Awareness for Dialogue Emotion Recognition | ✓ Link | 66.70 | 67.85 | | | CFN-ESA | 2023-07-28 |
A Transformer-Based Model With Self-Distillation for Multimodal Emotion Recognition in Conversations | ✓ Link | 66.60 | 67.55 | | | SDT | 2023-10-31 |
CoMPM: Context Modeling with Speaker's Pre-trained Memory Tracking for Emotion Recognition in Conversation | ✓ Link | 66.52 | | | | CoMPM | 2021-08-26 |
EmotionFlow: Capture the Dialogue Level Emotion Transitions | ✓ Link | 66.50 | | | | EmotionFlow-large | 2022-05-07 |
The Emotion is Not One-hot Encoding: Learning with Grayscale Label for Emotion Recognition in Conversation | ✓ Link | 66.49 | | | | EmoOne-RoBERTa | 2022-06-15 |
Supervised Adversarial Contrastive Learning for Emotion Recognition in Conversations | ✓ Link | 66.45 | 67.51 | | | SACL-LSTM | 2023-06-02 |
EmotionIC: emotional inertia and contagion-driven dependency modeling for emotion recognition in conversation | ✓ Link | 66.32 | | 67.59 | | EmotionIC | 2023-03-20 |
M2FNet: Multi-modal Fusion Network for Emotion Recognition in Conversation | | 66.23 | 67.24 | | | M2FNet-Text | 2022-06-05 |
Static and Dynamic Speaker Modeling based on Graph Neural Network for Emotion Recognition in Conversation | | 65.90 | | | | Static-Dynamic Modeling | |
HCAM -- Hierarchical Cross Attention Model for Multi-modal Emotion Recognition | | 65.8 | | | | Audio + Text (Stage III) | 2023-04-14 |
DialogueCRN: Contextual Reasoning Networks for Emotion Recognition in Conversations | ✓ Link | 65.77 | 66.93 | | | DialogueCRN+RoBERTa | 2021-06-03 |
EmoBERTa: Speaker-Aware Emotion Recognition in Conversation with RoBERTa | ✓ Link | 65.61 | | | | EmoBERTa | 2021-08-26 |
GRASP: Guiding model with RelAtional Semantics using Prompt for Dialogue Relation Extraction | ✓ Link | 65.6 | | | | GRASP_Large | 2022-08-26 |
UniMSE: Towards Unified Multimodal Sentiment Analysis and Emotion Recognition | ✓ Link | 65.51 | 65.09 | | | UniMSE | 2022-11-21 |
Topic-Driven and Knowledge-Aware Transformer for Dialogue Emotion Detection | | 65.47 | | | | TODKAT | 2021-06-02 |
Graph Based Network with Contextualized Representations of Turns in Dialogue | ✓ Link | 65.36 | | | | TUCORE-GCN_RoBERTa | 2021-09-09 |
COSMIC: COmmonSense knowledge for eMotion Identification in Conversations | ✓ Link | 65.21 | | | | COSMIC | 2020-10-06 |
Past, Present, and Future: Conversational Emotion Recognition through Structural Modeling of Psychological Knowledge | ✓ Link | 65.18 | | | | SKAIG-ERC | |
EmotionFlow: Capture the Dialogue Level Emotion Transitions | ✓ Link | 65.05 | | | | EmotionFlow-base | 2022-05-07 |
Multimodal Prompt Transformer with Hybrid Contrastive Learning for Emotion Recognition in Conversation | | 65.02 | 65.86 | | | MPT-HCL | 2023-10-04 |
Contrast and Generation Make BART a Good Dialogue Emotion Recognizer | ✓ Link | 64.81 | | | | CoG-BART | 2021-12-21 |
Accumulating Word Representations in Multi-level Context Integration for ERC Task | ✓ Link | 64.58 | | | | AccumWR | 2023-11-06 |
A Discourse-Aware Graph Neural Network for Emotion Recognition in Multi-Party Conversation | | 64.22 | | | | ERMC-DisGCN | |
Long-Short Distance Graph Neural Networks and Improved Curriculum Learning for Emotion Recognition in Conversation | ✓ Link | 64.07 | | | | LSDGNN+ICL | 2025-07-21 |
EmoCaps: Emotion Capsule based Model for Conversational Emotion Recognition | | 64.00 | | | | EmoCaps | 2022-03-25 |
Directed Acyclic Graph Network for Conversational Emotion Recognition | ✓ Link | 63.65 | | | | DAG-ERC | 2021-05-27 |
EmoCaps: Emotion Capsule based Model for Conversational Emotion Recognition | | 63.51 | | | | EmoCaps-Text | 2022-03-25 |
S+PAGE: A Speaker and Position-Aware Graph Neural Network Model for Emotion Recognition in Conversation | | 63.32 | | | | S+PAGE | 2021-12-23 |
Knowledge-Interactive Network with Sentiment Polarity Intensity-Aware Multi-Task Learning for Emotion Recognition in Conversations | | 63.24 | | | | KI-Net | |
Graph Based Network with Contextualized Representations of Turns in Dialogue | ✓ Link | 62.47 | | | | TUCORE-GCN_BERT | 2021-09-09 |
DialogXL: All-in-One XLNet for Multi-Party Conversation Emotion Recognition | ✓ Link | 62.41 | | | | DialogXL | 2020-12-16 |
A Hierarchical Transformer with Speaker Modeling for Emotion Recognition in Conversation | ✓ Link | 62.36 | | | | TRMSM-Att | 2020-12-29 |
HiTrans: A Transformer-Based Context- and Speaker-Sensitive Model for Emotion Detection in Conversations | | 61.94 | | | | HiTrans | 2020-12-01 |
Multi-Task Learning with Auxiliary Speaker Identification for Conversational Emotion Recognition | | 61.90 | | | | BERT+MTL | 2020-03-03 |
Hierarchical Pre-training for Sequence Labelling in Spoken Dialog | | 61.90 | | | | Pretrained Hierarchical Transformer | 2020-09-23 |
Relation-aware Graph Attention Networks with Relational Position Encodings for Emotion Recognition in Conversations | ✓ Link | 60.91 | | | | RGAT-ERC | |
BiERU: Bidirectional Emotional Recurrent Unit for Conversational Sentiment Analysis | ✓ Link | 60.84 | | | | BiERU-lc | 2020-05-31 |
An Iterative Emotion Interaction Network for Emotion Recognition in Conversations | | 60.72 | | | | Iterative | 2020-12-01 |
Multi-Task Learning with Auxiliary Speaker Identification for Conversational Emotion Recognition | | 60.69 | | | | GloVE+MTL | 2020-03-03 |
MM-DFN: Multimodal Dynamic Fusion Network for Emotion Recognition in Conversations | ✓ Link | 59.46 | 62.49 | | | MM-DFN | 2022-03-04 |
GA2MIF: Graph and Attention Based Two-Stage Multi-Source Information Fusion for Conversational Emotion Detection | ✓ Link | 58.94 | 61.65 | | | GA2MIF | 2022-07-25 |
GraphCFC: A Directed Graph Based Cross-Modal Feature Complementation Approach for Multimodal Conversational Emotion Recognition | ✓ Link | 58.86 | 61.42 | | | GraphCFC | 2022-07-06 |
Summarize before Aggregate: A Global-to-local Heterogeneous Graph Inference Network for Conversational Emotion Recognition | | 58.45 | | | | SumAggGIN | 2020-12-01 |
DialogueCRN: Contextual Reasoning Networks for Emotion Recognition in Conversations | ✓ Link | 58.39 | 60.73 | | | DialogueCRN | 2021-06-03 |
Contextualized Emotion Recognition in Conversation as Sequence Tagging | | 58.36 | | | | CESTa | 2020-07-01 |
Knowledge-Enriched Transformer for Emotion Detection in Textual Conversations | ✓ Link | 58.18 | | | | KET | 2019-09-24 |
DialogueGCN: A Graph Convolutional Neural Network for Emotion Recognition in Conversation | ✓ Link | 58.10 | 59.46 | | | DialogueGCN | 2019-08-30 |
Modeling both context- and speaker-sensitive dependence for emotion detection in multi-speaker conversations | | 57.4 | | | | ConGCN | 2019-07-01 |
DialogueRNN: An Attentive RNN for Emotion Detection in Conversations | ✓ Link | 57.03 | 59.54 | | | DialogueRNN | 2018-11-01 |
Context-Dependent Sentiment Analysis in User-Generated Videos | ✓ Link | 56.44 | 57.50 | | | bc-LSTM+Att | 2017-07-01 |
Multi-Task Multi-Modal Self-Supervised Learning for Facial Expression Recognition | ✓ Link | | 60.03 | | 60.03 | ConCluGen | 2024-04-16 |
Qwen-Audio: Advancing Universal Audio Understanding via Unified Large-Scale Audio-Language Models | ✓ Link | | 55.70 | | | Qwen-Audio | 2023-11-14 |