Multi-Modal Emotion Recognition Feature Extraction and Data Fusion Methods Evaluation
Sanjeeva Rao Sanku¹, B. Sandhya²

¹Sanjeeva Rao Sanku, Department of Computer Science and Engineering, University College of Engineering, Osmania University, Hyderabad (Telangana), India.

²Prof. B. Sandhya, Department of Computer Science and Engineering, MVSR Engineering College, Hyderabad (Telangana), India.

© The Authors. Blue Eyes Intelligence Engineering and Sciences Publication (BEIESP). This is an open access article under the CC-BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/)

Abstract: Emotion recognition is an important research area because many fields can benefit from it, including healthcare, intelligent customer service, and education. Compared with unimodal approaches, multimodal emotion recognition (MER) integrates several modalities, such as text, facial expressions, and voice, to improve accuracy and robustness. This article gives an overview of MER from its earlier work to the present day, focusing on its relevance, challenges, and approaches. We examine several datasets, including IEMOCAP and MELD, and compare their features and shortcomings. We then review recent developments in deep learning approaches from the literature, particularly early, late, and hybrid fusion strategies. The shortcomings identified include data redundancy, complicated feature extraction, and real-time detection. Our proposed technique enhances emotion recognition accuracy by combining deep-learning-based feature extraction with a hybrid fusion approach. By addressing existing limitations, this study aims to guide future investigations and advance the field of MER. The main body of this work examines data fusion strategies, reviews new methodologies in multimodal emotion recognition, and identifies open problems and research needs.

Keywords: Multimodal Emotion Recognition (MER), Speech Analysis, Facial Expression Recognition, MELD, Hybrid Fusion.
Scope of the Article: Computer Science and Applications
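
The early, late, and hybrid fusion strategies named in the abstract can be illustrated with a minimal sketch. This is a generic illustration and not the authors' method: the function names, the modality keys, the weights, and the particular hybrid scheme (averaging the decision of a hypothetical joint model with the per-modality decisions) are all assumptions, and definitions of hybrid fusion vary across the literature.

```python
from typing import Dict, List, Sequence


def early_fusion(features: Dict[str, Sequence[float]]) -> List[float]:
    """Feature-level fusion: concatenate the per-modality feature vectors."""
    fused: List[float] = []
    for name in sorted(features):  # sorted keys give a stable vector layout
        fused.extend(features[name])
    return fused


def late_fusion(
    probs: Dict[str, Sequence[float]], weights: Dict[str, float]
) -> List[float]:
    """Decision-level fusion: weighted average of per-modality class probabilities."""
    total = sum(weights[m] for m in probs)
    n_classes = len(next(iter(probs.values())))
    return [
        sum(weights[m] * probs[m][i] for m in probs) / total
        for i in range(n_classes)
    ]


def hybrid_fusion(
    joint_probs: Sequence[float],
    probs: Dict[str, Sequence[float]],
    weights: Dict[str, float],
    joint_weight: float = 1.0,
) -> List[float]:
    """One possible hybrid scheme (an assumption): treat the output of a model
    trained on early-fused features as an extra voter in late fusion."""
    all_probs = dict(probs, joint=joint_probs)
    all_weights = dict(weights, joint=joint_weight)
    return late_fusion(all_probs, all_weights)


# Hypothetical two-class example with text and audio modalities.
unimodal = {"text": [0.5, 0.5], "audio": [1.0, 0.0]}
equal = {"text": 1.0, "audio": 1.0}
fused_features = early_fusion({"text": [1.0, 2.0], "audio": [3.0]})
late = late_fusion(unimodal, equal)
hybrid = hybrid_fusion([0.0, 1.0], unimodal, equal)
```

In practice the per-modality probabilities would come from trained classifiers, and the fused feature vector from `early_fusion` would be the input to the joint model whose output is passed as `joint_probs`.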

J996813100924

Share this entry

JOURNAL

REQUIREMENTS

PRODUCT

CONTACT US