Efficient Recognition of Human Emotional States from Audio Signals

dc.contributor.authorErdem, Ernur Sonat
dc.contributor.authorSert, Mustafa
dc.contributor.orcIDhttps://orcid.org/0000-0002-7056-4245en_US
dc.contributor.researcherIDAAB-8673-2019en_US
dc.date.accessioned2024-03-20T11:40:09Z
dc.date.available2024-03-20T11:40:09Z
dc.date.issued2014
dc.description.abstractAutomatic recognition of human emotional states is an important task for efficient human-machine communication. Most of existing works focus on the recognition of emotional states using audio signals alone, visual signals alone, or both. Here we propose empirical methods for feature extraction and classifier optimization that consider the temporal aspects of audio signals and introduce our framework to efficiently recognize human emotional states from audio signals. The framework is based on the prediction of input audio clips that are described using representative low-level features. In the experiments, seven (7) discrete emotional states (anger, fear, boredom, disgust, happiness, sadness, and neutral) from EmoDB dataset, are recognized and tested based on nineteen (19) audio features (15 standalone, 4 joint) by using the Support Vector Machine (SVM) classifier. Extensive experiments have been conducted to demonstrate the effect of feature extraction and classifier optimization methods to the recognition accuracy of the emotional states. Our experiments show that, feature extraction and classifier optimization procedures lead to significant improvement of over 11% in emotion recognition. As a result, the overall recognition accuracy achieved for seven emotions in the EmoDB dataset is 83.33% compared to the baseline accuracy of 72.22%.en_US
dc.identifier.endpage142en_US
dc.identifier.scopus2-s2.0-84930442687en_US
dc.identifier.startpage139en_US
dc.identifier.urihttp://hdl.handle.net/11727/11900
dc.identifier.wos000380456700026en_US
dc.language.isoengen_US
dc.relation.isversionof10.1109/ISM.2014.81en_US
dc.relation.journal2014 IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA (ISM)en_US
dc.rightsinfo:eu-repo/semantics/closedAccessen_US
dc.subjectbased emotion recognitionen_US
dc.subjectaffective computingen_US
dc.subjectMPEG-7 audioen_US
dc.subjectMFCCen_US
dc.subjectSupport Vector Machineen_US
dc.titleEfficient Recognition of Human Emotional States from Audio Signalsen_US
dc.typeconferenceObjecten_US

Files

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: