Efficient Recognition of Human Emotional States from Audio Signals

Erdem, Ernur Sonat; Sert, Mustafa

Efficient Recognition of Human Emotional States from Audio Signals

dc.contributor.author	Erdem, Ernur Sonat
dc.contributor.author	Sert, Mustafa
dc.contributor.orcID	https://orcid.org/0000-0002-7056-4245	en_US
dc.contributor.researcherID	AAB-8673-2019	en_US
dc.date.accessioned	2024-03-20T11:40:09Z
dc.date.available	2024-03-20T11:40:09Z
dc.date.issued	2014
dc.description.abstract	Automatic recognition of human emotional states is an important task for efficient human-machine communication. Most of existing works focus on the recognition of emotional states using audio signals alone, visual signals alone, or both. Here we propose empirical methods for feature extraction and classifier optimization that consider the temporal aspects of audio signals and introduce our framework to efficiently recognize human emotional states from audio signals. The framework is based on the prediction of input audio clips that are described using representative low-level features. In the experiments, seven (7) discrete emotional states (anger, fear, boredom, disgust, happiness, sadness, and neutral) from EmoDB dataset, are recognized and tested based on nineteen (19) audio features (15 standalone, 4 joint) by using the Support Vector Machine (SVM) classifier. Extensive experiments have been conducted to demonstrate the effect of feature extraction and classifier optimization methods to the recognition accuracy of the emotional states. Our experiments show that, feature extraction and classifier optimization procedures lead to significant improvement of over 11% in emotion recognition. As a result, the overall recognition accuracy achieved for seven emotions in the EmoDB dataset is 83.33% compared to the baseline accuracy of 72.22%.	en_US
dc.identifier.endpage	142	en_US
dc.identifier.scopus	2-s2.0-84930442687	en_US
dc.identifier.startpage	139	en_US
dc.identifier.uri	http://hdl.handle.net/11727/11900
dc.identifier.wos	000380456700026	en_US
dc.language.iso	eng	en_US
dc.relation.isversionof	10.1109/ISM.2014.81	en_US
dc.relation.journal	2014 IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA (ISM)	en_US
dc.rights	info:eu-repo/semantics/closedAccess	en_US
dc.subject	based emotion recognition	en_US
dc.subject	affective computing	en_US
dc.subject	MPEG-7 audio	en_US
dc.subject	MFCC	en_US
dc.subject	Support Vector Machine	en_US
dc.title	Efficient Recognition of Human Emotional States from Audio Signals	en_US
dc.type	Conference Object	en_US

Files

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 1.71 KB
Format:: Item-specific license agreed upon to submission
Description:

Download

Collections

Mühendislik Fakültesi / Faculty of Engineering