Deep Learning Based Multi Modal Approach for Pathological Sounds Classification

Ankishan, Haydar; Kocoglu, Arif

dc.contributor.author	Ankishan, Haydar
dc.contributor.author	Kocoglu, Arif
dc.date.accessioned	2023-09-08T08:12:31Z
dc.date.available	2023-09-08T08:12:31Z
dc.date.issued	2020
dc.identifier.issn	2165-0608	en_US
dc.identifier.uri	http://hdl.handle.net/11727/10551
dc.description.abstract	Automatic detection of voice disorders is very important because it makes the diagnosis process simpler, cheaper and less time consuming. In the literature, there are many studies available on the analysis of voice disorders based on the characteristics of the voice and subdividing the result of this analysis. In general, these studies have been carried out in order to subdivide the sound into pathological - normally sub - groups by means of certain classifiers as a result of subtraction of the features on frequency, time or hybrid axis. In contrast to existing approaches, in this study, a multiple- deep learning model using feature level fusion is proposed to distinguish pathological-normal sounds from each other. First, a feature vector (HOV) on the hybrid axis was obtained from the raw sound data. Then two CNN models were used. The first model has used raw audio data and the second model has used HOV as an input. Feature data in both model SoftMax layers were obtained as a matrix, and canonical correlation analysis (Canonical Correlation Analysis (CCA) was applied at feature level fusion. The new obtained feature vector was used as an input for multiple support vector machines (M-SVMs), Decision Tree (DTC) and naive bayes (NBC) classifiers. When the experimental results are examined, it is seen that the new multi-model based deep learning architecture provides superior success in classifying pathological sound data. With the results of the study, it will be possible to automatically detect and classify the pathology of these patients according to the proposed system.	en_US
dc.language.iso	tur	en_US
dc.relation.isversionof	10.1109/SIU49456.2020.9302067	en_US
dc.rights	info:eu-repo/semantics/closedAccess	en_US
dc.subject	deep learning based multi modal	en_US
dc.subject	feature level fusion	en_US
dc.subject	decision level fusion	en_US
dc.subject	pathological sounds classification	en_US
dc.title	Deep Learning Based Multi Modal Approach for Pathological Sounds Classification	en_US
dc.type	conferenceObject	en_US
dc.relation.journal	28th Signal Processing and Communications Applications Conference (SIU)	en_US
dc.identifier.wos	000653136100041	en_US
dc.identifier.scopus	2-s2.0-85100317296	en_US

Bu öğenin dosyaları:

Dosyalar	Boyut	Biçim	Göster
Bu öğe ile ilişkili dosya yok.

Bu öğe aşağıdaki koleksiyon(lar)da görünmektedir.

Teknik Bilimler Meslek Yüksekokulu / Vocational School of Technical Sciences [31]

Basit öğe kaydını göster