Basit öğe kaydını göster

dc.contributor.authorAnkishan, Haydar
dc.contributor.authorKocoglu, Arif
dc.date.accessioned2023-09-08T08:12:31Z
dc.date.available2023-09-08T08:12:31Z
dc.date.issued2020
dc.identifier.issn2165-0608en_US
dc.identifier.urihttp://hdl.handle.net/11727/10551
dc.description.abstractAutomatic detection of voice disorders is very important because it makes the diagnosis process simpler, cheaper and less time consuming. In the literature, there are many studies available on the analysis of voice disorders based on the characteristics of the voice and subdividing the result of this analysis. In general, these studies have been carried out in order to subdivide the sound into pathological - normally sub - groups by means of certain classifiers as a result of subtraction of the features on frequency, time or hybrid axis. In contrast to existing approaches, in this study, a multiple- deep learning model using feature level fusion is proposed to distinguish pathological-normal sounds from each other. First, a feature vector (HOV) on the hybrid axis was obtained from the raw sound data. Then two CNN models were used. The first model has used raw audio data and the second model has used HOV as an input. Feature data in both model SoftMax layers were obtained as a matrix, and canonical correlation analysis (Canonical Correlation Analysis (CCA) was applied at feature level fusion. The new obtained feature vector was used as an input for multiple support vector machines (M-SVMs), Decision Tree (DTC) and naive bayes (NBC) classifiers. When the experimental results are examined, it is seen that the new multi-model based deep learning architecture provides superior success in classifying pathological sound data. With the results of the study, it will be possible to automatically detect and classify the pathology of these patients according to the proposed system.en_US
dc.language.isoturen_US
dc.relation.isversionof10.1109/SIU49456.2020.9302067en_US
dc.rightsinfo:eu-repo/semantics/closedAccessen_US
dc.subjectdeep learning based multi modalen_US
dc.subjectfeature level fusionen_US
dc.subjectdecision level fusionen_US
dc.subjectpathological sounds classificationen_US
dc.titleDeep Learning Based Multi Modal Approach for Pathological Sounds Classificationen_US
dc.typeconferenceObjecten_US
dc.relation.journal28th Signal Processing and Communications Applications Conference (SIU)en_US
dc.identifier.wos000653136100041en_US
dc.identifier.scopus2-s2.0-85100317296en_US


Bu öğenin dosyaları:

DosyalarBoyutBiçimGöster

Bu öğe ile ilişkili dosya yok.

Bu öğe aşağıdaki koleksiyon(lar)da görünmektedir.

Basit öğe kaydını göster