Combining Acoustic and Semantic Similarity for Acoustic Scene Retrieval

dc.contributor.authorSert, Mustafa
dc.contributor.authorBasbug, Ahmet Melih
dc.date.accessioned2023-08-29T10:27:07Z
dc.date.available2023-08-29T10:27:07Z
dc.date.issued2019
dc.description.abstractAutomatic retrieval of acoustic scenes in large audio collections is a challenging task due to the complex structures of these sounds. A robust and flexible retrieval system should address both the acoustic- and semantic aspects of these sounds and how to combine them. In this study, we introduce an acoustic scene retrieval system that uses a combined acoustic- and semantic-similarity method. To address the acoustic aspects of sound scenes, we use a cascaded convolutional neural network (CNN) with a gated recurrent unit (GRU). The acoustic similarity is calculated in feature space using the Euclidean distance and the semantic similarity is obtained using the Path Similarity method of the WordNet. Two performance datasets from the TAU Urban Acoustic Scenes 2019 and the TUT Urban Acoustic Scenes 2018 are used to compare the performance of the proposed retrieval system with the literature and the developed baseline. Results show that the semantic similarity improves the mAP and P@k scores.en_US
dc.identifier.endpage159en_US
dc.identifier.isbn978-1-7281-5606-4en_US
dc.identifier.startpage156en_US
dc.identifier.urihttp://hdl.handle.net/11727/10461
dc.identifier.wos000528909200025en_US
dc.language.isoengen_US
dc.relation.isversionof10.1109/ISM46123.2019.00036en_US
dc.relation.journal2019 IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA (ISM 2019)en_US
dc.rightsinfo:eu-repo/semantics/closedAccessen_US
dc.subjectAcoustic Scene Retrievalen_US
dc.subjectSemantic Similarityen_US
dc.subjectAcoustic Similarityen_US
dc.subjectConvolutional Neural Network (CNN)en_US
dc.subjectGated Recurrent Unit (GRU)en_US
dc.subjectQuery-By-Exampleen_US
dc.titleCombining Acoustic and Semantic Similarity for Acoustic Scene Retrievalen_US
dc.typeConference Objecten_US

Files

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: