In this paper, we propose an algorithm for snoring sounds classification based on Deep Scattering Spectrum (SCAT), Gaussian Mixture Models (GMM) Supervectors and Deep Neural Networks (DNN). The task consists in the identification of the type of snoring among four target classes representing the snore sounds’ excitation location, which can be highly useful for a successful medical treatment of the habitual snorer or patient afflicted with Obstructive Sleep Apnea. The SCAT is computed from excerpt the acoustic signals, then a GMM Supervector is calculated by adapting the GMM model of the acoustic space with the Maximum a Posteriori (MAP) algorithm and concatenating the mean values of the Gaussians. Resulting supervectors are used to feed the multiclass DNN classifier. The performance of the algorithm has been assessed on the Munich-Passau Snore Sound Corpus (MPSSC), composed of recordings of Drug-Induced Sleep Endoscopy (DISE) examinations. The results are expressed in terms of Unweighted Average Recall (UAR) and a remarkable improvement with respect to the state-of-the-art performance has been registered, achieving a score up to 67.14% and 67.71% respectively on the devel and test datasets.

Snore Sounds Excitation Localization by Using Scattering Transform and Deep Neural Networks / Vesperini, Fabio; Galli, Andrea; Gabrielli, Leonardo; Principi, Emanuele; Squartini, Stefano. - (2018). (Intervento presentato al convegno International Joint Conference on Neural Networks, IJCNN 2018 tenutosi a Rio de Janeiro, Brazil nel July, 7-13, 2018) [10.1109/IJCNN.2018.8489576].

Snore Sounds Excitation Localization by Using Scattering Transform and Deep Neural Networks

Fabio Vesperini;Leonardo Gabrielli;Emanuele Principi;Stefano Squartini
2018-01-01

Abstract

In this paper, we propose an algorithm for snoring sounds classification based on Deep Scattering Spectrum (SCAT), Gaussian Mixture Models (GMM) Supervectors and Deep Neural Networks (DNN). The task consists in the identification of the type of snoring among four target classes representing the snore sounds’ excitation location, which can be highly useful for a successful medical treatment of the habitual snorer or patient afflicted with Obstructive Sleep Apnea. The SCAT is computed from excerpt the acoustic signals, then a GMM Supervector is calculated by adapting the GMM model of the acoustic space with the Maximum a Posteriori (MAP) algorithm and concatenating the mean values of the Gaussians. Resulting supervectors are used to feed the multiclass DNN classifier. The performance of the algorithm has been assessed on the Munich-Passau Snore Sound Corpus (MPSSC), composed of recordings of Drug-Induced Sleep Endoscopy (DISE) examinations. The results are expressed in terms of Unweighted Average Recall (UAR) and a remarkable improvement with respect to the state-of-the-art performance has been registered, achieving a score up to 67.14% and 67.71% respectively on the devel and test datasets.
2018
978-150906014-6
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11566/259202
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 11
  • ???jsp.display-item.citation.isi??? 6
social impact