In this paper, we propose an algorithm for snoring sounds classification based on Deep Scattering Spectrum (SCAT), Gaussian Mixture Models (GMM) Supervectors and Deep Neural Networks (DNN). The task consists in the identification of the type of snoring among four target classes representing the snore sounds’ excitation location, which can be highly useful for a successful medical treatment of the habitual snorer or patient afflicted with Obstructive Sleep Apnea. The SCAT is computed from excerpt the acoustic signals, then a GMM Supervector is calculated by adapting the GMM model of the acoustic space with the Maximum a Posteriori (MAP) algorithm and concatenating the mean values of the Gaussians. Resulting supervectors are used to feed the multiclass DNN classifier. The performance of the algorithm has been assessed on the Munich-Passau Snore Sound Corpus (MPSSC), composed of recordings of Drug-Induced Sleep Endoscopy (DISE) examinations. The results are expressed in terms of Unweighted Average Recall (UAR) and a remarkable improvement with respect to the state-of-the-art performance has been registered, achieving a score up to 67.14% and 67.71% respectively on the devel and test datasets.
Snore Sounds Excitation Localization by Using Scattering Transform and Deep Neural Networks / Vesperini, Fabio; Galli, Andrea; Gabrielli, Leonardo; Principi, Emanuele; Squartini, Stefano. - (2018). (Intervento presentato al convegno International Joint Conference on Neural Networks, IJCNN 2018 tenutosi a Rio de Janeiro, Brazil nel July, 7-13, 2018) [10.1109/IJCNN.2018.8489576].
Snore Sounds Excitation Localization by Using Scattering Transform and Deep Neural Networks
Fabio Vesperini;Leonardo Gabrielli;Emanuele Principi;Stefano Squartini
2018-01-01
Abstract
In this paper, we propose an algorithm for snoring sounds classification based on Deep Scattering Spectrum (SCAT), Gaussian Mixture Models (GMM) Supervectors and Deep Neural Networks (DNN). The task consists in the identification of the type of snoring among four target classes representing the snore sounds’ excitation location, which can be highly useful for a successful medical treatment of the habitual snorer or patient afflicted with Obstructive Sleep Apnea. The SCAT is computed from excerpt the acoustic signals, then a GMM Supervector is calculated by adapting the GMM model of the acoustic space with the Maximum a Posteriori (MAP) algorithm and concatenating the mean values of the Gaussians. Resulting supervectors are used to feed the multiclass DNN classifier. The performance of the algorithm has been assessed on the Munich-Passau Snore Sound Corpus (MPSSC), composed of recordings of Drug-Induced Sleep Endoscopy (DISE) examinations. The results are expressed in terms of Unweighted Average Recall (UAR) and a remarkable improvement with respect to the state-of-the-art performance has been registered, achieving a score up to 67.14% and 67.71% respectively on the devel and test datasets.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.