A plethora of Voice or Speaker Activity Detection systems exist in literature. They are indeed a fundamental part of complex systems that deals with speech processing. In this work the authors exploit neural network based VAD to address the speaker activity detection in a multi-room domestic scenario. The goal is to detect the voice activity in each of the two target rooms in presence of other sounds and speeches occurring in other rooms and outside. A large dataset recorded in a smart-home is provided and result obtained are acceptable.
Neural Networks Based Methods for Voice Activity Detection in a Multi-room Domestic Environment / Ferroni, Giacomo; Bonfigli, Roberto; Principi, Emanuele; Squartini, Stefano; Piazza, Francesco. - 1:(2014), pp. 153-158. (Intervento presentato al convegno Evalita 2014 tenutosi a Pisa, Italy nel 11 Dec 2014).
Neural Networks Based Methods for Voice Activity Detection in a Multi-room Domestic Environment
FERRONI, GIACOMO;Bonfigli, Roberto;PRINCIPI, EMANUELE;SQUARTINI, Stefano;PIAZZA, Francesco
2014-01-01
Abstract
A plethora of Voice or Speaker Activity Detection systems exist in literature. They are indeed a fundamental part of complex systems that deals with speech processing. In this work the authors exploit neural network based VAD to address the speaker activity detection in a multi-room domestic scenario. The goal is to detect the voice activity in each of the two target rooms in presence of other sounds and speeches occurring in other rooms and outside. A large dataset recorded in a smart-home is provided and result obtained are acceptable.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.