Real-Time Joint Blind Speech Separation and Dereverberation in Presence of Overlapping Speakers / Rotili, R.; Principi, Emanuele; Squartini, Stefano; Piazza, Francesco. - Volume 6676, LNCS:(2011), pp. 437-446. [10.1007/978-3-642-21090-7_52]

Real-Time Joint Blind Speech Separation and Dereverberation in Presence of Overlapping Speakers

Principi, Emanuele; Squartini, Stefano; Piazza, Francesco
2011-01-01

Abstract

Blind source separation and speech dereverberation are two important and common problems in audio processing, especially in the context of real meetings. This paper takes as its starting point a real-time framework that implements a sequential source separation and speech dereverberation algorithm based on blind channel identification (BCI). The major drawback of this approach is that the BCI stage cannot estimate the room impulse responses when two or more sources are concurrently active. To overcome this limitation, a speaker diarization system has been successfully inserted into the reference framework to pilot the BCI stage. In this way, the identification task can be accomplished directly from the microphone mixture, making the overall structure well suited to real-time applications. The proposed solution works in the frequency domain, and the NU-Tech software platform has been used for real-time simulations.
Year: 2011
Book title: Advances in Neural Networks - ISNN 2011
ISBN: 9783642210891

Use this identifier to cite or link to this document: https://hdl.handle.net/11566/54620