In a multi-speaker scenario, humans are able to focus on a target speaker, ignoring all other speakers and noise, thus solving the so-called cocktail-party problem. However, elderly people and people suffering for hearing loss struggle to listening under these conditions. Recent studies have confirmed that the listener’s selective attention to the attended speaker can be decoded using recording of brain activity such as electroencephalography, thus opening new opportunities in developing a new generation of neuro-steered hearing aids and hearing prostheses. To this end several algorithms have been developed for solving the so called auditory attention decoding problem from electroencephalography on the basis of neural entrainment mechanism. The most common approaches in development of auditory attention decoding algorithms are based on linear modeling of the neural entrainment. However, even though these algorithms have shown to be effective in solving cocktail-party problem, they have some inherent limitations. The main objective of this contribution is to show that nonlinear modeling of speech-electroencephalography system ensures the best performance in terms of higher correlation between stimulus and neural response, thus proving the limitations of linear approach. For this purpose the most common linear models for auditory attention decoding are reviewed and a new nonlinear model for auditory attention decoding is proposed. An extensive experimentation using a specific speech-electroencephalography dataset, confirms the superiority of nonlinear modeling in solving the auditory attention decoding problem.
Forward Nonlinear Model for Deep Learning of EEG Auditory Attention Detection in Cocktail Party Problem / Falaschetti, L.; Alessandrini, M.; Turchetti, C.. - 259:(2024), pp. 143-165. [10.1007/978-3-031-65640-8_7]
Forward Nonlinear Model for Deep Learning of EEG Auditory Attention Detection in Cocktail Party Problem
Falaschetti L.Primo
;Alessandrini M.Secondo
;Turchetti C.
Ultimo
2024-01-01
Abstract
In a multi-speaker scenario, humans are able to focus on a target speaker, ignoring all other speakers and noise, thus solving the so-called cocktail-party problem. However, elderly people and people suffering for hearing loss struggle to listening under these conditions. Recent studies have confirmed that the listener’s selective attention to the attended speaker can be decoded using recording of brain activity such as electroencephalography, thus opening new opportunities in developing a new generation of neuro-steered hearing aids and hearing prostheses. To this end several algorithms have been developed for solving the so called auditory attention decoding problem from electroencephalography on the basis of neural entrainment mechanism. The most common approaches in development of auditory attention decoding algorithms are based on linear modeling of the neural entrainment. However, even though these algorithms have shown to be effective in solving cocktail-party problem, they have some inherent limitations. The main objective of this contribution is to show that nonlinear modeling of speech-electroencephalography system ensures the best performance in terms of higher correlation between stimulus and neural response, thus proving the limitations of linear approach. For this purpose the most common linear models for auditory attention decoding are reviewed and a new nonlinear model for auditory attention decoding is proposed. An extensive experimentation using a specific speech-electroencephalography dataset, confirms the superiority of nonlinear modeling in solving the auditory attention decoding problem.| File | Dimensione | Formato | |
|---|---|---|---|
|
Falaschetti_Forward-Nonlinear-Model-Deep_2024.pdf
Solo gestori archivio
Descrizione: Chapter 7
Tipologia:
Versione editoriale (versione pubblicata con il layout dell'editore)
Licenza d'uso:
Tutti i diritti riservati
Dimensione
435.16 kB
Formato
Adobe PDF
|
435.16 kB | Adobe PDF | Visualizza/Apri Richiedi una copia |
|
chapter_postprint.pdf
Open Access dal 19/11/2025
Tipologia:
Documento in post-print (versione successiva alla peer review e accettata per la pubblicazione)
Licenza d'uso:
Licenza specifica dell'editore
Dimensione
486.65 kB
Formato
Adobe PDF
|
486.65 kB | Adobe PDF | Visualizza/Apri |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.


