This paper presents a combined speech recognition/speaker identification system that can be efficiently used for personalized domotic control. The proposed system works as a distributed framework and it is designed to identify a speaker in home environments in order to provide user access to customized options. Human speech signals contain both language and speaker dependent information. Using this information the system realizes a personalized control in home environments and this approach can also be applied in more generic scenarios such as car customization settings. The system was optimized with the aim to allow an immediate use only with the addition of small and cheap audio front-ends that will capture commands spoken by the user. Meanwhile a remote server performs the speech recognition as well as user identification and combines these informations to provides user specific settings which are sent back to the desired actuator at home.
Distributed Speech and Speaker Identification System for Personalized Domotic Control / Biagetti, Giorgio; Crippa, Paolo; Falaschetti, Laura; Orcioni, Simone; Turchetti, Claudio. - 392:(2016), pp. 159-170. [10.1007/978-3-319-39700-9_13]
Distributed Speech and Speaker Identification System for Personalized Domotic Control
BIAGETTI, Giorgio;CRIPPA, Paolo;FALASCHETTI, LAURA;ORCIONI, Simone;TURCHETTI, Claudio
2016-01-01
Abstract
This paper presents a combined speech recognition/speaker identification system that can be efficiently used for personalized domotic control. The proposed system works as a distributed framework and it is designed to identify a speaker in home environments in order to provide user access to customized options. Human speech signals contain both language and speaker dependent information. Using this information the system realizes a personalized control in home environments and this approach can also be applied in more generic scenarios such as car customization settings. The system was optimized with the aim to allow an immediate use only with the addition of small and cheap audio front-ends that will capture commands spoken by the user. Meanwhile a remote server performs the speech recognition as well as user identification and combines these informations to provides user specific settings which are sent back to the desired actuator at home.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.