Speech and sound recognition in home automation scenarios has been gaining an increasing interest in the last decade. One interesting approach addressed in the literature is based on the template matching paradigm, which is characterized by ease of implementation and independence on large datasets for system training. Moving from a recent contribution of some of the authors, where an Extreme Learn-ing Machine algorithm was proposed and evaluated, a wider performance analysis in diverse operating conditions is provided here, together with some relevant improvements. These are allowed by the employment of supervector features as input, for the first time used with ELMs, up to the authors’ knowledge. As already verified in other application contexts and with different learning systems, this ensures a more robust characterization of the speech segment to be classified, also in presence of mismatch between training and testing data. The accomplished computer simulations confirm the effectiveness of the approach, with F1-Measure performance up to 99% in the multicondition case, and a computational time reduction factor close to 4, with respect to the SVM counterpart.
ELM Based Algorithms for Acoustic Template Matching in Home Automation Scenarios: Advancements and Performance Analysis / DELLA PORTA, Giulio; Principi, Emanuele; Ferroni, Giacomo; Squartini, Stefano; Hussain, A.; Piazza, Francesco. - Volume 48:(2016), pp. 159-168. [10.1007/978-3-319-28109-4_16]
ELM Based Algorithms for Acoustic Template Matching in Home Automation Scenarios: Advancements and Performance Analysis
DELLA PORTA, GIULIO;PRINCIPI, EMANUELE;FERRONI, GIACOMO;SQUARTINI, Stefano;PIAZZA, Francesco
2016-01-01
Abstract
Speech and sound recognition in home automation scenarios has been gaining an increasing interest in the last decade. One interesting approach addressed in the literature is based on the template matching paradigm, which is characterized by ease of implementation and independence on large datasets for system training. Moving from a recent contribution of some of the authors, where an Extreme Learn-ing Machine algorithm was proposed and evaluated, a wider performance analysis in diverse operating conditions is provided here, together with some relevant improvements. These are allowed by the employment of supervector features as input, for the first time used with ELMs, up to the authors’ knowledge. As already verified in other application contexts and with different learning systems, this ensures a more robust characterization of the speech segment to be classified, also in presence of mismatch between training and testing data. The accomplished computer simulations confirm the effectiveness of the approach, with F1-Measure performance up to 99% in the multicondition case, and a computational time reduction factor close to 4, with respect to the SVM counterpart.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.