Deep Learning for Timbre Modification and Transfer: An Evaluation Study

Gabrielli, Leonardo; Cella, Carmine Emanuel; Vesperini, Fabio; Droghini, Diego; Principi, Emanuele; Squartini, Stefano

In the past years, several hybridization techniques have been proposed to synthesize novel audio content owing its properties from two audio sources. These algorithms, however, usually provide no feature learning, leaving the user, often intentionally, exploring parameters by trial-and-error. The introduction of machine learning algorithms in the music processing field calls for an investigation to seek for possible exploitation of their properties such as the ability to learn semantically meaningful features. In this first work we adopt a Neural Network Autoencoder architecture, and we enhance it to exploit temporal dependencies. In our experiments the architecture was able to modify the original timbre, resembling what it learned during the training phase, while preserving the pitch envelope from the input.

Deep Learning for Timbre Modification and Transfer: An Evaluation Study / Gabrielli, Leonardo; Cella, Carmine Emanuel; Vesperini, Fabio; Droghini, Diego; Principi, Emanuele; Squartini, Stefano. - ELETTRONICO. - (2018). (Intervento presentato al convegno Audio Engineering Society Convention 144 tenutosi a Milan, Italy nel May 2018).