This work proposes a pipeline that aims to recognize the products in a shelf, at the level of the single SKU (Stock Keeping Unit), starting from a photo of that shelf. It is composed of a first neural network that detects the individual products on the shelf and has been trained with the SKU110K dataset and a second network, designed and built within this work that associates to the single image created by the first network, an embedding vector, which describes its distinctive features. By obtaining this vector of the input image, it is possible to measure the similarity, by means of the cosine similarity, between this vector and all the embedding vectors in the comparison dataset. The vector with the highest cosine similarity is associated to an image labeled with the EAN (European Article Number) code and, therefore, this EAN will be that of the input image. Given the particular task, there are not currently any dataset able to meet our requirements as they have not such a granular level of detail (EAN labeled), so a new properly designed dataset is created to solve this task.
A Deep Learning-Based System for Product Recognition in Intelligent Retail Environment / Pietrini, R; Rossi, L; Mancini, A; Zingaretti, P; Frontoni, E; Paolanti, M. - 13232:(2022), pp. 371-382. [10.1007/978-3-031-06430-2_31]
A Deep Learning-Based System for Product Recognition in Intelligent Retail Environment
Pietrini, R;Rossi, L;Mancini, A;Zingaretti, P;Frontoni, E;Paolanti, M
2022-01-01
Abstract
This work proposes a pipeline that aims to recognize the products in a shelf, at the level of the single SKU (Stock Keeping Unit), starting from a photo of that shelf. It is composed of a first neural network that detects the individual products on the shelf and has been trained with the SKU110K dataset and a second network, designed and built within this work that associates to the single image created by the first network, an embedding vector, which describes its distinctive features. By obtaining this vector of the input image, it is possible to measure the similarity, by means of the cosine similarity, between this vector and all the embedding vectors in the comparison dataset. The vector with the highest cosine similarity is associated to an image labeled with the EAN (European Article Number) code and, therefore, this EAN will be that of the input image. Given the particular task, there are not currently any dataset able to meet our requirements as they have not such a granular level of detail (EAN labeled), so a new properly designed dataset is created to solve this task.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.