In anomaly and defect detection tasks, negative samples greatly outnumber defective ones, which results in a strong class imbalance. In this work, we introduce a data-level solution, based on a data augmentation (DA) strategy, for improving the generalization performance of semantic segmentation of surface defects. Our DA approach comprises a generative stage that simulates synthetic defects and a validation stage that checks that each synthetic image is as close as possible to a real one. A Siamese network validates the synthetic samples so that only synthetic defects close to the real ones are selected. We demonstrate the effectiveness of our approach in a real use-case scenario by comparing it with baseline DA approaches. Our DA approach balances the minority classes while improving the overall generalization performance of semantic segmentation for defect detection.
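The validation stage described in the abstract can be illustrated with a minimal sketch. It assumes that a Siamese encoder has already mapped each real and synthetic defect image to an embedding vector, and that a synthetic sample is accepted when its nearest real embedding lies within a distance threshold. The function names, the Euclidean distance, the nearest-neighbour rule, and the threshold value are all illustrative assumptions, not details taken from the paper.

```python
import math


def euclidean(a, b):
    """Euclidean distance between two equal-length embedding vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def select_realistic(synthetic_embeddings, real_embeddings, threshold):
    """Return indices of synthetic samples close to at least one real sample.

    Hypothetical selection rule: a synthetic sample is kept when the distance
    to its nearest real embedding does not exceed `threshold`.
    """
    kept = []
    for i, s in enumerate(synthetic_embeddings):
        nearest = min(euclidean(s, r) for r in real_embeddings)
        if nearest <= threshold:
            kept.append(i)
    return kept


# Toy 2-D embeddings standing in for Siamese encoder outputs (assumed values).
real = [[0.0, 0.0], [1.0, 1.0]]
synthetic = [[0.1, 0.0], [5.0, 5.0], [0.9, 1.2]]

kept = select_realistic(synthetic, real, threshold=0.5)
print(kept)
```

In this toy setting the second synthetic sample lies far from every real embedding and is discarded, while the other two are retained for augmentation. In practice the threshold would need to be tuned on held-out real data.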

Data augmentation strategy for generating realistic samples on defect segmentation task / Martini, Massimo; Rosati, Riccardo; Romeo, Luca; Mancini, Adriano. - In: PROCEDIA COMPUTER SCIENCE. - ISSN 1877-0509. - 232:(2024), pp. 1597-1606. (Paper presented at the 5th International Conference on Industry 4.0 and Smart Manufacturing, ISM 2023, held at University Institute of Lisbon, Portugal, in 2023) [10.1016/j.procs.2024.01.157].

Data augmentation strategy for generating realistic samples on defect segmentation task

Martini, Massimo; Rosati, Riccardo; Mancini, Adriano
2024-01-01

2024
Procedia Computer Science
Files in this item:
File: ISM_2023_manuscript_v2_pdf_4828.pdf
Access: open access
Type: Publisher's version (published version with the publisher's layout)
License: Creative Commons
Size: 980.83 kB
Format: Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11566/342620