Highlights: What are the main findings? Integration of DP, FCM, and LLMs achieved high diagnostic accuracy in pediatric surgical pathology. Adding IHC data improved AI performance, resolving false negatives and reaching 100% sensitivity in neoplastic cases. What is the implication of the main finding? AI-assisted DP workflows can enhance diagnostic efficiency and intraoperative decision-making. These results highlight the potential of AI to complement expert pathologists and accelerate the adoption of digital workflows in pediatric pathology. Background: Digital pathology (DP) combined with fluorescence confocal microscopy (FCM) allows rapid tissue assessment while preserving specimens. Artificial intelligence (AI) and large language models (LLMs) may enhance diagnostic workflows, but their role in pediatric surgical pathology is largely unexplored. Methods: We conducted a prospective, single-center study including 20 pediatric surgical cases with ex vivo FCM images acquired intraoperatively. Two commercially available LLMs, GPT-4V (AnPathology-Gpt) and Claude 3.7 Sonnet (AnPathology Project), were tested using structured prompts to generate diagnostic reports with and without immunohistochemistry (IHC) data, when available. Outputs were compared against the gold standard diagnosis by an experienced pediatric pathologist. Diagnostic performance was evaluated through accuracy, sensitivity, specificity, and Cohen’s kappa. A paired sub-analysis was performed for cases with IHC (n = 5), and a sensitivity analysis excluding IHC cases (n = 15) was conducted. Results: Across all 20 cases, AnPathology-Gpt achieved 85% accuracy, 100% sensitivity, 86% specificity, and κ = 0.78, while AnPathology Project reached 80% accuracy, 100% sensitivity, 57% specificity, and κ = 0.63. Both models correctly identified all 13 neoplastic cases, with errors limited to non-neoplastic lesions mimicking tumors. In the IHC sub-analysis, accuracy improved from 40% to 80% and sensitivity from 50% to 100% for both models, resolving two false negatives observed in the FCM-only evaluation. Sensitivity analysis excluding IHC confirmed consistency of the results. Conclusions: This pilot study demonstrates that multimodal LLMs can support accurate and rapid diagnosis in pediatric digital pathology. The addition of IHC improves performance in diagnostically complex cases. Larger multicenter studies are needed to validate these findings and to define the role of AI-assisted workflows in pediatric surgical pathology.

Seeing Beyond the Microscope: Artificial Intelligence and Fluorescence Confocal Digital Imaging in Pediatric Surgical Pathology / Di Fabrizio, D.; Daziani, G.; Qose, I.; Bindi, E.; Ilari, M.; Filosa, A.; Busardo, F. P.; Goteri, G.; Cobellis, G.. - In: CHILDREN. - ISSN 2227-9067. - 12:12(2025). [10.3390/children12121608]

Seeing Beyond the Microscope: Artificial Intelligence and Fluorescence Confocal Digital Imaging in Pediatric Surgical Pathology

Di Fabrizio D.
;
Daziani G.;Bindi E.;Ilari M.;Filosa A.;Busardo F. P.;Goteri G.;Cobellis G.
2025-01-01

Abstract

Highlights: What are the main findings? Integration of DP, FCM, and LLMs achieved high diagnostic accuracy in pediatric surgical pathology. Adding IHC data improved AI performance, resolving false negatives and reaching 100% sensitivity in neoplastic cases. What is the implication of the main finding? AI-assisted DP workflows can enhance diagnostic efficiency and intraoperative decision-making. These results highlight the potential of AI to complement expert pathologists and accelerate the adoption of digital workflows in pediatric pathology. Background: Digital pathology (DP) combined with fluorescence confocal microscopy (FCM) allows rapid tissue assessment while preserving specimens. Artificial intelligence (AI) and large language models (LLMs) may enhance diagnostic workflows, but their role in pediatric surgical pathology is largely unexplored. Methods: We conducted a prospective, single-center study including 20 pediatric surgical cases with ex vivo FCM images acquired intraoperatively. Two commercially available LLMs, GPT-4V (AnPathology-Gpt) and Claude 3.7 Sonnet (AnPathology Project), were tested using structured prompts to generate diagnostic reports with and without immunohistochemistry (IHC) data, when available. Outputs were compared against the gold standard diagnosis by an experienced pediatric pathologist. Diagnostic performance was evaluated through accuracy, sensitivity, specificity, and Cohen’s kappa. A paired sub-analysis was performed for cases with IHC (n = 5), and a sensitivity analysis excluding IHC cases (n = 15) was conducted. Results: Across all 20 cases, AnPathology-Gpt achieved 85% accuracy, 100% sensitivity, 86% specificity, and κ = 0.78, while AnPathology Project reached 80% accuracy, 100% sensitivity, 57% specificity, and κ = 0.63. Both models correctly identified all 13 neoplastic cases, with errors limited to non-neoplastic lesions mimicking tumors. In the IHC sub-analysis, accuracy improved from 40% to 80% and sensitivity from 50% to 100% for both models, resolving two false negatives observed in the FCM-only evaluation. Sensitivity analysis excluding IHC confirmed consistency of the results. Conclusions: This pilot study demonstrates that multimodal LLMs can support accurate and rapid diagnosis in pediatric digital pathology. The addition of IHC improves performance in diagnostically complex cases. Larger multicenter studies are needed to validate these findings and to define the role of AI-assisted workflows in pediatric surgical pathology.
2025
artificial intelligence; children; digital pathology; ex vivo fluorescence confocal microscopy; large language models
File in questo prodotto:
File Dimensione Formato  
Di Fabrizio_Seeing-Beyond-Microscope-Artificial_2025.pdf

accesso aperto

Tipologia: Versione editoriale (versione pubblicata con il layout dell'editore)
Licenza d'uso: Creative commons
Dimensione 235.7 kB
Formato Adobe PDF
235.7 kB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11566/351687
Citazioni
  • ???jsp.display-item.citation.pmc??? 0
  • Scopus 0
  • ???jsp.display-item.citation.isi??? 0
social impact