Dysarthric Speech Classification: A Comparative Analysis of Decision-Support Methods

IRIS

This study compares data-driven and non-data-driven methods for classifying dysarthric speech employing the UA-Speech dataset. Data-driven deep-learning architectures, including MobileNetV3, ResNet50, and ResNet152, are evaluated alongside non-data-driven approaches such as pre-trained Automatic Speech Recognition (ASR) based methods and DSP-based methods. The findings reveal that overlapping patient data between training and testing sets leads to inflated performance metrics. By enforcing patient-separated evaluations, the study highlights reduced model accuracy, emphasizing the need for methodologies that generalize effectively to unseen data. Among the tested approaches, the ASR-based approach demonstrates the highest potential, achieving strong correlation with human intelligibility assessments and offering promising clinical applicability. This analysis underscores the importance of robust evaluation protocols and highlights the potential of pre-trained ASR models for reliable dysarthric speech assessment.

Dysarthric Speech Classification: A Comparative Analysis of Decision-Support Methods / Lillini, D., Aironi, C., Migliorelli, L., Gabrielli, L., Squartini, S.. - (2025). (2025 International Joint Conference on Neural Networks, IJCNN 2025 Rome, Italy 30 June 2025 - 05 July 2025) [10.1109/IJCNN64981.2025.11228715].

Dysarthric Speech Classification: A Comparative Analysis of Decision-Support Methods

Lillini D.;Aironi C.;Migliorelli L.;Gabrielli L.;Squartini S.

2025-01-01

Abstract

This study compares data-driven and non-data-driven methods for classifying dysarthric speech employing the UA-Speech dataset. Data-driven deep-learning architectures, including MobileNetV3, ResNet50, and ResNet152, are evaluated alongside non-data-driven approaches such as pre-trained Automatic Speech Recognition (ASR) based methods and DSP-based methods. The findings reveal that overlapping patient data between training and testing sets leads to inflated performance metrics. By enforcing patient-separated evaluations, the study highlights reduced model accuracy, emphasizing the need for methodologies that generalize effectively to unseen data. Among the tested approaches, the ASR-based approach demonstrates the highest potential, achieving strong correlation with human intelligibility assessments and offering promising clinical applicability. This analysis underscores the importance of robust evaluation protocols and highlights the potential of pre-trained ASR models for reliable dysarthric speech assessment.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno di pubblicazione
	
				2025
			
	Codice ISBN
	
				9798331510428
			
	Codice DOI
	
				https://dx.doi.org/10.1109/IJCNN64981.2025.11228715
			
	Appare nelle tipologie:
	
				4.1 Contributo in Atti di convegno

File in questo prodotto:

File	Dimensione	Formato
Lillini_Dysarthric-Speech-Classification-Comparative-Analysis_2025.pdf Solo gestori archivio Tipologia: Versione editoriale (versione pubblicata con il layout dell'editore) Licenza d'uso: Tutti i diritti riservati Dimensione 1.04 MB Formato Adobe PDF Visualizza/Apri Richiedi una copia	1.04 MB	Adobe PDF	Visualizza/Apri Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11566/351252

Citazioni

ND

0

0

social impact