Improving Industrial Question Answering Chatbots with Domain-Specific LLMs Fine-Tuning

IRIS

The industrial landscape is experiencing a significant transformation and there is a growing emphasis on human-machine collaboration, personalization, and sustainable manufacturing. Within this context, large language models (LLMs) have emerged as pivotal Artificial Intelligence (AI) tools that enhance communication, streamline operations, and support complex decision-making tasks through advanced natural language processing capabilities. This paper explores the process of fine-tuning a foundational general-purpose LLM, specifically the open-source LLaMA 3 model, to cater to industrial applications. We delve into the methodology of fine-tuning LLaMA on domain-specific data from two distinct industrial scenarios (i.e., product management and production line operations). Our results demonstrate significant improvements in the model’s performance, accuracy, and relevance in generating responses, thereby enhancing decision-making and operational processes in industrial settings. Additionally, we propose the integration of the fine-tuned model into a chatbot system designed to assist human operators by providing targeted, accurate responses in real-time. This paper aims to provide a comprehensive guide for industry practitioners and researchers to harness the power of LLMs for specialized industrial tasks, ultimately contributing to the advancement of intelligent industrial automation and decision-making systems.

Improving Industrial Question Answering Chatbots with Domain-Specific LLMs Fine-Tuning / Rosati, Riccardo; Antonini, Filippo; Muralikrishna, Nikhil; Tonetto, Flavio; Mancini, Adriano. - (2024). (Intervento presentato al convegno 20th IEEE/ASME International Conference on Mechatronic, Embedded Systems and Applications, MESA 2024 tenutosi a Genova, Italy nel 2024) [10.1109/mesa61532.2024.10704843].

Improving Industrial Question Answering Chatbots with Domain-Specific LLMs Fine-Tuning

Rosati, Riccardo;Antonini, Filippo;Muralikrishna, Nikhil;Tonetto, Flavio;Mancini, Adriano

2024-01-01

Abstract

The industrial landscape is experiencing a significant transformation and there is a growing emphasis on human-machine collaboration, personalization, and sustainable manufacturing. Within this context, large language models (LLMs) have emerged as pivotal Artificial Intelligence (AI) tools that enhance communication, streamline operations, and support complex decision-making tasks through advanced natural language processing capabilities. This paper explores the process of fine-tuning a foundational general-purpose LLM, specifically the open-source LLaMA 3 model, to cater to industrial applications. We delve into the methodology of fine-tuning LLaMA on domain-specific data from two distinct industrial scenarios (i.e., product management and production line operations). Our results demonstrate significant improvements in the model’s performance, accuracy, and relevance in generating responses, thereby enhancing decision-making and operational processes in industrial settings. Additionally, we propose the integration of the fine-tuned model into a chatbot system designed to assist human operators by providing targeted, accurate responses in real-time. This paper aims to provide a comprehensive guide for industry practitioners and researchers to harness the power of LLMs for specialized industrial tasks, ultimately contributing to the advancement of intelligent industrial automation and decision-making systems.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno di pubblicazione
	
				2024
			
	Codice ISBN
	
				9798331516239
			
	Codice DOI
	
				https://dx.doi.org/10.1109/mesa61532.2024.10704843
			
	Appare nelle tipologie:
	
				4.1 Contributo in Atti di convegno

File in questo prodotto:

File	Dimensione	Formato
Improving_Industrial_Question_Answering_Chatbots_with_Domain-Specific_LLMs_Fine-Tuning.pdf Solo gestori archivio Tipologia: Versione editoriale (versione pubblicata con il layout dell'editore) Licenza d'uso: Tutti i diritti riservati Dimensione 425.65 kB Formato Adobe PDF Visualizza/Apri Richiedi una copia	425.65 kB	Adobe PDF	Visualizza/Apri Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11566/342619

Citazioni

ND

1

0

social impact