The industrial landscape is experiencing a significant transformation and there is a growing emphasis on human-machine collaboration, personalization, and sustainable manufacturing. Within this context, large language models (LLMs) have emerged as pivotal Artificial Intelligence (AI) tools that enhance communication, streamline operations, and support complex decision-making tasks through advanced natural language processing capabilities. This paper explores the process of fine-tuning a foundational general-purpose LLM, specifically the open-source LLaMA 3 model, to cater to industrial applications. We delve into the methodology of fine-tuning LLaMA on domain-specific data from two distinct industrial scenarios (i.e., product management and production line operations). Our results demonstrate significant improvements in the model’s performance, accuracy, and relevance in generating responses, thereby enhancing decision-making and operational processes in industrial settings. Additionally, we propose the integration of the fine-tuned model into a chatbot system designed to assist human operators by providing targeted, accurate responses in real-time. This paper aims to provide a comprehensive guide for industry practitioners and researchers to harness the power of LLMs for specialized industrial tasks, ultimately contributing to the advancement of intelligent industrial automation and decision-making systems.

Improving Industrial Question Answering Chatbots with Domain-Specific LLMs Fine-Tuning / Rosati, Riccardo; Antonini, Filippo; Muralikrishna, Nikhil; Tonetto, Flavio; Mancini, Adriano. - (2024). (Intervento presentato al convegno 20th IEEE/ASME International Conference on Mechatronic, Embedded Systems and Applications, MESA 2024 tenutosi a Genova, Italy nel 2024) [10.1109/mesa61532.2024.10704843].

Improving Industrial Question Answering Chatbots with Domain-Specific LLMs Fine-Tuning

Rosati, Riccardo;Antonini, Filippo;Muralikrishna, Nikhil;Mancini, Adriano
2024-01-01

Abstract

The industrial landscape is experiencing a significant transformation and there is a growing emphasis on human-machine collaboration, personalization, and sustainable manufacturing. Within this context, large language models (LLMs) have emerged as pivotal Artificial Intelligence (AI) tools that enhance communication, streamline operations, and support complex decision-making tasks through advanced natural language processing capabilities. This paper explores the process of fine-tuning a foundational general-purpose LLM, specifically the open-source LLaMA 3 model, to cater to industrial applications. We delve into the methodology of fine-tuning LLaMA on domain-specific data from two distinct industrial scenarios (i.e., product management and production line operations). Our results demonstrate significant improvements in the model’s performance, accuracy, and relevance in generating responses, thereby enhancing decision-making and operational processes in industrial settings. Additionally, we propose the integration of the fine-tuned model into a chatbot system designed to assist human operators by providing targeted, accurate responses in real-time. This paper aims to provide a comprehensive guide for industry practitioners and researchers to harness the power of LLMs for specialized industrial tasks, ultimately contributing to the advancement of intelligent industrial automation and decision-making systems.
2024
9798331516239
File in questo prodotto:
File Dimensione Formato  
Improving_Industrial_Question_Answering_Chatbots_with_Domain-Specific_LLMs_Fine-Tuning.pdf

Solo gestori archivio

Tipologia: Versione editoriale (versione pubblicata con il layout dell'editore)
Licenza d'uso: Tutti i diritti riservati
Dimensione 425.65 kB
Formato Adobe PDF
425.65 kB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11566/342619
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 1
  • ???jsp.display-item.citation.isi??? 0
social impact