Social networks are perceived by users as a natural environment for publicly sharing their thoughts and emotions about different subjects. In these platforms, despite the availability of various forms of communication, text is the most widespread way of communication. Therefore, the development of automatic techniques for emotion recognition in social media texts, such as tweets or Facebook posts, gives the opportunity of extracting information that could be valuable for many application fields, ranging from the analysis of customer satisfaction to the optimization of political campaigns. Nowadays, several emotion classifiers and datasets have been built for English texts while few resources are available for other languages. For this reason, in this work we present a deep learning algorithm for emotion recognition in Italian social media texts. The algorithm was named EmotionAlBERTo as it is based on AlBERTo, a BERT-based language understanding model for the Italian language. We trained and evaluated EmotionALBERTo on two different Twitter datasets: the MultiEmotions-It dataset and TwIT, a novel Italian dataset for emotion recognition that we built by collecting and manually labelling a corpus of about 3100 Italian tweets. Experiments show that the models achieve remarkable performance on both 4- class and 6-class emotion classification, by respectively obtaining F 1 = 0.91 and F 1 = 0.83 on MultiEmotions-It, F 1 = 0.92 and F 1 = 0.86 on TwIT.

EmotionAlBERTo: Emotion Recognition of Italian Social Media Texts Through BERT / Chiorrini, Andrea; Diamantini, Claudia; Mircoli, Alex; Potena, Domenico; Storti, Emanuele. - (2022), pp. 1706-1711. (Intervento presentato al convegno 26th International Conference on Pattern Recognition) [10.1109/ICPR56361.2022.9956403].

EmotionAlBERTo: Emotion Recognition of Italian Social Media Texts Through BERT

Andrea Chiorrini;Claudia Diamantini;Alex Mircoli;Domenico Potena;Emanuele Storti
2022-01-01

Abstract

Social networks are perceived by users as a natural environment for publicly sharing their thoughts and emotions about different subjects. In these platforms, despite the availability of various forms of communication, text is the most widespread way of communication. Therefore, the development of automatic techniques for emotion recognition in social media texts, such as tweets or Facebook posts, gives the opportunity of extracting information that could be valuable for many application fields, ranging from the analysis of customer satisfaction to the optimization of political campaigns. Nowadays, several emotion classifiers and datasets have been built for English texts while few resources are available for other languages. For this reason, in this work we present a deep learning algorithm for emotion recognition in Italian social media texts. The algorithm was named EmotionAlBERTo as it is based on AlBERTo, a BERT-based language understanding model for the Italian language. We trained and evaluated EmotionALBERTo on two different Twitter datasets: the MultiEmotions-It dataset and TwIT, a novel Italian dataset for emotion recognition that we built by collecting and manually labelling a corpus of about 3100 Italian tweets. Experiments show that the models achieve remarkable performance on both 4- class and 6-class emotion classification, by respectively obtaining F 1 = 0.91 and F 1 = 0.83 on MultiEmotions-It, F 1 = 0.92 and F 1 = 0.86 on TwIT.
2022
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11566/300329
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 2
  • ???jsp.display-item.citation.isi??? 0
social impact