In the last two years, we have seen a huge number of debates and discussions on COVID-19 in social media. Many authors have analyzed these debates on Facebook and Twitter, while very few ones have considered Reddit. In this paper, we focus on this social network and propose three approaches to extract information from posts on COVID-19 published in it. The first performs a semi-automatic and dynamic classification of Reddit posts. The second automatically constructs virtual subreddits, each characterized by homogeneous themes. The third automatically identifies virtual communities of users with homogeneous themes. The three approaches represent an advance over the past literature. In fact, the latter lacks studies regarding classification algorithms capable of outlining the differences among the thousands of posts on COVID-19 in Reddit. Analogously, it lacks approaches able to build virtual subreddits with homogeneous topics or virtual communities of users with common interests. © 2022 World Scientific Publishing Company.

New Approaches to Extract Information from Posts on COVID-19 Published on Reddit / Bonifazi, G.; Corradini, E.; Ursino, D.; Virgili, L.. - In: INTERNATIONAL JOURNAL OF INFORMATION TECHNOLOGY & DECISION MAKING. - ISSN 0219-6220. - 21:5(2022), pp. 1385-1431. [10.1142/S0219622022500213]

New Approaches to Extract Information from Posts on COVID-19 Published on Reddit

G. Bonifazi
;
E. Corradini
;
D. Ursino
;
L. Virgili
2022-01-01

Abstract

In the last two years, we have seen a huge number of debates and discussions on COVID-19 in social media. Many authors have analyzed these debates on Facebook and Twitter, while very few ones have considered Reddit. In this paper, we focus on this social network and propose three approaches to extract information from posts on COVID-19 published in it. The first performs a semi-automatic and dynamic classification of Reddit posts. The second automatically constructs virtual subreddits, each characterized by homogeneous themes. The third automatically identifies virtual communities of users with homogeneous themes. The three approaches represent an advance over the past literature. In fact, the latter lacks studies regarding classification algorithms capable of outlining the differences among the thousands of posts on COVID-19 in Reddit. Analogously, it lacks approaches able to build virtual subreddits with homogeneous topics or virtual communities of users with common interests. © 2022 World Scientific Publishing Company.
File in questo prodotto:
File Dimensione Formato  
Bonifazi_Nwe-approaches-toextract_2022.pdf

Solo gestori archivio

Tipologia: Versione editoriale (versione pubblicata con il layout dell'editore)
Licenza d'uso: Tutti i diritti riservati
Dimensione 6.08 MB
Formato Adobe PDF
6.08 MB Adobe PDF   Visualizza/Apri   Richiedi una copia
IJITDM20.pdf

Open Access dal 20/05/2023

Descrizione: Electronic version of an article published as New approaches to extract information from posts on COVID-19 published in Reddit / Bonifazi, G.; Corradini, E.; Ursino, D.; Virgili, L.. - In: INTERNATIONAL JOURNAL OF INFORMATION TECHNOLOGY & DECISION MAKING. - ISSN 0219-6220. - 21:5(2022), pp. 1385-1431. [10.1142/S0219622022500213] © World Scientific Publishing Company https://www.worldscientific.com/worldscinet/ijitdm. Only personal use of this material is permitted. Permission from publisher must be obtained for all other uses, in any current or future media.
Tipologia: Documento in post-print (versione successiva alla peer review e accettata per la pubblicazione)
Licenza d'uso: Licenza specifica dell’editore
Dimensione 2.56 MB
Formato Adobe PDF
2.56 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11566/297921
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 7
  • ???jsp.display-item.citation.isi??? 8
social impact