In the last two years, we have seen a huge number of debates and discussions on COVID-19 in social media. Many authors have analyzed these debates on Facebook and Twitter, while very few ones have considered Reddit. In this paper, we focus on this social network and propose three approaches to extract information from posts on COVID-19 published in it. The first performs a semi-automatic and dynamic classification of Reddit posts. The second automatically constructs virtual subreddits, each characterized by homogeneous themes. The third automatically identifies virtual communities of users with homogeneous themes. The three approaches represent an advance over the past literature. In fact, the latter lacks studies regarding classification algorithms capable of outlining the differences among the thousands of posts on COVID-19 in Reddit. Analogously, it lacks approaches able to build virtual subreddits with homogeneous topics or virtual communities of users with common interests. © 2022 World Scientific Publishing Company.
New Approaches to Extract Information from Posts on COVID-19 Published on Reddit / Bonifazi, G.; Corradini, E.; Ursino, D.; Virgili, L.. - In: INTERNATIONAL JOURNAL OF INFORMATION TECHNOLOGY & DECISION MAKING. - ISSN 0219-6220. - 21:5(2022), pp. 1385-1431. [10.1142/S0219622022500213]
New Approaches to Extract Information from Posts on COVID-19 Published on Reddit
G. Bonifazi
;E. Corradini
;D. Ursino
;L. Virgili
2022-01-01
Abstract
In the last two years, we have seen a huge number of debates and discussions on COVID-19 in social media. Many authors have analyzed these debates on Facebook and Twitter, while very few ones have considered Reddit. In this paper, we focus on this social network and propose three approaches to extract information from posts on COVID-19 published in it. The first performs a semi-automatic and dynamic classification of Reddit posts. The second automatically constructs virtual subreddits, each characterized by homogeneous themes. The third automatically identifies virtual communities of users with homogeneous themes. The three approaches represent an advance over the past literature. In fact, the latter lacks studies regarding classification algorithms capable of outlining the differences among the thousands of posts on COVID-19 in Reddit. Analogously, it lacks approaches able to build virtual subreddits with homogeneous topics or virtual communities of users with common interests. © 2022 World Scientific Publishing Company.File | Dimensione | Formato | |
---|---|---|---|
Bonifazi_Nwe-approaches-toextract_2022.pdf
Solo gestori archivio
Tipologia:
Versione editoriale (versione pubblicata con il layout dell'editore)
Licenza d'uso:
Tutti i diritti riservati
Dimensione
6.08 MB
Formato
Adobe PDF
|
6.08 MB | Adobe PDF | Visualizza/Apri Richiedi una copia |
IJITDM20.pdf
Open Access dal 20/05/2023
Descrizione: Electronic version of an article published as New approaches to extract information from posts on COVID-19 published in Reddit / Bonifazi, G.; Corradini, E.; Ursino, D.; Virgili, L.. - In: INTERNATIONAL JOURNAL OF INFORMATION TECHNOLOGY & DECISION MAKING. - ISSN 0219-6220. - 21:5(2022), pp. 1385-1431. [10.1142/S0219622022500213] © World Scientific Publishing Company https://www.worldscientific.com/worldscinet/ijitdm. Only personal use of this material is permitted. Permission from publisher must be obtained for all other uses, in any current or future media.
Tipologia:
Documento in post-print (versione successiva alla peer review e accettata per la pubblicazione)
Licenza d'uso:
Licenza specifica dell’editore
Dimensione
2.56 MB
Formato
Adobe PDF
|
2.56 MB | Adobe PDF | Visualizza/Apri |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.