While favouring communications and easing information sharing, Social Network Sites are also used to launch harmful campaigns against specific groups and individuals. Cyberbullism, incitement to self-harm practices, sexual predation are just some of the severe effects of massive online offensives. Moreover, attacks can be carried out against groups of victims and can degenerate in physical violence. In this work, we aim at containing and preventing the alarming diffusion of such hate campaigns. Using Facebook as a benchmark, we consider the textual content of comments appeared on a set of public Italian pages. We first propose a variety of hate categories to distinguish the kind of hate. Crawled comments are then annotated by up to five distinct human annotators, according to the defined taxonomy. Leveraging morpho-syntactical features, sentiment polarity and word embedding lexicons, we design and implement two classifiers for the Italian language, based on different learning algorithms: the first based on Support Vector Machines (SVM) and the second on a particular Recurrent Neural Network named Long Short Term Memory (LSTM). We test these two learning algorithms in order to verify their classification performances on the task of hate speech recognition. The results show the effectiveness of the two classification approaches tested over the first manually annotated Italian Hate Speech Corpus of social media text.

Hate me, hate me not: Hate speech detection on Facebook

F Del Vigna;A Cimino;F Dell'Orletta;M Petrocchi;M Tesconi
2017

Abstract

While favouring communications and easing information sharing, Social Network Sites are also used to launch harmful campaigns against specific groups and individuals. Cyberbullism, incitement to self-harm practices, sexual predation are just some of the severe effects of massive online offensives. Moreover, attacks can be carried out against groups of victims and can degenerate in physical violence. In this work, we aim at containing and preventing the alarming diffusion of such hate campaigns. Using Facebook as a benchmark, we consider the textual content of comments appeared on a set of public Italian pages. We first propose a variety of hate categories to distinguish the kind of hate. Crawled comments are then annotated by up to five distinct human annotators, according to the defined taxonomy. Leveraging morpho-syntactical features, sentiment polarity and word embedding lexicons, we design and implement two classifiers for the Italian language, based on different learning algorithms: the first based on Support Vector Machines (SVM) and the second on a particular Recurrent Neural Network named Long Short Term Memory (LSTM). We test these two learning algorithms in order to verify their classification performances on the task of hate speech recognition. The results show the effectiveness of the two classification approaches tested over the first manually annotated Italian Hate Speech Corpus of social media text.
Campo DC Valore Lingua
dc.authority.anceserie CEUR WORKSHOP PROCEEDINGS -
dc.authority.anceserie CEUR Workshop Proceedings -
dc.authority.orgunit Istituto di informatica e telematica - IIT -
dc.authority.orgunit Istituto di linguistica computazionale "Antonio Zampolli" - ILC -
dc.authority.people F Del Vigna it
dc.authority.people A Cimino it
dc.authority.people F Dell'Orletta it
dc.authority.people M Petrocchi it
dc.authority.people M Tesconi it
dc.collection.id.s 71c7200a-7c5f-4e83-8d57-d3d2ba88f40d *
dc.collection.name 04.01 Contributo in Atti di convegno *
dc.contributor.appartenenza Istituto di informatica e telematica - IIT *
dc.contributor.appartenenza Istituto di linguistica computazionale "Antonio Zampolli" - ILC *
dc.contributor.appartenenza.mi 912 *
dc.contributor.appartenenza.mi 918 *
dc.date.accessioned 2024/02/16 06:12:30 -
dc.date.available 2024/02/16 06:12:30 -
dc.date.issued 2017 -
dc.description.abstracteng While favouring communications and easing information sharing, Social Network Sites are also used to launch harmful campaigns against specific groups and individuals. Cyberbullism, incitement to self-harm practices, sexual predation are just some of the severe effects of massive online offensives. Moreover, attacks can be carried out against groups of victims and can degenerate in physical violence. In this work, we aim at containing and preventing the alarming diffusion of such hate campaigns. Using Facebook as a benchmark, we consider the textual content of comments appeared on a set of public Italian pages. We first propose a variety of hate categories to distinguish the kind of hate. Crawled comments are then annotated by up to five distinct human annotators, according to the defined taxonomy. Leveraging morpho-syntactical features, sentiment polarity and word embedding lexicons, we design and implement two classifiers for the Italian language, based on different learning algorithms: the first based on Support Vector Machines (SVM) and the second on a particular Recurrent Neural Network named Long Short Term Memory (LSTM). We test these two learning algorithms in order to verify their classification performances on the task of hate speech recognition. The results show the effectiveness of the two classification approaches tested over the first manually annotated Italian Hate Speech Corpus of social media text. -
dc.description.affiliations IIT-CNR, Pisa, Italy (1); ILC-CNR, Pisa, Italy (2) -
dc.description.allpeople F. Del Vigna ; A. Cimino ; F. Dell'Orletta ; M. Petrocchi ; M. Tesconi -
dc.description.allpeopleoriginal F. Del Vigna (1); A. Cimino (2); F. Dell'Orletta (2); M. Petrocchi (1); M. Tesconi (1) -
dc.description.fulltext none en
dc.description.numberofauthors 5 -
dc.identifier.scopus 2-s2.0-85017337270 -
dc.identifier.uri https://hdl.handle.net/20.500.14243/355348 -
dc.identifier.url http://www.scopus.com/inward/record.url?eid=2-s2.0-85017337270&partnerID=q2rCbXpz -
dc.language.iso eng -
dc.relation.conferencedate 17-20/01/2017 -
dc.relation.conferencename ITA-SEC 17 -
dc.relation.conferenceplace Venezia, Italia -
dc.relation.firstpage 86 -
dc.relation.lastpage 95 -
dc.relation.volume 1816 -
dc.subject.keywords Hate speech -
dc.subject.keywords NLP -
dc.subject.keywords Social Networks -
dc.subject.singlekeyword Hate speech *
dc.subject.singlekeyword NLP *
dc.subject.singlekeyword Social Networks *
dc.title Hate me, hate me not: Hate speech detection on Facebook en
dc.type.driver info:eu-repo/semantics/conferenceObject -
dc.type.full 04 Contributo in convegno::04.01 Contributo in Atti di convegno it
dc.type.miur 273 -
dc.ugov.descaux1 369760 -
iris.orcid.lastModifiedDate 2024/03/15 08:58:43 *
iris.orcid.lastModifiedMillisecond 1710489523424 *
iris.scopus.extIssued 2017 -
iris.scopus.extTitle Hate me, hate me not: Hate speech detection on Facebook -
iris.sitodocente.maxattempts 1 -
scopus.authority.anceserie CEUR WORKSHOP PROCEEDINGS###1613-0073 *
scopus.category 1700 *
scopus.contributor.affiliation University of Pisa -
scopus.contributor.affiliation CNR -
scopus.contributor.affiliation CNR -
scopus.contributor.affiliation CNR -
scopus.contributor.affiliation CNR -
scopus.contributor.afid 60028868 -
scopus.contributor.afid 60008941 -
scopus.contributor.afid 60008941 -
scopus.contributor.afid 60027212 -
scopus.contributor.afid 60027212 -
scopus.contributor.auid 57188927435 -
scopus.contributor.auid 57002803800 -
scopus.contributor.auid 57540567000 -
scopus.contributor.auid 9433836000 -
scopus.contributor.auid 55884637000 -
scopus.contributor.country Italy -
scopus.contributor.country Italy -
scopus.contributor.country Italy -
scopus.contributor.country Italy -
scopus.contributor.country Italy -
scopus.contributor.dptid -
scopus.contributor.dptid -
scopus.contributor.dptid -
scopus.contributor.dptid -
scopus.contributor.dptid -
scopus.contributor.name Fabio -
scopus.contributor.name Andrea -
scopus.contributor.name Felice -
scopus.contributor.name Marinella -
scopus.contributor.name Maurizio -
scopus.contributor.subaffiliation -
scopus.contributor.subaffiliation Istituto di Linguistica Computazionale; -
scopus.contributor.subaffiliation Istituto di Linguistica Computazionale; -
scopus.contributor.subaffiliation Istituto di Informatica e Telematica; -
scopus.contributor.subaffiliation Istituto di Informatica e Telematica; -
scopus.contributor.surname Del Vigna -
scopus.contributor.surname Cimino -
scopus.contributor.surname Dell'Orletta -
scopus.contributor.surname Petrocchi -
scopus.contributor.surname Tesconi -
scopus.date.issued 2017 *
scopus.description.abstracteng While favouring communications and easing information sharing, Social Network Sites are also used to launch harmful campaigns against specific groups and individuals. Cyberbullism, incitement to self-harm practices, sexual predation are just some of the severe effects of massive online offensives. Moreover, attacks can be carried out against groups of victims and can degenerate in physical violence. In this work, we aim at containing and preventing the alarming diffusion of such hate campaigns. Using Facebook as a benchmark, we consider the textual content of comments appeared on a set of public Italian pages. We first propose a variety of hate categories to distinguish the kind of hate. Crawled comments are then annotated by up to five distinct human annotators, according to the defined taxonomy. Leveraging morpho-syntactical features, sentiment polarity and word embedding lexicons, we design and implement two classifiers for the Italian language, based on different learning algorithms: the first based on Support Vector Machines (SVM) and the second on a particular Recurrent Neural Network named Long Short Term Memory (LSTM). We test these two learning algorithms in order to verify their classification performances on the task of hate speech recognition. The results show the effectiveness of the two classification approaches tested over the first manually annotated Italian Hate Speech Corpus of social media text. *
scopus.description.allpeopleoriginal Del Vigna F.; Cimino A.; Dell'Orletta F.; Petrocchi M.; Tesconi M. *
scopus.differences scopus.relation.conferencename *
scopus.differences scopus.authority.anceserie *
scopus.differences scopus.publisher.name *
scopus.differences scopus.relation.conferencedate *
scopus.differences scopus.description.allpeopleoriginal *
scopus.differences scopus.relation.conferenceplace *
scopus.document.type cp *
scopus.document.types cp *
scopus.identifier.pui 615346286 *
scopus.identifier.scopus 2-s2.0-85017337270 *
scopus.journal.sourceid 21100218356 *
scopus.language.iso eng *
scopus.publisher.name CEUR-WS *
scopus.relation.conferencedate 2017 *
scopus.relation.conferencename 1st Italian Conference on Cybersecurity, ITASEC 2017 *
scopus.relation.conferenceplace ita *
scopus.relation.firstpage 86 *
scopus.relation.lastpage 95 *
scopus.relation.volume 1816 *
scopus.title Hate me, hate me not: Hate speech detection on Facebook *
scopus.titleeng Hate me, hate me not: Hate speech detection on Facebook *
Appare nelle tipologie: 04.01 Contributo in Atti di convegno
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/355348
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 325
  • ???jsp.display-item.citation.isi??? ND
social impact