While favouring communications and easing information sharing, Social Network Sites are also used to launch harmful campaigns against specific groups and individuals. Cyberbullism, incitement to self-harm practices, sexual predation are just some of the severe effects of massive online offensives. Moreover, attacks can be carried out against groups of victims and can degenerate in physical violence. In this work, we aim at containing and preventing the alarming diffusion of such hate campaigns. Using Facebook as a benchmark, we consider the textual content of comments appeared on a set of public Italian pages. We first propose a variety of hate categories to distinguish the kind of hate. Crawled comments are then annotated by up to five distinct human annotators, according to the defined taxonomy. Leveraging morpho-syntactical features, sentiment polarity and word embedding lexicons, we design and implement two classifiers for the Italian language, based on different learning algorithms: the first based on Support Vector Machines (SVM) and the second on a particular Recurrent Neural Network named Long Short Term Memory (LSTM). We test these two learning algorithms in order to verify their classification performances on the task of hate speech recognition. The results show the effectiveness of the two classification approaches tested over the first manually annotated Italian Hate Speech Corpus of social media text.
Hate me, hate me not: Hate speech detection on Facebook
F Del Vigna;A Cimino;F Dell'Orletta;M Petrocchi;M Tesconi
2017
Abstract
While favouring communications and easing information sharing, Social Network Sites are also used to launch harmful campaigns against specific groups and individuals. Cyberbullism, incitement to self-harm practices, sexual predation are just some of the severe effects of massive online offensives. Moreover, attacks can be carried out against groups of victims and can degenerate in physical violence. In this work, we aim at containing and preventing the alarming diffusion of such hate campaigns. Using Facebook as a benchmark, we consider the textual content of comments appeared on a set of public Italian pages. We first propose a variety of hate categories to distinguish the kind of hate. Crawled comments are then annotated by up to five distinct human annotators, according to the defined taxonomy. Leveraging morpho-syntactical features, sentiment polarity and word embedding lexicons, we design and implement two classifiers for the Italian language, based on different learning algorithms: the first based on Support Vector Machines (SVM) and the second on a particular Recurrent Neural Network named Long Short Term Memory (LSTM). We test these two learning algorithms in order to verify their classification performances on the task of hate speech recognition. The results show the effectiveness of the two classification approaches tested over the first manually annotated Italian Hate Speech Corpus of social media text.| Campo DC | Valore | Lingua |
|---|---|---|
| dc.authority.anceserie | CEUR WORKSHOP PROCEEDINGS | - |
| dc.authority.anceserie | CEUR Workshop Proceedings | - |
| dc.authority.orgunit | Istituto di informatica e telematica - IIT | - |
| dc.authority.orgunit | Istituto di linguistica computazionale "Antonio Zampolli" - ILC | - |
| dc.authority.people | F Del Vigna | it |
| dc.authority.people | A Cimino | it |
| dc.authority.people | F Dell'Orletta | it |
| dc.authority.people | M Petrocchi | it |
| dc.authority.people | M Tesconi | it |
| dc.collection.id.s | 71c7200a-7c5f-4e83-8d57-d3d2ba88f40d | * |
| dc.collection.name | 04.01 Contributo in Atti di convegno | * |
| dc.contributor.appartenenza | Istituto di informatica e telematica - IIT | * |
| dc.contributor.appartenenza | Istituto di linguistica computazionale "Antonio Zampolli" - ILC | * |
| dc.contributor.appartenenza.mi | 912 | * |
| dc.contributor.appartenenza.mi | 918 | * |
| dc.date.accessioned | 2024/02/16 06:12:30 | - |
| dc.date.available | 2024/02/16 06:12:30 | - |
| dc.date.issued | 2017 | - |
| dc.description.abstracteng | While favouring communications and easing information sharing, Social Network Sites are also used to launch harmful campaigns against specific groups and individuals. Cyberbullism, incitement to self-harm practices, sexual predation are just some of the severe effects of massive online offensives. Moreover, attacks can be carried out against groups of victims and can degenerate in physical violence. In this work, we aim at containing and preventing the alarming diffusion of such hate campaigns. Using Facebook as a benchmark, we consider the textual content of comments appeared on a set of public Italian pages. We first propose a variety of hate categories to distinguish the kind of hate. Crawled comments are then annotated by up to five distinct human annotators, according to the defined taxonomy. Leveraging morpho-syntactical features, sentiment polarity and word embedding lexicons, we design and implement two classifiers for the Italian language, based on different learning algorithms: the first based on Support Vector Machines (SVM) and the second on a particular Recurrent Neural Network named Long Short Term Memory (LSTM). We test these two learning algorithms in order to verify their classification performances on the task of hate speech recognition. The results show the effectiveness of the two classification approaches tested over the first manually annotated Italian Hate Speech Corpus of social media text. | - |
| dc.description.affiliations | IIT-CNR, Pisa, Italy (1); ILC-CNR, Pisa, Italy (2) | - |
| dc.description.allpeople | F. Del Vigna ; A. Cimino ; F. Dell'Orletta ; M. Petrocchi ; M. Tesconi | - |
| dc.description.allpeopleoriginal | F. Del Vigna (1); A. Cimino (2); F. Dell'Orletta (2); M. Petrocchi (1); M. Tesconi (1) | - |
| dc.description.fulltext | none | en |
| dc.description.numberofauthors | 5 | - |
| dc.identifier.scopus | 2-s2.0-85017337270 | - |
| dc.identifier.uri | https://hdl.handle.net/20.500.14243/355348 | - |
| dc.identifier.url | http://www.scopus.com/inward/record.url?eid=2-s2.0-85017337270&partnerID=q2rCbXpz | - |
| dc.language.iso | eng | - |
| dc.relation.conferencedate | 17-20/01/2017 | - |
| dc.relation.conferencename | ITA-SEC 17 | - |
| dc.relation.conferenceplace | Venezia, Italia | - |
| dc.relation.firstpage | 86 | - |
| dc.relation.lastpage | 95 | - |
| dc.relation.volume | 1816 | - |
| dc.subject.keywords | Hate speech | - |
| dc.subject.keywords | NLP | - |
| dc.subject.keywords | Social Networks | - |
| dc.subject.singlekeyword | Hate speech | * |
| dc.subject.singlekeyword | NLP | * |
| dc.subject.singlekeyword | Social Networks | * |
| dc.title | Hate me, hate me not: Hate speech detection on Facebook | en |
| dc.type.driver | info:eu-repo/semantics/conferenceObject | - |
| dc.type.full | 04 Contributo in convegno::04.01 Contributo in Atti di convegno | it |
| dc.type.miur | 273 | - |
| dc.ugov.descaux1 | 369760 | - |
| iris.orcid.lastModifiedDate | 2024/03/15 08:58:43 | * |
| iris.orcid.lastModifiedMillisecond | 1710489523424 | * |
| iris.scopus.extIssued | 2017 | - |
| iris.scopus.extTitle | Hate me, hate me not: Hate speech detection on Facebook | - |
| iris.sitodocente.maxattempts | 1 | - |
| scopus.authority.anceserie | CEUR WORKSHOP PROCEEDINGS###1613-0073 | * |
| scopus.category | 1700 | * |
| scopus.contributor.affiliation | University of Pisa | - |
| scopus.contributor.affiliation | CNR | - |
| scopus.contributor.affiliation | CNR | - |
| scopus.contributor.affiliation | CNR | - |
| scopus.contributor.affiliation | CNR | - |
| scopus.contributor.afid | 60028868 | - |
| scopus.contributor.afid | 60008941 | - |
| scopus.contributor.afid | 60008941 | - |
| scopus.contributor.afid | 60027212 | - |
| scopus.contributor.afid | 60027212 | - |
| scopus.contributor.auid | 57188927435 | - |
| scopus.contributor.auid | 57002803800 | - |
| scopus.contributor.auid | 57540567000 | - |
| scopus.contributor.auid | 9433836000 | - |
| scopus.contributor.auid | 55884637000 | - |
| scopus.contributor.country | Italy | - |
| scopus.contributor.country | Italy | - |
| scopus.contributor.country | Italy | - |
| scopus.contributor.country | Italy | - |
| scopus.contributor.country | Italy | - |
| scopus.contributor.dptid | - | |
| scopus.contributor.dptid | - | |
| scopus.contributor.dptid | - | |
| scopus.contributor.dptid | - | |
| scopus.contributor.dptid | - | |
| scopus.contributor.name | Fabio | - |
| scopus.contributor.name | Andrea | - |
| scopus.contributor.name | Felice | - |
| scopus.contributor.name | Marinella | - |
| scopus.contributor.name | Maurizio | - |
| scopus.contributor.subaffiliation | - | |
| scopus.contributor.subaffiliation | Istituto di Linguistica Computazionale; | - |
| scopus.contributor.subaffiliation | Istituto di Linguistica Computazionale; | - |
| scopus.contributor.subaffiliation | Istituto di Informatica e Telematica; | - |
| scopus.contributor.subaffiliation | Istituto di Informatica e Telematica; | - |
| scopus.contributor.surname | Del Vigna | - |
| scopus.contributor.surname | Cimino | - |
| scopus.contributor.surname | Dell'Orletta | - |
| scopus.contributor.surname | Petrocchi | - |
| scopus.contributor.surname | Tesconi | - |
| scopus.date.issued | 2017 | * |
| scopus.description.abstracteng | While favouring communications and easing information sharing, Social Network Sites are also used to launch harmful campaigns against specific groups and individuals. Cyberbullism, incitement to self-harm practices, sexual predation are just some of the severe effects of massive online offensives. Moreover, attacks can be carried out against groups of victims and can degenerate in physical violence. In this work, we aim at containing and preventing the alarming diffusion of such hate campaigns. Using Facebook as a benchmark, we consider the textual content of comments appeared on a set of public Italian pages. We first propose a variety of hate categories to distinguish the kind of hate. Crawled comments are then annotated by up to five distinct human annotators, according to the defined taxonomy. Leveraging morpho-syntactical features, sentiment polarity and word embedding lexicons, we design and implement two classifiers for the Italian language, based on different learning algorithms: the first based on Support Vector Machines (SVM) and the second on a particular Recurrent Neural Network named Long Short Term Memory (LSTM). We test these two learning algorithms in order to verify their classification performances on the task of hate speech recognition. The results show the effectiveness of the two classification approaches tested over the first manually annotated Italian Hate Speech Corpus of social media text. | * |
| scopus.description.allpeopleoriginal | Del Vigna F.; Cimino A.; Dell'Orletta F.; Petrocchi M.; Tesconi M. | * |
| scopus.differences | scopus.relation.conferencename | * |
| scopus.differences | scopus.authority.anceserie | * |
| scopus.differences | scopus.publisher.name | * |
| scopus.differences | scopus.relation.conferencedate | * |
| scopus.differences | scopus.description.allpeopleoriginal | * |
| scopus.differences | scopus.relation.conferenceplace | * |
| scopus.document.type | cp | * |
| scopus.document.types | cp | * |
| scopus.identifier.pui | 615346286 | * |
| scopus.identifier.scopus | 2-s2.0-85017337270 | * |
| scopus.journal.sourceid | 21100218356 | * |
| scopus.language.iso | eng | * |
| scopus.publisher.name | CEUR-WS | * |
| scopus.relation.conferencedate | 2017 | * |
| scopus.relation.conferencename | 1st Italian Conference on Cybersecurity, ITASEC 2017 | * |
| scopus.relation.conferenceplace | ita | * |
| scopus.relation.firstpage | 86 | * |
| scopus.relation.lastpage | 95 | * |
| scopus.relation.volume | 1816 | * |
| scopus.title | Hate me, hate me not: Hate speech detection on Facebook | * |
| scopus.titleeng | Hate me, hate me not: Hate speech detection on Facebook | * |
| Appare nelle tipologie: | 04.01 Contributo in Atti di convegno | |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.


