CNR Institutional Research Information System

While favouring communications and easing information sharing, Social Network Sites are also used to launch harmful campaigns against specific groups and individuals. Cyberbullism, incitement to self-harm practices, sexual predation are just some of the severe effects of massive online offensives. Moreover, attacks can be carried out against groups of victims and can degenerate in physical violence. In this work, we aim at containing and preventing the alarming diffusion of such hate campaigns. Using Facebook as a benchmark, we consider the textual content of comments appeared on a set of public Italian pages. We first propose a variety of hate categories to distinguish the kind of hate. Crawled comments are then annotated by up to five distinct human annotators, according to the defined taxonomy. Leveraging morpho-syntactical features, sentiment polarity and word embedding lexicons, we design and implement two classifiers for the Italian language, based on different learning algorithms: the first based on Support Vector Machines (SVM) and the second on a particular Recurrent Neural Network named Long Short Term Memory (LSTM). We test these two learning algorithms in order to verify their classification performances on the task of hate speech recognition. The results show the effectiveness of the two classification approaches tested over the first manually annotated Italian Hate Speech Corpus of social media text.

Hate me, hate me not: Hate speech detection on Facebook

F Del Vigna;A Cimino;F Dell'Orletta;M Petrocchi;M Tesconi

2017

Abstract

While favouring communications and easing information sharing, Social Network Sites are also used to launch harmful campaigns against specific groups and individuals. Cyberbullism, incitement to self-harm practices, sexual predation are just some of the severe effects of massive online offensives. Moreover, attacks can be carried out against groups of victims and can degenerate in physical violence. In this work, we aim at containing and preventing the alarming diffusion of such hate campaigns. Using Facebook as a benchmark, we consider the textual content of comments appeared on a set of public Italian pages. We first propose a variety of hate categories to distinguish the kind of hate. Crawled comments are then annotated by up to five distinct human annotators, according to the defined taxonomy. Leveraging morpho-syntactical features, sentiment polarity and word embedding lexicons, we design and implement two classifiers for the Italian language, based on different learning algorithms: the first based on Support Vector Machines (SVM) and the second on a particular Recurrent Neural Network named Long Short Term Memory (LSTM). We test these two learning algorithms in order to verify their classification performances on the task of hate speech recognition. The results show the effectiveness of the two classification approaches tested over the first manually annotated Italian Hate Speech Corpus of social media text.

Scheda breve

Scheda completa

Scheda completa (DC)

Campo DC	Valore	Lingua
dc.authority.anceserie	CEUR WORKSHOP PROCEEDINGS	-
dc.authority.anceserie	CEUR Workshop Proceedings	-
dc.authority.orgunit	Istituto di informatica e telematica - IIT	-
dc.authority.orgunit	Istituto di linguistica computazionale "Antonio Zampolli" - ILC	-
dc.authority.people	F Del Vigna	it
dc.authority.people	A Cimino	it
dc.authority.people	F Dell'Orletta	it
dc.authority.people	M Petrocchi	it
dc.authority.people	M Tesconi	it
dc.collection.id.s	71c7200a-7c5f-4e83-8d57-d3d2ba88f40d	*
dc.collection.name	04.01 Contributo in Atti di convegno	*
dc.contributor.appartenenza	Istituto di informatica e telematica - IIT	*
dc.contributor.appartenenza	Istituto di linguistica computazionale "Antonio Zampolli" - ILC	*
dc.contributor.appartenenza.mi	912	*
dc.contributor.appartenenza.mi	918	*
dc.date.accessioned	2024/02/16 06:12:30	-
dc.date.available	2024/02/16 06:12:30	-
dc.date.issued	2017	-
dc.description.abstracteng	While favouring communications and easing information sharing, Social Network Sites are also used to launch harmful campaigns against specific groups and individuals. Cyberbullism, incitement to self-harm practices, sexual predation are just some of the severe effects of massive online offensives. Moreover, attacks can be carried out against groups of victims and can degenerate in physical violence. In this work, we aim at containing and preventing the alarming diffusion of such hate campaigns. Using Facebook as a benchmark, we consider the textual content of comments appeared on a set of public Italian pages. We first propose a variety of hate categories to distinguish the kind of hate. Crawled comments are then annotated by up to five distinct human annotators, according to the defined taxonomy. Leveraging morpho-syntactical features, sentiment polarity and word embedding lexicons, we design and implement two classifiers for the Italian language, based on different learning algorithms: the first based on Support Vector Machines (SVM) and the second on a particular Recurrent Neural Network named Long Short Term Memory (LSTM). We test these two learning algorithms in order to verify their classification performances on the task of hate speech recognition. The results show the effectiveness of the two classification approaches tested over the first manually annotated Italian Hate Speech Corpus of social media text.	-
dc.description.affiliations	IIT-CNR, Pisa, Italy (1); ILC-CNR, Pisa, Italy (2)	-
dc.description.allpeople	F. Del Vigna ; A. Cimino ; F. Dell'Orletta ; M. Petrocchi ; M. Tesconi	-
dc.description.allpeopleoriginal	F. Del Vigna (1); A. Cimino (2); F. Dell'Orletta (2); M. Petrocchi (1); M. Tesconi (1)	-
dc.description.fulltext	none	en
dc.description.numberofauthors	5	-
dc.identifier.scopus	2-s2.0-85017337270	-
dc.identifier.uri	https://hdl.handle.net/20.500.14243/355348	-
dc.identifier.url	http://www.scopus.com/inward/record.url?eid=2-s2.0-85017337270&partnerID=q2rCbXpz	-
dc.language.iso	eng	-
dc.relation.conferencedate	17-20/01/2017	-
dc.relation.conferencename	ITA-SEC 17	-
dc.relation.conferenceplace	Venezia, Italia	-
dc.relation.firstpage	86	-
dc.relation.lastpage	95	-
dc.relation.volume	1816	-
dc.subject.keywords	Hate speech	-
dc.subject.keywords	NLP	-
dc.subject.keywords	Social Networks	-
dc.subject.singlekeyword	Hate speech	*
dc.subject.singlekeyword	NLP	*
dc.subject.singlekeyword	Social Networks	*
dc.title	Hate me, hate me not: Hate speech detection on Facebook	en
dc.type.driver	info:eu-repo/semantics/conferenceObject	-
dc.type.full	04 Contributo in convegno::04.01 Contributo in Atti di convegno	it
dc.type.miur	273	-
dc.ugov.descaux1	369760	-
iris.orcid.lastModifiedDate	2024/03/15 08:58:43	*
iris.orcid.lastModifiedMillisecond	1710489523424	*
iris.scopus.extIssued	2017	-
iris.scopus.extTitle	Hate me, hate me not: Hate speech detection on Facebook	-
iris.sitodocente.maxattempts	1	-
scopus.authority.anceserie	CEUR WORKSHOP PROCEEDINGS###1613-0073	*
scopus.category	1700	*
scopus.contributor.affiliation	University of Pisa	-
scopus.contributor.affiliation	CNR	-
scopus.contributor.affiliation	CNR	-
scopus.contributor.affiliation	CNR	-
scopus.contributor.affiliation	CNR	-
scopus.contributor.afid	60028868	-
scopus.contributor.afid	60008941	-
scopus.contributor.afid	60008941	-
scopus.contributor.afid	60027212	-
scopus.contributor.afid	60027212	-
scopus.contributor.auid	57188927435	-
scopus.contributor.auid	57002803800	-
scopus.contributor.auid	57540567000	-
scopus.contributor.auid	9433836000	-
scopus.contributor.auid	55884637000	-
scopus.contributor.country	Italy	-
scopus.contributor.country	Italy	-
scopus.contributor.country	Italy	-
scopus.contributor.country	Italy	-
scopus.contributor.country	Italy	-
scopus.contributor.dptid		-
scopus.contributor.dptid		-
scopus.contributor.dptid		-
scopus.contributor.dptid		-
scopus.contributor.dptid		-
scopus.contributor.name	Fabio	-
scopus.contributor.name	Andrea	-
scopus.contributor.name	Felice	-
scopus.contributor.name	Marinella	-
scopus.contributor.name	Maurizio	-
scopus.contributor.subaffiliation		-
scopus.contributor.subaffiliation	Istituto di Linguistica Computazionale;	-
scopus.contributor.subaffiliation	Istituto di Linguistica Computazionale;	-
scopus.contributor.subaffiliation	Istituto di Informatica e Telematica;	-
scopus.contributor.subaffiliation	Istituto di Informatica e Telematica;	-
scopus.contributor.surname	Del Vigna	-
scopus.contributor.surname	Cimino	-
scopus.contributor.surname	Dell'Orletta	-
scopus.contributor.surname	Petrocchi	-
scopus.contributor.surname	Tesconi	-
scopus.date.issued	2017	*
scopus.description.abstracteng	While favouring communications and easing information sharing, Social Network Sites are also used to launch harmful campaigns against specific groups and individuals. Cyberbullism, incitement to self-harm practices, sexual predation are just some of the severe effects of massive online offensives. Moreover, attacks can be carried out against groups of victims and can degenerate in physical violence. In this work, we aim at containing and preventing the alarming diffusion of such hate campaigns. Using Facebook as a benchmark, we consider the textual content of comments appeared on a set of public Italian pages. We first propose a variety of hate categories to distinguish the kind of hate. Crawled comments are then annotated by up to five distinct human annotators, according to the defined taxonomy. Leveraging morpho-syntactical features, sentiment polarity and word embedding lexicons, we design and implement two classifiers for the Italian language, based on different learning algorithms: the first based on Support Vector Machines (SVM) and the second on a particular Recurrent Neural Network named Long Short Term Memory (LSTM). We test these two learning algorithms in order to verify their classification performances on the task of hate speech recognition. The results show the effectiveness of the two classification approaches tested over the first manually annotated Italian Hate Speech Corpus of social media text.	*
scopus.description.allpeopleoriginal	Del Vigna F.; Cimino A.; Dell'Orletta F.; Petrocchi M.; Tesconi M.	*
scopus.differences	scopus.relation.conferencename	*
scopus.differences	scopus.authority.anceserie	*
scopus.differences	scopus.publisher.name	*
scopus.differences	scopus.relation.conferencedate	*
scopus.differences	scopus.description.allpeopleoriginal	*
scopus.differences	scopus.relation.conferenceplace	*
scopus.document.type	cp	*
scopus.document.types	cp	*
scopus.identifier.pui	615346286	*
scopus.identifier.scopus	2-s2.0-85017337270	*
scopus.journal.sourceid	21100218356	*
scopus.language.iso	eng	*
scopus.publisher.name	CEUR-WS	*
scopus.relation.conferencedate	2017	*
scopus.relation.conferencename	1st Italian Conference on Cybersecurity, ITASEC 2017	*
scopus.relation.conferenceplace	ita	*
scopus.relation.firstpage	86	*
scopus.relation.lastpage	95	*
scopus.relation.volume	1816	*
scopus.title	Hate me, hate me not: Hate speech detection on Facebook	*
scopus.titleeng	Hate me, hate me not: Hate speech detection on Facebook	*
Appare nelle tipologie:	04.01 Contributo in Atti di convegno

File in questo prodotto:

Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/355348

Citazioni

ND

329

ND

social impact