While favouring communications and easing information sharing, Online Social Networks are increasingly used to launch harmful campaigns against specific groups and individuals. Although providers struggle to keep pace by manually removing hate content published on their platforms, recent research efforts rely on automatic text classification techniques, whose performances are usually measured on annotated corpora. In this work, we propose six distinct machine learning classification strategies: three based on conventional machine learning approaches, three based on neural networks. The latter are able to process texts almost from scratch, avoiding the need of i) NLP tools specialised for a specific language, ii) the phase of time-consuming feature engineering, and iii) the high computational cost usually derived from processing a huge amount of features. Thus, the main goal of the paper is to investigate whether it is possible to rely on neural networks and to achieve performance results at least comparable with those of NLP-based classifiers. The performances of the six configurations are evaluated over an annotated dataset consisting of 4,000 Italian tweets and 4,000 Italian Facebook comments. By comparing the classification results, we demonstrate that relying on deep learning techniques for hate speech detection is more than encouraging. In particular, a deep learning model, based on an ensemble approach, obtains a F1 score of 0.786 on the Twitter data and 0.775 on the Facebook ones, the best results, compared to the ones obtained with the other tested configurations.

Six Things I Hate About You (in Italian) and Six Classification Strategies to More and More Effectively Find Them

Tiziano Fagni;Leonardo Nizzoli;Marinella Petrocchi;Maurizio Tesconi
2019

Abstract

While favouring communications and easing information sharing, Online Social Networks are increasingly used to launch harmful campaigns against specific groups and individuals. Although providers struggle to keep pace by manually removing hate content published on their platforms, recent research efforts rely on automatic text classification techniques, whose performances are usually measured on annotated corpora. In this work, we propose six distinct machine learning classification strategies: three based on conventional machine learning approaches, three based on neural networks. The latter are able to process texts almost from scratch, avoiding the need of i) NLP tools specialised for a specific language, ii) the phase of time-consuming feature engineering, and iii) the high computational cost usually derived from processing a huge amount of features. Thus, the main goal of the paper is to investigate whether it is possible to rely on neural networks and to achieve performance results at least comparable with those of NLP-based classifiers. The performances of the six configurations are evaluated over an annotated dataset consisting of 4,000 Italian tweets and 4,000 Italian Facebook comments. By comparing the classification results, we demonstrate that relying on deep learning techniques for hate speech detection is more than encouraging. In particular, a deep learning model, based on an ensemble approach, obtains a F1 score of 0.786 on the Twitter data and 0.775 on the Facebook ones, the best results, compared to the ones obtained with the other tested configurations.
2019
deep learning
hate speech
nlp
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/346459
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact