Nowadays, the spread of deceptive reviews is a problem that has reached critical dimensions, having a significant economic impact on business activities. This paper aims to estimate – at the quantitative and qualitative levels – the possibility of using particular words to disambiguate between truthful and deceptive text, focusing on reviews produced in the cultural heritage domain. For this purpose, a lexicon-based methodology has used two different lexicons: sentiment information, intensifiers, downtoners, and negation operators. As known in the literature, these elements are crucial in a classification process related to deceptiveness. The evaluation phase has considered quantitative metrics such as accuracy and F1 score and ad hoc developed metrics that consider specific linguistic parameters such as polarity and tone of voice intensifiers. A qualitative analysis of a subset of the corpus has also been carried out to understand better factors that impact the classification of deceptive review. Several linguistic features have been considered, ranging from the number of intensifiers to their type and position in phrases and sentences. A comparison between the performances of two different lexicons used has been added to the analysis.

Classifying deceptive reviews for the cultural heritage domain: A lexicon-based approach for the Italian language

Guarasci R.
Primo
Writing – Original Draft Preparation
;
Catelli R.
Secondo
Software
;
Esposito M.
Ultimo
Supervision
2024

Abstract

Nowadays, the spread of deceptive reviews is a problem that has reached critical dimensions, having a significant economic impact on business activities. This paper aims to estimate – at the quantitative and qualitative levels – the possibility of using particular words to disambiguate between truthful and deceptive text, focusing on reviews produced in the cultural heritage domain. For this purpose, a lexicon-based methodology has used two different lexicons: sentiment information, intensifiers, downtoners, and negation operators. As known in the literature, these elements are crucial in a classification process related to deceptiveness. The evaluation phase has considered quantitative metrics such as accuracy and F1 score and ad hoc developed metrics that consider specific linguistic parameters such as polarity and tone of voice intensifiers. A qualitative analysis of a subset of the corpus has also been carried out to understand better factors that impact the classification of deceptive review. Several linguistic features have been considered, ranging from the number of intensifiers to their type and position in phrases and sentences. A comparison between the performances of two different lexicons used has been added to the analysis.
2024
Istituto di Calcolo e Reti ad Alte Prestazioni - ICAR
Cultural heritage
Fake reviews detection
Lexicon-based
File in questo prodotto:
File Dimensione Formato  
ESWA_culturalHeritage (2).pdf

non disponibili

Tipologia: Versione Editoriale (PDF)
Licenza: NON PUBBLICO - Accesso privato/ristretto
Dimensione 545.96 kB
Formato Adobe PDF
545.96 kB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/505681
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 3
  • ???jsp.display-item.citation.isi??? ND
social impact