Online product reviews have become increasingly important in digital consumer markets where they play a crucial role in making purchasing decisions by most consumers. Unfortunately, spammers often take advantage of online reviews by writing fake reviews to promote/demote certain products. Most of the previous studies have focused on detecting fake reviews and individual fake reviewer-ids. However, to target a particular product, fake reviewers work collaboratively in groups and/or create multiple fake ids to write reviews and control the sentiments of the product. This article addresses the problem of finding such fake reviewer groups. More specifically, we propose a top-down framework for candidate fake reviewer groups' detection based on the DeepWalk approach on reviewers' graph data and a (modified) semisupervised clustering method, which can incorporate partial background knowledge. We validate our proposed framework on a real review dataset from the Google Play Store, which has partial ground-truth information about 2207 fraud reviewer-ids out of all 38,123 reviewer-ids in the dataset. Our experimental results demonstrate that the proposed approach is able to identify the candidate spammer groups with reasonable accuracy. The proposed approach can also be extended to detect groups of opinion spammers in social media (e.g. fake comments or fake postings) with temporal affinity, semantic characteristics, and sentiment analysis.

Identifying Fake Reviewer Groups using a Semi-Supervised Approach

2021

Abstract

Online product reviews have become increasingly important in digital consumer markets where they play a crucial role in making purchasing decisions by most consumers. Unfortunately, spammers often take advantage of online reviews by writing fake reviews to promote/demote certain products. Most of the previous studies have focused on detecting fake reviews and individual fake reviewer-ids. However, to target a particular product, fake reviewers work collaboratively in groups and/or create multiple fake ids to write reviews and control the sentiments of the product. This article addresses the problem of finding such fake reviewer groups. More specifically, we propose a top-down framework for candidate fake reviewer groups' detection based on the DeepWalk approach on reviewers' graph data and a (modified) semisupervised clustering method, which can incorporate partial background knowledge. We validate our proposed framework on a real review dataset from the Google Play Store, which has partial ground-truth information about 2207 fraud reviewer-ids out of all 38,123 reviewer-ids in the dataset. Our experimental results demonstrate that the proposed approach is able to identify the candidate spammer groups with reasonable accuracy. The proposed approach can also be extended to detect groups of opinion spammers in social media (e.g. fake comments or fake postings) with temporal affinity, semantic characteristics, and sentiment analysis.
2021
Istituto di informatica e telematica - IIT
fake reviewers detection
clustering
semi-supervised clustering
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/402922
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact