The paper aims at investigating variations in the writing style of book reviews published on different social reading platforms and referring to books of different genres, which enables acquiring insights into communication strategies adopted by readers to share their reading experiences. To this end, we introduce a corpus-based study focused on the analysis of A Good Review, a novel corpus of online book reviews written in Italian, posted on Amazon and Goodreads, and covering six literary fiction genres. We rely on stylometric analysis to explore the linguistic properties and lexicon of reviews and the authors conducted automatic classification experiments using multiple approaches and feature configurations to predict either the review's platform or the literary genre. The analysis of user-generated reviews demonstrates that language is a quite variable dimension across reading platforms, but not as much across book genres. The classification experiments revealed that features modelling the syntactic structure of the sentence are reliable proxies for discerning Amazon and Goodreads reviews, whereas lexical information showed a higher predictive role for automatically discriminating the genre.

Tell me how you write and I'll tell you what you read: a study on the writing style of book reviews

Chiara Alzetta;Felice Dell'Orletta;Alessio Miaschi;Giulia Venturi
2023

Abstract

The paper aims at investigating variations in the writing style of book reviews published on different social reading platforms and referring to books of different genres, which enables acquiring insights into communication strategies adopted by readers to share their reading experiences. To this end, we introduce a corpus-based study focused on the analysis of A Good Review, a novel corpus of online book reviews written in Italian, posted on Amazon and Goodreads, and covering six literary fiction genres. We rely on stylometric analysis to explore the linguistic properties and lexicon of reviews and the authors conducted automatic classification experiments using multiple approaches and feature configurations to predict either the review's platform or the literary genre. The analysis of user-generated reviews demonstrates that language is a quite variable dimension across reading platforms, but not as much across book genres. The classification experiments revealed that features modelling the syntactic structure of the sentence are reliable proxies for discerning Amazon and Goodreads reviews, whereas lexical information showed a higher predictive role for automatically discriminating the genre.
Campo DC Valore Lingua
dc.authority.ancejournal JOURNAL OF DOCUMENTATION en
dc.authority.orgunit Istituto di linguistica computazionale "Antonio Zampolli" - ILC en
dc.authority.people Chiara Alzetta en
dc.authority.people Felice Dell'Orletta en
dc.authority.people Alessio Miaschi en
dc.authority.people Elena Prat en
dc.authority.people Giulia Venturi en
dc.collection.id.s b3f88f24-048a-4e43-8ab1-6697b90e068e *
dc.collection.name 01.01 Articolo in rivista *
dc.contributor.appartenenza Istituto di linguistica computazionale "Antonio Zampolli" - ILC *
dc.contributor.appartenenza.mi 918 *
dc.contributor.area Non assegn *
dc.contributor.area Non assegn *
dc.contributor.area Non assegn *
dc.contributor.area Non assegn *
dc.date.accessioned 2024/02/20 06:07:47 -
dc.date.available 2024/02/20 06:07:47 -
dc.date.firstsubmission 2025/01/29 15:53:26 *
dc.date.issued 2023 -
dc.date.submission 2025/01/29 15:53:26 *
dc.description.abstracteng The paper aims at investigating variations in the writing style of book reviews published on different social reading platforms and referring to books of different genres, which enables acquiring insights into communication strategies adopted by readers to share their reading experiences. To this end, we introduce a corpus-based study focused on the analysis of A Good Review, a novel corpus of online book reviews written in Italian, posted on Amazon and Goodreads, and covering six literary fiction genres. We rely on stylometric analysis to explore the linguistic properties and lexicon of reviews and the authors conducted automatic classification experiments using multiple approaches and feature configurations to predict either the review's platform or the literary genre. The analysis of user-generated reviews demonstrates that language is a quite variable dimension across reading platforms, but not as much across book genres. The classification experiments revealed that features modelling the syntactic structure of the sentence are reliable proxies for discerning Amazon and Goodreads reviews, whereas lexical information showed a higher predictive role for automatically discriminating the genre. -
dc.description.affiliations Istituto di Linguistica Computazionale Antonio Zampolli Consiglio Nazionale Delle Ricerche, Pisa, Italy; Istituto di Linguistica Computazionale Antonio Zampolli Consiglio Nazionale Delle Ricerche, Pisa, Italy; Istituto di Linguistica Computazionale Antonio Zampolli Consiglio Nazionale Delle Ricerche, Pisa, Italy; Le Mans Universite, Le Mans, France; Istituto di Linguistica Computazionale Antonio Zampolli Consiglio Nazionale Delle Ricerche, Pisa, Italy -
dc.description.allpeople Alzetta, Chiara; Dell'Orletta, Felice; Miaschi, Alessio; Prat, Elena; Venturi, Giulia -
dc.description.allpeopleoriginal Chiara Alzetta, Felice Dell'Orletta, Alessio Miaschi, Elena Prat, Giulia Venturi en
dc.description.fulltext restricted en
dc.description.numberofauthors 5 -
dc.identifier.doi 10.1108/JD-04-2023-0073 en
dc.identifier.isi WOS:001010945100001 -
dc.identifier.scopus 2-s2.0-85162196949 -
dc.identifier.uri https://hdl.handle.net/20.500.14243/439017 -
dc.identifier.url https://www.emerald.com/insight/content/doi/10.1108/JD-04-2023-0073/full/html en
dc.language.iso eng en
dc.miur.last.status.update 2024-12-20T09:02:04Z *
dc.relation.numberofpages 23 en
dc.relation.volume 79 en
dc.subject.keywords Stylometric analysis -
dc.subject.keywords Textual Genre detection -
dc.subject.keywords Book reviews -
dc.subject.singlekeyword Stylometric analysis *
dc.subject.singlekeyword Textual Genre detection *
dc.subject.singlekeyword Book reviews *
dc.title Tell me how you write and I'll tell you what you read: a study on the writing style of book reviews en
dc.type.driver info:eu-repo/semantics/article -
dc.type.full 01 Contributo su Rivista::01.01 Articolo in rivista it
dc.type.miur 262 -
dc.type.referee Sì, ma tipo non specificato en
dc.ugov.descaux1 488202 -
iris.isi.extIssued 2024 -
iris.isi.extTitle Tell me how you write and I'll tell you what you read: a study on the writing style of book reviews -
iris.mediafilter.data 2025/04/04 04:37:29 *
iris.orcid.lastModifiedDate 2025/03/05 05:45:30 *
iris.orcid.lastModifiedMillisecond 1741149930323 *
iris.scopus.extIssued 2024 -
iris.scopus.extTitle Tell me how you write and I'll tell you what you read: a study on the writing style of book reviews -
iris.sitodocente.maxattempts 1 -
iris.unpaywall.bestoahost repository *
iris.unpaywall.bestoaversion submittedVersion *
iris.unpaywall.doi 10.1108/jd-04-2023-0073 *
iris.unpaywall.hosttype repository *
iris.unpaywall.isoa true *
iris.unpaywall.journalisindoaj false *
iris.unpaywall.landingpage https://hal.science/hal-04137185 *
iris.unpaywall.license other-oa *
iris.unpaywall.metadataCallLastModified 28/04/2026 04:52:50 -
iris.unpaywall.metadataCallLastModifiedMillisecond 1777344770921 -
iris.unpaywall.oastatus green *
iris.unpaywall.pdfurl https://hal.science/hal-04137185/document *
isi.authority.ancejournal JOURNAL OF DOCUMENTATION###0022-0418 *
isi.category NU *
isi.category ET *
isi.contributor.affiliation Consiglio Nazionale delle Ricerche (CNR) -
isi.contributor.affiliation Consiglio Nazionale delle Ricerche (CNR) -
isi.contributor.affiliation Consiglio Nazionale delle Ricerche (CNR) -
isi.contributor.affiliation Le Mans Universite -
isi.contributor.affiliation Consiglio Nazionale delle Ricerche (CNR) -
isi.contributor.country Italy -
isi.contributor.country Italy -
isi.contributor.country Italy -
isi.contributor.country France -
isi.contributor.country Italy -
isi.contributor.name Chiara -
isi.contributor.name Felice -
isi.contributor.name Alessio -
isi.contributor.name Elena -
isi.contributor.name Giulia -
isi.contributor.researcherId KVX-9760-2024 -
isi.contributor.researcherId AAX-1864-2020 -
isi.contributor.researcherId GCD-5321-2022 -
isi.contributor.researcherId DMH-8029-2022 -
isi.contributor.researcherId AAY-3932-2020 -
isi.contributor.subaffiliation Ist Linguist Computaz Antonio Zampolli -
isi.contributor.subaffiliation Ist Linguist Computaz Antonio Zampolli -
isi.contributor.subaffiliation Ist Linguist Computaz Antonio Zampolli -
isi.contributor.subaffiliation -
isi.contributor.subaffiliation Ist Linguist Computaz Antonio Zampolli -
isi.contributor.surname Alzetta -
isi.contributor.surname Dell'Orletta -
isi.contributor.surname Miaschi -
isi.contributor.surname Prat -
isi.contributor.surname Venturi -
isi.date.issued 2024 *
isi.description.abstracteng PurposeThe authors' goal is to investigate variations in the writing style of book reviews published on different social reading platforms and referring to books of different genres, which enables acquiring insights into communication strategies adopted by readers to share their reading experiences.Design/methodology/approachThe authors propose a corpus-based study focused on the analysis of A Good Review, a novel corpus of online book reviews written in Italian, posted on Amazon and Goodreads, and covering six literary fiction genres. The authors rely on stylometric analysis to explore the linguistic properties and lexicon of reviews and the authors conducted automatic classification experiments using multiple approaches and feature configurations to predict either the review's platform or the literary genre.FindingsThe analysis of user-generated reviews demonstrates that language is a quite variable dimension across reading platforms, but not as much across book genres. The classification experiments revealed that features modelling the syntactic structure of the sentence are reliable proxies for discerning Amazon and Goodreads reviews, whereas lexical information showed a higher predictive role for automatically discriminating the genre.Originality/valueThe high availability of cultural products makes information services necessary to help users navigate these resources and acquire information from unstructured data. This study contributes to a better understanding of the linguistic characteristics of user-generated book reviews, which can support the development of linguistically-informed recommendation services. Additionally, the authors release a novel corpus of online book reviews meant to support the reproducibility and advancements of the research. *
isi.description.allpeopleoriginal Alzetta, C; Dell'Orletta, F; Miaschi, A; Prat, E; Venturi, G; *
isi.document.sourcetype WOS.SSCI *
isi.document.type Article *
isi.document.types Article *
isi.identifier.doi 10.1108/JD-04-2023-0073 *
isi.identifier.eissn 1758-7379 *
isi.identifier.isi WOS:001010945100001 *
isi.journal.journaltitle JOURNAL OF DOCUMENTATION *
isi.journal.journaltitleabbrev J DOC *
isi.language.original English *
isi.publisher.place Floor 5, Northspring 21-23 Wellington Street, Leeds, W YORKSHIRE, ENGLAND *
isi.relation.firstpage 180 *
isi.relation.issue 1 *
isi.relation.lastpage 202 *
isi.relation.volume 80 *
isi.title Tell me how you write and I'll tell you what you read: a study on the writing style of book reviews *
scopus.authority.ancejournal JOURNAL OF DOCUMENTATION###0022-0418 *
scopus.category 1710 *
scopus.category 3309 *
scopus.contributor.affiliation Istituto di Linguistica Computazionale Antonio Zampolli Consiglio Nazionale Delle Ricerche -
scopus.contributor.affiliation Istituto di Linguistica Computazionale Antonio Zampolli Consiglio Nazionale Delle Ricerche -
scopus.contributor.affiliation Istituto di Linguistica Computazionale Antonio Zampolli Consiglio Nazionale Delle Ricerche -
scopus.contributor.affiliation Le Mans Universite -
scopus.contributor.affiliation Istituto di Linguistica Computazionale Antonio Zampolli Consiglio Nazionale Delle Ricerche -
scopus.contributor.afid 60008941 -
scopus.contributor.afid 60008941 -
scopus.contributor.afid 60008941 -
scopus.contributor.afid 60004848 -
scopus.contributor.afid 60008941 -
scopus.contributor.auid 57192938832 -
scopus.contributor.auid 57540567000 -
scopus.contributor.auid 57211678681 -
scopus.contributor.auid 58318779600 -
scopus.contributor.auid 27568199800 -
scopus.contributor.country Italy -
scopus.contributor.country Italy -
scopus.contributor.country Italy -
scopus.contributor.country France -
scopus.contributor.country Italy -
scopus.contributor.dptid -
scopus.contributor.dptid -
scopus.contributor.dptid -
scopus.contributor.dptid -
scopus.contributor.dptid -
scopus.contributor.name Chiara -
scopus.contributor.name Felice -
scopus.contributor.name Alessio -
scopus.contributor.name Elena -
scopus.contributor.name Giulia -
scopus.contributor.subaffiliation -
scopus.contributor.subaffiliation -
scopus.contributor.subaffiliation -
scopus.contributor.subaffiliation -
scopus.contributor.subaffiliation -
scopus.contributor.surname Alzetta -
scopus.contributor.surname Dell'Orletta -
scopus.contributor.surname Miaschi -
scopus.contributor.surname Prat -
scopus.contributor.surname Venturi -
scopus.date.issued 2024 *
scopus.description.abstracteng Purpose: The authors’ goal is to investigate variations in the writing style of book reviews published on different social reading platforms and referring to books of different genres, which enables acquiring insights into communication strategies adopted by readers to share their reading experiences. Design/methodology/approach: The authors propose a corpus-based study focused on the analysis of A Good Review, a novel corpus of online book reviews written in Italian, posted on Amazon and Goodreads, and covering six literary fiction genres. The authors rely on stylometric analysis to explore the linguistic properties and lexicon of reviews and the authors conducted automatic classification experiments using multiple approaches and feature configurations to predict either the review's platform or the literary genre. Findings: The analysis of user-generated reviews demonstrates that language is a quite variable dimension across reading platforms, but not as much across book genres. The classification experiments revealed that features modelling the syntactic structure of the sentence are reliable proxies for discerning Amazon and Goodreads reviews, whereas lexical information showed a higher predictive role for automatically discriminating the genre. Originality/value: The high availability of cultural products makes information services necessary to help users navigate these resources and acquire information from unstructured data. This study contributes to a better understanding of the linguistic characteristics of user-generated book reviews, which can support the development of linguistically-informed recommendation services. Additionally, the authors release a novel corpus of online book reviews meant to support the reproducibility and advancements of the research. *
scopus.description.allpeopleoriginal Alzetta C.; Dell'Orletta F.; Miaschi A.; Prat E.; Venturi G. *
scopus.differences scopus.relation.lastpage *
scopus.differences scopus.subject.keywords *
scopus.differences scopus.relation.firstpage *
scopus.differences scopus.description.allpeopleoriginal *
scopus.differences scopus.description.abstracteng *
scopus.differences scopus.relation.issue *
scopus.differences scopus.date.issued *
scopus.differences scopus.relation.volume *
scopus.document.type ar *
scopus.document.types ar *
scopus.funding.funders 501100009888 - Regione Toscana; *
scopus.identifier.doi 10.1108/JD-04-2023-0073 *
scopus.identifier.pui 2023922005 *
scopus.identifier.scopus 2-s2.0-85162196949 *
scopus.journal.sourceid 12794 *
scopus.language.iso eng *
scopus.publisher.name Emerald Publishing *
scopus.relation.firstpage 180 *
scopus.relation.issue 1 *
scopus.relation.lastpage 202 *
scopus.relation.volume 80 *
scopus.subject.keywords Book reviews; Computational linguistics; Genre detection; Machine learning; Reading platform; Stylometric analysis; *
scopus.title Tell me how you write and I'll tell you what you read: a study on the writing style of book reviews *
scopus.titleeng Tell me how you write and I'll tell you what you read: a study on the writing style of book reviews *
Appare nelle tipologie: 01.01 Articolo in rivista
File in questo prodotto:
File Dimensione Formato  
10-1108_jd-04-2023-0073.pdf

solo utenti autorizzati

Tipologia: Versione Editoriale (PDF)
Licenza: NON PUBBLICO - Accesso privato/ristretto
Dimensione 1.98 MB
Formato Adobe PDF
1.98 MB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/439017
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 1
  • ???jsp.display-item.citation.isi??? 1
social impact