The paper aims at investigating variations in the writing style of book reviews published on different social reading platforms and referring to books of different genres, which enables acquiring insights into communication strategies adopted by readers to share their reading experiences. To this end, we introduce a corpus-based study focused on the analysis of A Good Review, a novel corpus of online book reviews written in Italian, posted on Amazon and Goodreads, and covering six literary fiction genres. We rely on stylometric analysis to explore the linguistic properties and lexicon of reviews and the authors conducted automatic classification experiments using multiple approaches and feature configurations to predict either the review's platform or the literary genre. The analysis of user-generated reviews demonstrates that language is a quite variable dimension across reading platforms, but not as much across book genres. The classification experiments revealed that features modelling the syntactic structure of the sentence are reliable proxies for discerning Amazon and Goodreads reviews, whereas lexical information showed a higher predictive role for automatically discriminating the genre.
Tell me how you write and I'll tell you what you read: a study on the writing style of book reviews
Chiara Alzetta;Felice Dell'Orletta;Alessio Miaschi;Giulia Venturi
2023
Abstract
The paper aims at investigating variations in the writing style of book reviews published on different social reading platforms and referring to books of different genres, which enables acquiring insights into communication strategies adopted by readers to share their reading experiences. To this end, we introduce a corpus-based study focused on the analysis of A Good Review, a novel corpus of online book reviews written in Italian, posted on Amazon and Goodreads, and covering six literary fiction genres. We rely on stylometric analysis to explore the linguistic properties and lexicon of reviews and the authors conducted automatic classification experiments using multiple approaches and feature configurations to predict either the review's platform or the literary genre. The analysis of user-generated reviews demonstrates that language is a quite variable dimension across reading platforms, but not as much across book genres. The classification experiments revealed that features modelling the syntactic structure of the sentence are reliable proxies for discerning Amazon and Goodreads reviews, whereas lexical information showed a higher predictive role for automatically discriminating the genre.| Campo DC | Valore | Lingua |
|---|---|---|
| dc.authority.ancejournal | JOURNAL OF DOCUMENTATION | en |
| dc.authority.orgunit | Istituto di linguistica computazionale "Antonio Zampolli" - ILC | en |
| dc.authority.people | Chiara Alzetta | en |
| dc.authority.people | Felice Dell'Orletta | en |
| dc.authority.people | Alessio Miaschi | en |
| dc.authority.people | Elena Prat | en |
| dc.authority.people | Giulia Venturi | en |
| dc.collection.id.s | b3f88f24-048a-4e43-8ab1-6697b90e068e | * |
| dc.collection.name | 01.01 Articolo in rivista | * |
| dc.contributor.appartenenza | Istituto di linguistica computazionale "Antonio Zampolli" - ILC | * |
| dc.contributor.appartenenza.mi | 918 | * |
| dc.contributor.area | Non assegn | * |
| dc.contributor.area | Non assegn | * |
| dc.contributor.area | Non assegn | * |
| dc.contributor.area | Non assegn | * |
| dc.date.accessioned | 2024/02/20 06:07:47 | - |
| dc.date.available | 2024/02/20 06:07:47 | - |
| dc.date.firstsubmission | 2025/01/29 15:53:26 | * |
| dc.date.issued | 2023 | - |
| dc.date.submission | 2025/01/29 15:53:26 | * |
| dc.description.abstracteng | The paper aims at investigating variations in the writing style of book reviews published on different social reading platforms and referring to books of different genres, which enables acquiring insights into communication strategies adopted by readers to share their reading experiences. To this end, we introduce a corpus-based study focused on the analysis of A Good Review, a novel corpus of online book reviews written in Italian, posted on Amazon and Goodreads, and covering six literary fiction genres. We rely on stylometric analysis to explore the linguistic properties and lexicon of reviews and the authors conducted automatic classification experiments using multiple approaches and feature configurations to predict either the review's platform or the literary genre. The analysis of user-generated reviews demonstrates that language is a quite variable dimension across reading platforms, but not as much across book genres. The classification experiments revealed that features modelling the syntactic structure of the sentence are reliable proxies for discerning Amazon and Goodreads reviews, whereas lexical information showed a higher predictive role for automatically discriminating the genre. | - |
| dc.description.affiliations | Istituto di Linguistica Computazionale Antonio Zampolli Consiglio Nazionale Delle Ricerche, Pisa, Italy; Istituto di Linguistica Computazionale Antonio Zampolli Consiglio Nazionale Delle Ricerche, Pisa, Italy; Istituto di Linguistica Computazionale Antonio Zampolli Consiglio Nazionale Delle Ricerche, Pisa, Italy; Le Mans Universite, Le Mans, France; Istituto di Linguistica Computazionale Antonio Zampolli Consiglio Nazionale Delle Ricerche, Pisa, Italy | - |
| dc.description.allpeople | Alzetta, Chiara; Dell'Orletta, Felice; Miaschi, Alessio; Prat, Elena; Venturi, Giulia | - |
| dc.description.allpeopleoriginal | Chiara Alzetta, Felice Dell'Orletta, Alessio Miaschi, Elena Prat, Giulia Venturi | en |
| dc.description.fulltext | restricted | en |
| dc.description.numberofauthors | 5 | - |
| dc.identifier.doi | 10.1108/JD-04-2023-0073 | en |
| dc.identifier.isi | WOS:001010945100001 | - |
| dc.identifier.scopus | 2-s2.0-85162196949 | - |
| dc.identifier.uri | https://hdl.handle.net/20.500.14243/439017 | - |
| dc.identifier.url | https://www.emerald.com/insight/content/doi/10.1108/JD-04-2023-0073/full/html | en |
| dc.language.iso | eng | en |
| dc.miur.last.status.update | 2024-12-20T09:02:04Z | * |
| dc.relation.numberofpages | 23 | en |
| dc.relation.volume | 79 | en |
| dc.subject.keywords | Stylometric analysis | - |
| dc.subject.keywords | Textual Genre detection | - |
| dc.subject.keywords | Book reviews | - |
| dc.subject.singlekeyword | Stylometric analysis | * |
| dc.subject.singlekeyword | Textual Genre detection | * |
| dc.subject.singlekeyword | Book reviews | * |
| dc.title | Tell me how you write and I'll tell you what you read: a study on the writing style of book reviews | en |
| dc.type.driver | info:eu-repo/semantics/article | - |
| dc.type.full | 01 Contributo su Rivista::01.01 Articolo in rivista | it |
| dc.type.miur | 262 | - |
| dc.type.referee | Sì, ma tipo non specificato | en |
| dc.ugov.descaux1 | 488202 | - |
| iris.isi.extIssued | 2024 | - |
| iris.isi.extTitle | Tell me how you write and I'll tell you what you read: a study on the writing style of book reviews | - |
| iris.mediafilter.data | 2025/04/04 04:37:29 | * |
| iris.orcid.lastModifiedDate | 2025/03/05 05:45:30 | * |
| iris.orcid.lastModifiedMillisecond | 1741149930323 | * |
| iris.scopus.extIssued | 2024 | - |
| iris.scopus.extTitle | Tell me how you write and I'll tell you what you read: a study on the writing style of book reviews | - |
| iris.sitodocente.maxattempts | 1 | - |
| iris.unpaywall.bestoahost | repository | * |
| iris.unpaywall.bestoaversion | submittedVersion | * |
| iris.unpaywall.doi | 10.1108/jd-04-2023-0073 | * |
| iris.unpaywall.hosttype | repository | * |
| iris.unpaywall.isoa | true | * |
| iris.unpaywall.journalisindoaj | false | * |
| iris.unpaywall.landingpage | https://hal.science/hal-04137185 | * |
| iris.unpaywall.license | other-oa | * |
| iris.unpaywall.metadataCallLastModified | 28/04/2026 04:52:50 | - |
| iris.unpaywall.metadataCallLastModifiedMillisecond | 1777344770921 | - |
| iris.unpaywall.oastatus | green | * |
| iris.unpaywall.pdfurl | https://hal.science/hal-04137185/document | * |
| isi.authority.ancejournal | JOURNAL OF DOCUMENTATION###0022-0418 | * |
| isi.category | NU | * |
| isi.category | ET | * |
| isi.contributor.affiliation | Consiglio Nazionale delle Ricerche (CNR) | - |
| isi.contributor.affiliation | Consiglio Nazionale delle Ricerche (CNR) | - |
| isi.contributor.affiliation | Consiglio Nazionale delle Ricerche (CNR) | - |
| isi.contributor.affiliation | Le Mans Universite | - |
| isi.contributor.affiliation | Consiglio Nazionale delle Ricerche (CNR) | - |
| isi.contributor.country | Italy | - |
| isi.contributor.country | Italy | - |
| isi.contributor.country | Italy | - |
| isi.contributor.country | France | - |
| isi.contributor.country | Italy | - |
| isi.contributor.name | Chiara | - |
| isi.contributor.name | Felice | - |
| isi.contributor.name | Alessio | - |
| isi.contributor.name | Elena | - |
| isi.contributor.name | Giulia | - |
| isi.contributor.researcherId | KVX-9760-2024 | - |
| isi.contributor.researcherId | AAX-1864-2020 | - |
| isi.contributor.researcherId | GCD-5321-2022 | - |
| isi.contributor.researcherId | DMH-8029-2022 | - |
| isi.contributor.researcherId | AAY-3932-2020 | - |
| isi.contributor.subaffiliation | Ist Linguist Computaz Antonio Zampolli | - |
| isi.contributor.subaffiliation | Ist Linguist Computaz Antonio Zampolli | - |
| isi.contributor.subaffiliation | Ist Linguist Computaz Antonio Zampolli | - |
| isi.contributor.subaffiliation | - | |
| isi.contributor.subaffiliation | Ist Linguist Computaz Antonio Zampolli | - |
| isi.contributor.surname | Alzetta | - |
| isi.contributor.surname | Dell'Orletta | - |
| isi.contributor.surname | Miaschi | - |
| isi.contributor.surname | Prat | - |
| isi.contributor.surname | Venturi | - |
| isi.date.issued | 2024 | * |
| isi.description.abstracteng | PurposeThe authors' goal is to investigate variations in the writing style of book reviews published on different social reading platforms and referring to books of different genres, which enables acquiring insights into communication strategies adopted by readers to share their reading experiences.Design/methodology/approachThe authors propose a corpus-based study focused on the analysis of A Good Review, a novel corpus of online book reviews written in Italian, posted on Amazon and Goodreads, and covering six literary fiction genres. The authors rely on stylometric analysis to explore the linguistic properties and lexicon of reviews and the authors conducted automatic classification experiments using multiple approaches and feature configurations to predict either the review's platform or the literary genre.FindingsThe analysis of user-generated reviews demonstrates that language is a quite variable dimension across reading platforms, but not as much across book genres. The classification experiments revealed that features modelling the syntactic structure of the sentence are reliable proxies for discerning Amazon and Goodreads reviews, whereas lexical information showed a higher predictive role for automatically discriminating the genre.Originality/valueThe high availability of cultural products makes information services necessary to help users navigate these resources and acquire information from unstructured data. This study contributes to a better understanding of the linguistic characteristics of user-generated book reviews, which can support the development of linguistically-informed recommendation services. Additionally, the authors release a novel corpus of online book reviews meant to support the reproducibility and advancements of the research. | * |
| isi.description.allpeopleoriginal | Alzetta, C; Dell'Orletta, F; Miaschi, A; Prat, E; Venturi, G; | * |
| isi.document.sourcetype | WOS.SSCI | * |
| isi.document.type | Article | * |
| isi.document.types | Article | * |
| isi.identifier.doi | 10.1108/JD-04-2023-0073 | * |
| isi.identifier.eissn | 1758-7379 | * |
| isi.identifier.isi | WOS:001010945100001 | * |
| isi.journal.journaltitle | JOURNAL OF DOCUMENTATION | * |
| isi.journal.journaltitleabbrev | J DOC | * |
| isi.language.original | English | * |
| isi.publisher.place | Floor 5, Northspring 21-23 Wellington Street, Leeds, W YORKSHIRE, ENGLAND | * |
| isi.relation.firstpage | 180 | * |
| isi.relation.issue | 1 | * |
| isi.relation.lastpage | 202 | * |
| isi.relation.volume | 80 | * |
| isi.title | Tell me how you write and I'll tell you what you read: a study on the writing style of book reviews | * |
| scopus.authority.ancejournal | JOURNAL OF DOCUMENTATION###0022-0418 | * |
| scopus.category | 1710 | * |
| scopus.category | 3309 | * |
| scopus.contributor.affiliation | Istituto di Linguistica Computazionale Antonio Zampolli Consiglio Nazionale Delle Ricerche | - |
| scopus.contributor.affiliation | Istituto di Linguistica Computazionale Antonio Zampolli Consiglio Nazionale Delle Ricerche | - |
| scopus.contributor.affiliation | Istituto di Linguistica Computazionale Antonio Zampolli Consiglio Nazionale Delle Ricerche | - |
| scopus.contributor.affiliation | Le Mans Universite | - |
| scopus.contributor.affiliation | Istituto di Linguistica Computazionale Antonio Zampolli Consiglio Nazionale Delle Ricerche | - |
| scopus.contributor.afid | 60008941 | - |
| scopus.contributor.afid | 60008941 | - |
| scopus.contributor.afid | 60008941 | - |
| scopus.contributor.afid | 60004848 | - |
| scopus.contributor.afid | 60008941 | - |
| scopus.contributor.auid | 57192938832 | - |
| scopus.contributor.auid | 57540567000 | - |
| scopus.contributor.auid | 57211678681 | - |
| scopus.contributor.auid | 58318779600 | - |
| scopus.contributor.auid | 27568199800 | - |
| scopus.contributor.country | Italy | - |
| scopus.contributor.country | Italy | - |
| scopus.contributor.country | Italy | - |
| scopus.contributor.country | France | - |
| scopus.contributor.country | Italy | - |
| scopus.contributor.dptid | - | |
| scopus.contributor.dptid | - | |
| scopus.contributor.dptid | - | |
| scopus.contributor.dptid | - | |
| scopus.contributor.dptid | - | |
| scopus.contributor.name | Chiara | - |
| scopus.contributor.name | Felice | - |
| scopus.contributor.name | Alessio | - |
| scopus.contributor.name | Elena | - |
| scopus.contributor.name | Giulia | - |
| scopus.contributor.subaffiliation | - | |
| scopus.contributor.subaffiliation | - | |
| scopus.contributor.subaffiliation | - | |
| scopus.contributor.subaffiliation | - | |
| scopus.contributor.subaffiliation | - | |
| scopus.contributor.surname | Alzetta | - |
| scopus.contributor.surname | Dell'Orletta | - |
| scopus.contributor.surname | Miaschi | - |
| scopus.contributor.surname | Prat | - |
| scopus.contributor.surname | Venturi | - |
| scopus.date.issued | 2024 | * |
| scopus.description.abstracteng | Purpose: The authors’ goal is to investigate variations in the writing style of book reviews published on different social reading platforms and referring to books of different genres, which enables acquiring insights into communication strategies adopted by readers to share their reading experiences. Design/methodology/approach: The authors propose a corpus-based study focused on the analysis of A Good Review, a novel corpus of online book reviews written in Italian, posted on Amazon and Goodreads, and covering six literary fiction genres. The authors rely on stylometric analysis to explore the linguistic properties and lexicon of reviews and the authors conducted automatic classification experiments using multiple approaches and feature configurations to predict either the review's platform or the literary genre. Findings: The analysis of user-generated reviews demonstrates that language is a quite variable dimension across reading platforms, but not as much across book genres. The classification experiments revealed that features modelling the syntactic structure of the sentence are reliable proxies for discerning Amazon and Goodreads reviews, whereas lexical information showed a higher predictive role for automatically discriminating the genre. Originality/value: The high availability of cultural products makes information services necessary to help users navigate these resources and acquire information from unstructured data. This study contributes to a better understanding of the linguistic characteristics of user-generated book reviews, which can support the development of linguistically-informed recommendation services. Additionally, the authors release a novel corpus of online book reviews meant to support the reproducibility and advancements of the research. | * |
| scopus.description.allpeopleoriginal | Alzetta C.; Dell'Orletta F.; Miaschi A.; Prat E.; Venturi G. | * |
| scopus.differences | scopus.relation.lastpage | * |
| scopus.differences | scopus.subject.keywords | * |
| scopus.differences | scopus.relation.firstpage | * |
| scopus.differences | scopus.description.allpeopleoriginal | * |
| scopus.differences | scopus.description.abstracteng | * |
| scopus.differences | scopus.relation.issue | * |
| scopus.differences | scopus.date.issued | * |
| scopus.differences | scopus.relation.volume | * |
| scopus.document.type | ar | * |
| scopus.document.types | ar | * |
| scopus.funding.funders | 501100009888 - Regione Toscana; | * |
| scopus.identifier.doi | 10.1108/JD-04-2023-0073 | * |
| scopus.identifier.pui | 2023922005 | * |
| scopus.identifier.scopus | 2-s2.0-85162196949 | * |
| scopus.journal.sourceid | 12794 | * |
| scopus.language.iso | eng | * |
| scopus.publisher.name | Emerald Publishing | * |
| scopus.relation.firstpage | 180 | * |
| scopus.relation.issue | 1 | * |
| scopus.relation.lastpage | 202 | * |
| scopus.relation.volume | 80 | * |
| scopus.subject.keywords | Book reviews; Computational linguistics; Genre detection; Machine learning; Reading platform; Stylometric analysis; | * |
| scopus.title | Tell me how you write and I'll tell you what you read: a study on the writing style of book reviews | * |
| scopus.titleeng | Tell me how you write and I'll tell you what you read: a study on the writing style of book reviews | * |
| Appare nelle tipologie: | 01.01 Articolo in rivista | |
| File | Dimensione | Formato | |
|---|---|---|---|
|
10-1108_jd-04-2023-0073.pdf
solo utenti autorizzati
Tipologia:
Versione Editoriale (PDF)
Licenza:
NON PUBBLICO - Accesso privato/ristretto
Dimensione
1.98 MB
Formato
Adobe PDF
|
1.98 MB | Adobe PDF | Visualizza/Apri Richiedi una copia |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.


