The Style of a Successful Story: a Computational Study on the Fanfiction Genre
Andrea Mattei; Dominique Brunato; Felice Dell'Orletta
2020
Abstract
This paper presents a new corpus for the Italian language representative of the fan-fiction genre. It comprises about 55k user-generated stories inspired by the original fantasy saga "Harry Potter" and published on a popular website. The corpus is large enough to support data-driven investigations in many directions, from traditional studies on language variation, aimed at characterizing this genre with respect to more established ones, to emerging topics in computational social science such as the identification of factors involved in the success of a story. The latter is the focus of the presented case study, in which a wide set of multi-level linguistic features was automatically extracted from a subset of the corpus and analysed to detect those that significantly discriminate successful from unsuccessful stories.

| DC field | Value | Language |
|---|---|---|
| dc.authority.orgunit | Istituto di linguistica computazionale "Antonio Zampolli" - ILC | - |
| dc.authority.people | Andrea Mattei | it |
| dc.authority.people | Dominique Brunato | it |
| dc.authority.people | Felice Dell'Orletta | it |
| dc.collection.id.s | 71c7200a-7c5f-4e83-8d57-d3d2ba88f40d | * |
| dc.collection.name | 04.01 Contributo in Atti di convegno | * |
| dc.contributor.appartenenza | Istituto di linguistica computazionale "Antonio Zampolli" - ILC | * |
| dc.contributor.appartenenza.mi | 918 | * |
| dc.date.accessioned | 2024/02/20 22:45:27 | - |
| dc.date.available | 2024/02/20 22:45:27 | - |
| dc.date.issued | 2020 | - |
| dc.description.abstracteng | This paper presents a new corpus for the Italian language representative of the fan-fiction genre. It comprises about 55k user-generated stories inspired by the original fantasy saga "Harry Potter" and published on a popular website. The corpus is large enough to support data-driven investigations in many directions, from traditional studies on language variation, aimed at characterizing this genre with respect to more established ones, to emerging topics in computational social science such as the identification of factors involved in the success of a story. The latter is the focus of the presented case study, in which a wide set of multi-level linguistic features was automatically extracted from a subset of the corpus and analysed to detect those that significantly discriminate successful from unsuccessful stories. | - |
| dc.description.affiliations | University of Pisa; Istituto di Linguistica Computazionale "Antonio Zampolli" (ILC-CNR); Istituto di Linguistica Computazionale "Antonio Zampolli" (ILC-CNR) | - |
| dc.description.allpeople | Andrea Mattei; Dominique Brunato; Felice Dell'Orletta | - |
| dc.description.allpeopleoriginal | Andrea Mattei, Dominique Brunato, Felice Dell'Orletta | - |
| dc.description.fulltext | none | en |
| dc.description.numberofauthors | 3 | - |
| dc.identifier.uri | https://hdl.handle.net/20.500.14243/400967 | - |
| dc.language.iso | eng | - |
| dc.relation.conferencedate | 01-03/03/2021 | - |
| dc.relation.conferencename | Seventh Italian Conference on Computational Linguistics (CLiC-it 2020) | - |
| dc.relation.conferenceplace | online | - |
| dc.subject.keywords | natural language processing | - |
| dc.subject.keywords | Computational Sociolinguistics | - |
| dc.subject.keywords | stylistic analysis | - |
| dc.subject.singlekeyword | natural language processing | * |
| dc.subject.singlekeyword | Computational Sociolinguistics | * |
| dc.subject.singlekeyword | stylistic analysis | * |
| dc.title | The Style of a Successful Story: a Computational Study on the Fanfiction Genre | en |
| dc.type.driver | info:eu-repo/semantics/conferenceObject | - |
| dc.type.full | 04 Contributo in convegno::04.01 Contributo in Atti di convegno | it |
| dc.type.miur | 273 | - |
| dc.type.referee | Yes, but type not specified | - |
| dc.ugov.descaux1 | 450784 | - |
| iris.orcid.lastModifiedDate | 2024/03/02 05:23:27 | * |
| iris.orcid.lastModifiedMillisecond | 1709353407455 | * |
| iris.scopus.extIssued | 2020 | - |
| iris.scopus.extTitle | The style of a successful story: A computational study on the fanfiction genre | - |
| iris.sitodocente.maxattempts | 3 | - |
| Appears in types: | 04.01 Contributo in Atti di convegno | |


