This paper presents a new corpus for the Italian language representative of the fan-fiction genre. It comprises about 55k user-generated stories inspired to the original fantasy saga "Harry Potter" and published on a popular website. The corpus is large enough to support data-driven investigations in many directions, from more traditional studies on language variation aimed at characterizing this genre with respect to more traditional ones, to emerging topics in computational social science such as the identification of factors involved in the success of a story. The latter is the focus of the presented case-study, in which a wide set of multi-level linguistic features has been automatically extracted from a subset of the corpus and analysed in order to detect the ones which significantly discriminate successful from unsuccessful stories

The Style of a Successful Story: a Computational Study on the Fanfiction Genre

Dominique Brunato;Felice Dell'Orletta
2020

Abstract

This paper presents a new corpus for the Italian language representative of the fan-fiction genre. It comprises about 55k user-generated stories inspired to the original fantasy saga "Harry Potter" and published on a popular website. The corpus is large enough to support data-driven investigations in many directions, from more traditional studies on language variation aimed at characterizing this genre with respect to more traditional ones, to emerging topics in computational social science such as the identification of factors involved in the success of a story. The latter is the focus of the presented case-study, in which a wide set of multi-level linguistic features has been automatically extracted from a subset of the corpus and analysed in order to detect the ones which significantly discriminate successful from unsuccessful stories
Campo DC Valore Lingua
dc.authority.orgunit Istituto di linguistica computazionale "Antonio Zampolli" - ILC -
dc.authority.people Andrea Mattei it
dc.authority.people Dominique Brunato it
dc.authority.people Felice Dell'Orletta it
dc.collection.id.s 71c7200a-7c5f-4e83-8d57-d3d2ba88f40d *
dc.collection.name 04.01 Contributo in Atti di convegno *
dc.contributor.appartenenza Istituto di linguistica computazionale "Antonio Zampolli" - ILC *
dc.contributor.appartenenza.mi 918 *
dc.date.accessioned 2024/02/20 22:45:27 -
dc.date.available 2024/02/20 22:45:27 -
dc.date.issued 2020 -
dc.description.abstracteng This paper presents a new corpus for the Italian language representative of the fan-fiction genre. It comprises about 55k user-generated stories inspired to the original fantasy saga "Harry Potter" and published on a popular website. The corpus is large enough to support data-driven investigations in many directions, from more traditional studies on language variation aimed at characterizing this genre with respect to more traditional ones, to emerging topics in computational social science such as the identification of factors involved in the success of a story. The latter is the focus of the presented case-study, in which a wide set of multi-level linguistic features has been automatically extracted from a subset of the corpus and analysed in order to detect the ones which significantly discriminate successful from unsuccessful stories -
dc.description.affiliations University of Pisa; stituto di Linguistica Computazionale "Antonio Zampolli" (ILC-CNR); stituto di Linguistica Computazionale "Antonio Zampolli" (ILC-CNR); -
dc.description.allpeople Andrea Mattei; Dominique Brunato; Felice Dell'Orletta -
dc.description.allpeopleoriginal Andrea Mattei, Dominique Brunato, Felice Dell'Orletta -
dc.description.fulltext none en
dc.description.numberofauthors 2 -
dc.identifier.uri https://hdl.handle.net/20.500.14243/400967 -
dc.language.iso eng -
dc.relation.conferencedate 01-03/03/2021 -
dc.relation.conferencename Seventh Italian Conference on Computational Linguistics (CLiC-it 2020) -
dc.relation.conferenceplace online -
dc.subject.keywords natural language processing -
dc.subject.keywords Computational Sociolinguistics -
dc.subject.keywords stylistic analysis -
dc.subject.singlekeyword natural language processing *
dc.subject.singlekeyword Computational Sociolinguistics *
dc.subject.singlekeyword stylistic analysis *
dc.title The Style of a Successful Story: a Computational Study on the Fanfiction Genre en
dc.type.driver info:eu-repo/semantics/conferenceObject -
dc.type.full 04 Contributo in convegno::04.01 Contributo in Atti di convegno it
dc.type.miur 273 -
dc.type.referee Sì, ma tipo non specificato -
dc.ugov.descaux1 450784 -
iris.orcid.lastModifiedDate 2024/03/02 05:23:27 *
iris.orcid.lastModifiedMillisecond 1709353407455 *
iris.scopus.extIssued 2020 -
iris.scopus.extTitle The style of a successful story: A computational study on the fanfiction genre -
iris.sitodocente.maxattempts 3 -
Appare nelle tipologie: 04.01 Contributo in Atti di convegno
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/400967
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact