Great authors of fiction and theatre have the capacity of creating memorable characters that take life and become almost as real as living persons to the readers/audience. The study of characterization, namely of how this is achieved, is a well-researched topic in corpus stylistics: for instance (Mahlberg, 2012) attempts to identify typical lexical patterns for memorable Dickens' characters by extracting those lexical bundles that stand out (namely are overrepresented) in comparison to a general corpus. In other works, authorship attribution methods are applied to the different characters of a play to identify whether the author has been able to provide each of them with a "distinct" voice. For instance (Vogel & Lynch, 2008) compare individual Shakespeare characters against the whole play or even against all plays of the same author. The purpose of this paper is to propose a methodology for the study characterization of several characters in French plays of the classical period. The tools developed are meant to support textual analysis by: 1) Verifying the degree of characterization of each character with respect to others. 2) Automatically inducing a list of linguistic features that are significant, representative for that character. Preliminary investigations have been conducted on plays by Moliere, cross-comparing four protagonists from four different plays. The proposed methodology relies on sequential data mining for the extraction of linguistic patterns and on correspondence analysis for comparison of patterns frequencies in each character and for the visual representation of such differences.

Linguistic Pattern Extraction and Analysis for Classic French Plays

Francesca Frontini;
2015

Abstract

Great authors of fiction and theatre have the capacity of creating memorable characters that take life and become almost as real as living persons to the readers/audience. The study of characterization, namely of how this is achieved, is a well-researched topic in corpus stylistics: for instance (Mahlberg, 2012) attempts to identify typical lexical patterns for memorable Dickens' characters by extracting those lexical bundles that stand out (namely are overrepresented) in comparison to a general corpus. In other works, authorship attribution methods are applied to the different characters of a play to identify whether the author has been able to provide each of them with a "distinct" voice. For instance (Vogel & Lynch, 2008) compare individual Shakespeare characters against the whole play or even against all plays of the same author. The purpose of this paper is to propose a methodology for the study characterization of several characters in French plays of the classical period. The tools developed are meant to support textual analysis by: 1) Verifying the degree of characterization of each character with respect to others. 2) Automatically inducing a list of linguistic features that are significant, representative for that character. Preliminary investigations have been conducted on plays by Moliere, cross-comparing four protagonists from four different plays. The proposed methodology relies on sequential data mining for the extraction of linguistic patterns and on correspondence analysis for comparison of patterns frequencies in each character and for the visual representation of such differences.
Campo DC Valore Lingua
dc.authority.orgunit Istituto di linguistica computazionale "Antonio Zampolli" - ILC -
dc.authority.people Francesca Frontini it
dc.authority.people Mohamed Amine Boukhaled it
dc.authority.people JeanGabriel Ganascia it
dc.collection.id.s 71c7200a-7c5f-4e83-8d57-d3d2ba88f40d *
dc.collection.name 04.01 Contributo in Atti di convegno *
dc.contributor.appartenenza Istituto di linguistica computazionale "Antonio Zampolli" - ILC *
dc.contributor.appartenenza.mi 918 *
dc.date.accessioned 2024/02/20 23:07:38 -
dc.date.available 2024/02/20 23:07:38 -
dc.date.issued 2015 -
dc.description.abstractita Great authors of fiction and theatre have the capacity of creating memorable characters that take life and become almost as real as living persons to the readers/audience. The study of characterization, namely of how this is achieved, is a well-researched topic in corpus stylistics: for instance (Mahlberg, 2012) attempts to identify typical lexical patterns for memorable Dickens' characters by extracting those lexical bundles that stand out (namely are overrepresented) in comparison to a general corpus. In other works, authorship attribution methods are applied to the different characters of a play to identify whether the author has been able to provide each of them with a "distinct" voice. For instance (Vogel & Lynch, 2008) compare individual Shakespeare characters against the whole play or even against all plays of the same author. The purpose of this paper is to propose a methodology for the study characterization of several characters in French plays of the classical period. The tools developed are meant to support textual analysis by: 1) Verifying the degree of characterization of each character with respect to others. 2) Automatically inducing a list of linguistic features that are significant, representative for that character. Preliminary investigations have been conducted on plays by Moliere, cross-comparing four protagonists from four different plays. The proposed methodology relies on sequential data mining for the extraction of linguistic patterns and on correspondence analysis for comparison of patterns frequencies in each character and for the visual representation of such differences. -
dc.description.affiliations LIP6 (Laboratoire d'Informatique de Paris 6), Université Pierre et Marie Curie and CNRS / OBVIL - Isituto di Linguistica Computazionale - CNR LIP6 (Laboratoire d'Informatique de Paris 6), Université Pierre et Marie Curie and CNRS / OBVIL LIP6 (Laboratoire d'Informatique de Paris 6), Université Pierre et Marie Curie and CNRS / OBVIL -
dc.description.allpeople Frontini, Francesca; Amine Boukhaled, Mohamed; Ganascia, Jeangabriel -
dc.description.allpeopleoriginal Francesca Frontini, Mohamed Amine Boukhaled, Jean-Gabriel Ganascia -
dc.description.fulltext none en
dc.description.numberofauthors 3 -
dc.identifier.uri https://hdl.handle.net/20.500.14243/276113 -
dc.identifier.url http://lipn.univ-paris13.fr/~charnois/conscilaGenres/resumes/frontini.pdf -
dc.language.iso eng -
dc.relation.conferencedate 16/01/2015 -
dc.relation.conferencename Journée ConSciLa (Confrontations en Sciences du Langage) Grammaire des genres et des styles : quelles approches privilégier ? -
dc.relation.conferenceplace Paris -
dc.relation.numberofpages 3 -
dc.subject.keywords computational stylometry -
dc.subject.keywords thater -
dc.subject.keywords sequential pattern mining -
dc.subject.singlekeyword computational stylometry *
dc.subject.singlekeyword thater *
dc.subject.singlekeyword sequential pattern mining *
dc.title Linguistic Pattern Extraction and Analysis for Classic French Plays en
dc.type.driver info:eu-repo/semantics/conferenceObject -
dc.type.full 04 Contributo in convegno::04.01 Contributo in Atti di convegno it
dc.type.miur 273 -
dc.type.referee Sì, ma tipo non specificato -
dc.ugov.descaux1 307909 -
iris.orcid.lastModifiedDate 2024/04/04 16:20:32 *
iris.orcid.lastModifiedMillisecond 1712240432978 *
iris.sitodocente.maxattempts 1 -
Appare nelle tipologie: 04.01 Contributo in Atti di convegno
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/276113
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact