In this paper we report the first results of an annotation exercise of argument coercion phenomena performed on Italian texts. Our corpus consists of ca 4000 sentences from the PAROLE sottoinsieme corpus (Bindi et al. 2000) annotated with Selection and Coercion relations among verb-noun pairs formatted in XML according to the Generative Lexicon Mark-up Language (GLML) format (Pustejovsky et al., 2008). For the purposes of coercion annotation, we selected 26 Italian verbs that impose semantic typing on their arguments in either Subject, Direct Object or Complement position. Every sentence of the corpus is annotated with the source type for the noun arguments by two annotators plus a judge. An overall agreement of 0.87 kappa indicates that the annotation methodology is reliable. A qualitative analysis of the results allows us to outline some suggestions for improvement of the task: 1) a different account of complex types for nouns has to be devised and 2) a more comprehensive account of coercion mechanisms requires annotation of the deeper meaning dimensions that are targeted in coercion operations, such as those captured by Qualia relations.

Capturing Coercions in Texts: a First Annotation Exercise

Quochi V
2010

Abstract

In this paper we report the first results of an annotation exercise of argument coercion phenomena performed on Italian texts. Our corpus consists of ca 4000 sentences from the PAROLE sottoinsieme corpus (Bindi et al. 2000) annotated with Selection and Coercion relations among verb-noun pairs formatted in XML according to the Generative Lexicon Mark-up Language (GLML) format (Pustejovsky et al., 2008). For the purposes of coercion annotation, we selected 26 Italian verbs that impose semantic typing on their arguments in either Subject, Direct Object or Complement position. Every sentence of the corpus is annotated with the source type for the noun arguments by two annotators plus a judge. An overall agreement of 0.87 kappa indicates that the annotation methodology is reliable. A qualitative analysis of the results allows us to outline some suggestions for improvement of the task: 1) a different account of complex types for nouns has to be devised and 2) a more comprehensive account of coercion mechanisms requires annotation of the deeper meaning dimensions that are targeted in coercion operations, such as those captured by Qualia relations.
Campo DC Valore Lingua
dc.authority.orgunit Istituto di linguistica computazionale "Antonio Zampolli" - ILC -
dc.authority.people Jezek E it
dc.authority.people Quochi V it
dc.collection.id.s 71c7200a-7c5f-4e83-8d57-d3d2ba88f40d *
dc.collection.name 04.01 Contributo in Atti di convegno *
dc.contributor.appartenenza Istituto di linguistica computazionale "Antonio Zampolli" - ILC *
dc.contributor.appartenenza.mi 918 *
dc.date.accessioned 2024/02/19 20:02:57 -
dc.date.available 2024/02/19 20:02:57 -
dc.date.issued 2010 -
dc.description.abstracteng In this paper we report the first results of an annotation exercise of argument coercion phenomena performed on Italian texts. Our corpus consists of ca 4000 sentences from the PAROLE sottoinsieme corpus (Bindi et al. 2000) annotated with Selection and Coercion relations among verb-noun pairs formatted in XML according to the Generative Lexicon Mark-up Language (GLML) format (Pustejovsky et al., 2008). For the purposes of coercion annotation, we selected 26 Italian verbs that impose semantic typing on their arguments in either Subject, Direct Object or Complement position. Every sentence of the corpus is annotated with the source type for the noun arguments by two annotators plus a judge. An overall agreement of 0.87 kappa indicates that the annotation methodology is reliable. A qualitative analysis of the results allows us to outline some suggestions for improvement of the task: 1) a different account of complex types for nouns has to be devised and 2) a more comprehensive account of coercion mechanisms requires annotation of the deeper meaning dimensions that are targeted in coercion operations, such as those captured by Qualia relations. -
dc.description.affiliations Department of Theoretical and Applied Linguistics, University of Pavia, ILC-CNR, Pisa -
dc.description.allpeople Jezek E.; Quochi V. -
dc.description.allpeopleoriginal Jezek E.; Quochi V. -
dc.description.fulltext none en
dc.description.numberofauthors 1 -
dc.identifier.isbn 2-9517408-6-7 -
dc.identifier.isi WOS:000356879506038 -
dc.identifier.scopus 2-s2.0-85037121255 -
dc.identifier.uri https://hdl.handle.net/20.500.14243/65150 -
dc.identifier.url http://www.lrec-conf.org/proceedings/lrec2010/summaries/713.html -
dc.language.iso eng -
dc.publisher.country FRA -
dc.publisher.name European Language Resources Association ELRA -
dc.publisher.place Paris -
dc.relation.alleditors Nicoletta Calzolari, Khalid Choukri, Bente Maegaard, Joseph Mariani, Jan Odjik, Stelios Piperidis, Mike Rosner, Daniel Tapias -
dc.relation.conferencedate 17-23 Maggio 2010 -
dc.relation.conferencename Seventh International Conference on Language Resources and Evaluation -
dc.relation.conferenceplace Valletta, Malta -
dc.relation.firstpage 1464 -
dc.relation.ispartofbook Proceedings of the Seventh International Conference on Language Resources and Evaluation - LREC'10 -
dc.relation.lastpage 1471 -
dc.subject.keywords Corpus (creation -
dc.subject.keywords annotation -
dc.subject.keywords etc.) -
dc.subject.keywords Knowledge Discovery/Representation -
dc.subject.keywords Semantics -
dc.subject.singlekeyword Corpus (creation *
dc.subject.singlekeyword annotation *
dc.subject.singlekeyword etc.) *
dc.subject.singlekeyword Knowledge Discovery/Representation *
dc.subject.singlekeyword Semantics *
dc.title Capturing Coercions in Texts: a First Annotation Exercise en
dc.type.driver info:eu-repo/semantics/conferenceObject -
dc.type.full 04 Contributo in convegno::04.01 Contributo in Atti di convegno it
dc.type.miur 273 -
dc.type.referee Sì, ma tipo non specificato -
dc.ugov.descaux1 84783 -
iris.isi.extIssued 2010 -
iris.isi.extTitle Capturing Coercions in Texts: a First Annotation Exercise -
iris.orcid.lastModifiedDate 2025/03/30 01:58:42 *
iris.orcid.lastModifiedMillisecond 1743296322282 *
iris.scopus.extIssued 2010 -
iris.scopus.extTitle Capturing coercions in texts: A first annotation exercise -
iris.scopus.ideLinkStatusDate 2024/04/10 09:22:14 *
iris.scopus.ideLinkStatusMillisecond 1712733734224 *
iris.sitodocente.maxattempts 3 -
isi.category OY *
isi.contributor.affiliation University of Pavia -
isi.contributor.affiliation Consiglio Nazionale delle Ricerche (CNR) -
isi.contributor.country Italy -
isi.contributor.country Italy -
isi.contributor.name Elisabetta -
isi.contributor.name Valeria -
isi.contributor.researcherId DXH-3537-2022 -
isi.contributor.researcherId E-7468-2011 -
isi.contributor.subaffiliation Dept Theoret & Appl Linguist -
isi.contributor.subaffiliation Ist Linguist Computaz -
isi.contributor.surname Jezek -
isi.contributor.surname Quochi -
isi.date.issued 2010 *
isi.description.abstracteng In this paper we report the first results of an annotation exercise of argument coercion phenomena performed on Italian texts. Our corpus consists of ca 4000 sentences from the PAROLE sottoinsieme corpus (Bindi et al. 2000) annotated with Selection and Coercion relations among verb-noun pairs formatted in XML according to the Generative Lexicon Mark-up Language (GLML) format (Pustejovsky et al., 2008). For the purposes of coercion annotation, we selected 26 Italian verbs that impose semantic typing on their arguments in either Subject, Direct Object or Complement position. Every sentence of the corpus contains information about corpus-derived typed selectional preferences for verbs in the targeted argument slots and is annotated with the source type for the noun arguments by two annotators plus a judge. An overall agreement of 0.87 kappa indicates that the annotation methodology is reliable. A qualitative analysis of the results allows us to outline some suggestions for improvement of the task: 1) a different account of inherently polysemous nouns has to be devised and 2) a more comprehensive account of coercion mechanisms requires annotation of the deeper meaning dimensions that are targeted in coercion operations, such as those captured by Qualia relations. *
isi.description.allpeopleoriginal Jezek, E; Quochi, V; *
isi.document.sourcetype WOS.ISSHP *
isi.document.type Proceedings Paper *
isi.document.types Proceedings Paper *
isi.identifier.isi WOS:000356879506038 *
isi.journal.journaltitle LREC 2010 - SEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION *
isi.language.original English *
isi.publisher.place 55-57, RUE BRILLAT-SAVARIN, PARIS, 75013, FRANCE *
isi.relation.firstpage 1464 *
isi.relation.lastpage 1471 *
isi.title Capturing Coercions in Texts: a First Annotation Exercise *
scopus.category 3304 *
scopus.category 1203 *
scopus.category 3310 *
scopus.category 3309 *
scopus.contributor.affiliation University of Pavia -
scopus.contributor.affiliation CNR -
scopus.contributor.afid 60015197 -
scopus.contributor.afid 60008941 -
scopus.contributor.auid 35573787800 -
scopus.contributor.auid 34977412400 -
scopus.contributor.country Italy -
scopus.contributor.country Italy -
scopus.contributor.dptid 113821516 -
scopus.contributor.dptid -
scopus.contributor.name Elisabetta -
scopus.contributor.name Valeria -
scopus.contributor.subaffiliation Department of Theoretical and Applied Linguistics; -
scopus.contributor.subaffiliation Istituto di Linguistica Computazionale; -
scopus.contributor.surname Jezek -
scopus.contributor.surname Quochi -
scopus.date.issued 2010 *
scopus.description.abstracteng In this paper we report the first results of an annotation exercise of argument coercion phenomena performed on Italian texts. Our corpus consists of ca 4000 sentences from the PAROLE sottoinsieme corpus (Bindi et al. 2000) annotated with Selection and Coercion relations among verb-noun pairs formatted in XML according to the Generative Lexicon Mark-up Language (GLML) format (Pustejovsky et al., 2008). For the purposes of coercion annotation, we selected 26 Italian verbs that impose semantic typing on their arguments in either Subject, Direct Object or Complement position. Every sentence of the corpus contains information about corpus-derived typed selectional preferences for verbs in the targeted argument slots and is annotated with the source type for the noun arguments by two annotators plus a judge. An overall agreement of 0.87 kappa indicates that the annotation methodology is reliable. A qualitative analysis of the results allows us to outline some suggestions for improvement of the task: 1) a different account of inherently polysemous nouns has to be devised and 2) a more comprehensive account of coercion mechanisms requires annotation of the deeper meaning dimensions that are targeted in coercion operations, such as those captured by Qualia relations. *
scopus.description.allpeopleoriginal Jezek E.; Quochi V. *
scopus.differences scopus.relation.conferencename *
scopus.differences scopus.publisher.name *
scopus.differences scopus.relation.conferencedate *
scopus.differences scopus.identifier.isbn *
scopus.differences scopus.description.abstracteng *
scopus.differences scopus.relation.conferenceplace *
scopus.document.type cp *
scopus.document.types cp *
scopus.identifier.isbn 9782951740860 *
scopus.identifier.pui 619603711 *
scopus.identifier.scopus 2-s2.0-85037121255 *
scopus.journal.sourceid 21100842263 *
scopus.language.iso eng *
scopus.publisher.name European Language Resources Association (ELRA) *
scopus.relation.conferencedate 2010 *
scopus.relation.conferencename 7th International Conference on Language Resources and Evaluation, LREC 2010 *
scopus.relation.conferenceplace Mediterranean Conference Centre, mlt *
scopus.relation.firstpage 1464 *
scopus.relation.lastpage 1471 *
scopus.title Capturing coercions in texts: A first annotation exercise *
scopus.titleeng Capturing coercions in texts: A first annotation exercise *
Appare nelle tipologie: 04.01 Contributo in Atti di convegno
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/65150
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 9
  • ???jsp.display-item.citation.isi??? 1
social impact