In this paper we report the first results of an annotation exercise of argument coercion phenomena performed on Italian texts. Our corpus consists of ca 4000 sentences from the PAROLE sottoinsieme corpus (Bindi et al. 2000) annotated with Selection and Coercion relations among verb-noun pairs formatted in XML according to the Generative Lexicon Mark-up Language (GLML) format (Pustejovsky et al., 2008). For the purposes of coercion annotation, we selected 26 Italian verbs that impose semantic typing on their arguments in either Subject, Direct Object or Complement position. Every sentence of the corpus is annotated with the source type for the noun arguments by two annotators plus a judge. An overall agreement of 0.87 kappa indicates that the annotation methodology is reliable. A qualitative analysis of the results allows us to outline some suggestions for improvement of the task: 1) a different account of complex types for nouns has to be devised and 2) a more comprehensive account of coercion mechanisms requires annotation of the deeper meaning dimensions that are targeted in coercion operations, such as those captured by Qualia relations.
Capturing Coercions in Texts: a First Annotation Exercise
Quochi V
2010
Abstract
In this paper we report the first results of an annotation exercise of argument coercion phenomena performed on Italian texts. Our corpus consists of ca 4000 sentences from the PAROLE sottoinsieme corpus (Bindi et al. 2000) annotated with Selection and Coercion relations among verb-noun pairs formatted in XML according to the Generative Lexicon Mark-up Language (GLML) format (Pustejovsky et al., 2008). For the purposes of coercion annotation, we selected 26 Italian verbs that impose semantic typing on their arguments in either Subject, Direct Object or Complement position. Every sentence of the corpus is annotated with the source type for the noun arguments by two annotators plus a judge. An overall agreement of 0.87 kappa indicates that the annotation methodology is reliable. A qualitative analysis of the results allows us to outline some suggestions for improvement of the task: 1) a different account of complex types for nouns has to be devised and 2) a more comprehensive account of coercion mechanisms requires annotation of the deeper meaning dimensions that are targeted in coercion operations, such as those captured by Qualia relations.| Campo DC | Valore | Lingua |
|---|---|---|
| dc.authority.orgunit | Istituto di linguistica computazionale "Antonio Zampolli" - ILC | - |
| dc.authority.people | Jezek E | it |
| dc.authority.people | Quochi V | it |
| dc.collection.id.s | 71c7200a-7c5f-4e83-8d57-d3d2ba88f40d | * |
| dc.collection.name | 04.01 Contributo in Atti di convegno | * |
| dc.contributor.appartenenza | Istituto di linguistica computazionale "Antonio Zampolli" - ILC | * |
| dc.contributor.appartenenza.mi | 918 | * |
| dc.date.accessioned | 2024/02/19 20:02:57 | - |
| dc.date.available | 2024/02/19 20:02:57 | - |
| dc.date.issued | 2010 | - |
| dc.description.abstracteng | In this paper we report the first results of an annotation exercise of argument coercion phenomena performed on Italian texts. Our corpus consists of ca 4000 sentences from the PAROLE sottoinsieme corpus (Bindi et al. 2000) annotated with Selection and Coercion relations among verb-noun pairs formatted in XML according to the Generative Lexicon Mark-up Language (GLML) format (Pustejovsky et al., 2008). For the purposes of coercion annotation, we selected 26 Italian verbs that impose semantic typing on their arguments in either Subject, Direct Object or Complement position. Every sentence of the corpus is annotated with the source type for the noun arguments by two annotators plus a judge. An overall agreement of 0.87 kappa indicates that the annotation methodology is reliable. A qualitative analysis of the results allows us to outline some suggestions for improvement of the task: 1) a different account of complex types for nouns has to be devised and 2) a more comprehensive account of coercion mechanisms requires annotation of the deeper meaning dimensions that are targeted in coercion operations, such as those captured by Qualia relations. | - |
| dc.description.affiliations | Department of Theoretical and Applied Linguistics, University of Pavia, ILC-CNR, Pisa | - |
| dc.description.allpeople | Jezek E.; Quochi V. | - |
| dc.description.allpeopleoriginal | Jezek E.; Quochi V. | - |
| dc.description.fulltext | none | en |
| dc.description.numberofauthors | 1 | - |
| dc.identifier.isbn | 2-9517408-6-7 | - |
| dc.identifier.isi | WOS:000356879506038 | - |
| dc.identifier.scopus | 2-s2.0-85037121255 | - |
| dc.identifier.uri | https://hdl.handle.net/20.500.14243/65150 | - |
| dc.identifier.url | http://www.lrec-conf.org/proceedings/lrec2010/summaries/713.html | - |
| dc.language.iso | eng | - |
| dc.publisher.country | FRA | - |
| dc.publisher.name | European Language Resources Association ELRA | - |
| dc.publisher.place | Paris | - |
| dc.relation.alleditors | Nicoletta Calzolari, Khalid Choukri, Bente Maegaard, Joseph Mariani, Jan Odjik, Stelios Piperidis, Mike Rosner, Daniel Tapias | - |
| dc.relation.conferencedate | 17-23 Maggio 2010 | - |
| dc.relation.conferencename | Seventh International Conference on Language Resources and Evaluation | - |
| dc.relation.conferenceplace | Valletta, Malta | - |
| dc.relation.firstpage | 1464 | - |
| dc.relation.ispartofbook | Proceedings of the Seventh International Conference on Language Resources and Evaluation - LREC'10 | - |
| dc.relation.lastpage | 1471 | - |
| dc.subject.keywords | Corpus (creation | - |
| dc.subject.keywords | annotation | - |
| dc.subject.keywords | etc.) | - |
| dc.subject.keywords | Knowledge Discovery/Representation | - |
| dc.subject.keywords | Semantics | - |
| dc.subject.singlekeyword | Corpus (creation | * |
| dc.subject.singlekeyword | annotation | * |
| dc.subject.singlekeyword | etc.) | * |
| dc.subject.singlekeyword | Knowledge Discovery/Representation | * |
| dc.subject.singlekeyword | Semantics | * |
| dc.title | Capturing Coercions in Texts: a First Annotation Exercise | en |
| dc.type.driver | info:eu-repo/semantics/conferenceObject | - |
| dc.type.full | 04 Contributo in convegno::04.01 Contributo in Atti di convegno | it |
| dc.type.miur | 273 | - |
| dc.type.referee | Sì, ma tipo non specificato | - |
| dc.ugov.descaux1 | 84783 | - |
| iris.isi.extIssued | 2010 | - |
| iris.isi.extTitle | Capturing Coercions in Texts: a First Annotation Exercise | - |
| iris.orcid.lastModifiedDate | 2025/03/30 01:58:42 | * |
| iris.orcid.lastModifiedMillisecond | 1743296322282 | * |
| iris.scopus.extIssued | 2010 | - |
| iris.scopus.extTitle | Capturing coercions in texts: A first annotation exercise | - |
| iris.scopus.ideLinkStatusDate | 2024/04/10 09:22:14 | * |
| iris.scopus.ideLinkStatusMillisecond | 1712733734224 | * |
| iris.sitodocente.maxattempts | 3 | - |
| isi.category | OY | * |
| isi.contributor.affiliation | University of Pavia | - |
| isi.contributor.affiliation | Consiglio Nazionale delle Ricerche (CNR) | - |
| isi.contributor.country | Italy | - |
| isi.contributor.country | Italy | - |
| isi.contributor.name | Elisabetta | - |
| isi.contributor.name | Valeria | - |
| isi.contributor.researcherId | DXH-3537-2022 | - |
| isi.contributor.researcherId | E-7468-2011 | - |
| isi.contributor.subaffiliation | Dept Theoret & Appl Linguist | - |
| isi.contributor.subaffiliation | Ist Linguist Computaz | - |
| isi.contributor.surname | Jezek | - |
| isi.contributor.surname | Quochi | - |
| isi.date.issued | 2010 | * |
| isi.description.abstracteng | In this paper we report the first results of an annotation exercise of argument coercion phenomena performed on Italian texts. Our corpus consists of ca 4000 sentences from the PAROLE sottoinsieme corpus (Bindi et al. 2000) annotated with Selection and Coercion relations among verb-noun pairs formatted in XML according to the Generative Lexicon Mark-up Language (GLML) format (Pustejovsky et al., 2008). For the purposes of coercion annotation, we selected 26 Italian verbs that impose semantic typing on their arguments in either Subject, Direct Object or Complement position. Every sentence of the corpus contains information about corpus-derived typed selectional preferences for verbs in the targeted argument slots and is annotated with the source type for the noun arguments by two annotators plus a judge. An overall agreement of 0.87 kappa indicates that the annotation methodology is reliable. A qualitative analysis of the results allows us to outline some suggestions for improvement of the task: 1) a different account of inherently polysemous nouns has to be devised and 2) a more comprehensive account of coercion mechanisms requires annotation of the deeper meaning dimensions that are targeted in coercion operations, such as those captured by Qualia relations. | * |
| isi.description.allpeopleoriginal | Jezek, E; Quochi, V; | * |
| isi.document.sourcetype | WOS.ISSHP | * |
| isi.document.type | Proceedings Paper | * |
| isi.document.types | Proceedings Paper | * |
| isi.identifier.isi | WOS:000356879506038 | * |
| isi.journal.journaltitle | LREC 2010 - SEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION | * |
| isi.language.original | English | * |
| isi.publisher.place | 55-57, RUE BRILLAT-SAVARIN, PARIS, 75013, FRANCE | * |
| isi.relation.firstpage | 1464 | * |
| isi.relation.lastpage | 1471 | * |
| isi.title | Capturing Coercions in Texts: a First Annotation Exercise | * |
| scopus.category | 3304 | * |
| scopus.category | 1203 | * |
| scopus.category | 3310 | * |
| scopus.category | 3309 | * |
| scopus.contributor.affiliation | University of Pavia | - |
| scopus.contributor.affiliation | CNR | - |
| scopus.contributor.afid | 60015197 | - |
| scopus.contributor.afid | 60008941 | - |
| scopus.contributor.auid | 35573787800 | - |
| scopus.contributor.auid | 34977412400 | - |
| scopus.contributor.country | Italy | - |
| scopus.contributor.country | Italy | - |
| scopus.contributor.dptid | 113821516 | - |
| scopus.contributor.dptid | - | |
| scopus.contributor.name | Elisabetta | - |
| scopus.contributor.name | Valeria | - |
| scopus.contributor.subaffiliation | Department of Theoretical and Applied Linguistics; | - |
| scopus.contributor.subaffiliation | Istituto di Linguistica Computazionale; | - |
| scopus.contributor.surname | Jezek | - |
| scopus.contributor.surname | Quochi | - |
| scopus.date.issued | 2010 | * |
| scopus.description.abstracteng | In this paper we report the first results of an annotation exercise of argument coercion phenomena performed on Italian texts. Our corpus consists of ca 4000 sentences from the PAROLE sottoinsieme corpus (Bindi et al. 2000) annotated with Selection and Coercion relations among verb-noun pairs formatted in XML according to the Generative Lexicon Mark-up Language (GLML) format (Pustejovsky et al., 2008). For the purposes of coercion annotation, we selected 26 Italian verbs that impose semantic typing on their arguments in either Subject, Direct Object or Complement position. Every sentence of the corpus contains information about corpus-derived typed selectional preferences for verbs in the targeted argument slots and is annotated with the source type for the noun arguments by two annotators plus a judge. An overall agreement of 0.87 kappa indicates that the annotation methodology is reliable. A qualitative analysis of the results allows us to outline some suggestions for improvement of the task: 1) a different account of inherently polysemous nouns has to be devised and 2) a more comprehensive account of coercion mechanisms requires annotation of the deeper meaning dimensions that are targeted in coercion operations, such as those captured by Qualia relations. | * |
| scopus.description.allpeopleoriginal | Jezek E.; Quochi V. | * |
| scopus.differences | scopus.relation.conferencename | * |
| scopus.differences | scopus.publisher.name | * |
| scopus.differences | scopus.relation.conferencedate | * |
| scopus.differences | scopus.identifier.isbn | * |
| scopus.differences | scopus.description.abstracteng | * |
| scopus.differences | scopus.relation.conferenceplace | * |
| scopus.document.type | cp | * |
| scopus.document.types | cp | * |
| scopus.identifier.isbn | 9782951740860 | * |
| scopus.identifier.pui | 619603711 | * |
| scopus.identifier.scopus | 2-s2.0-85037121255 | * |
| scopus.journal.sourceid | 21100842263 | * |
| scopus.language.iso | eng | * |
| scopus.publisher.name | European Language Resources Association (ELRA) | * |
| scopus.relation.conferencedate | 2010 | * |
| scopus.relation.conferencename | 7th International Conference on Language Resources and Evaluation, LREC 2010 | * |
| scopus.relation.conferenceplace | Mediterranean Conference Centre, mlt | * |
| scopus.relation.firstpage | 1464 | * |
| scopus.relation.lastpage | 1471 | * |
| scopus.title | Capturing coercions in texts: A first annotation exercise | * |
| scopus.titleeng | Capturing coercions in texts: A first annotation exercise | * |
| Appare nelle tipologie: | 04.01 Contributo in Atti di convegno | |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.


