Detection and correction of errors and inconsistencies in "gold treebanks" are becoming more and more central topics of corpus annotation. The paper illustrates a new incremental method for enhancing treebanks, with particular emphasis on the extension of error patterns across different textual genres and registers. Impact and role of corrections have been assessed in a dependency parsing experiment carried out with four different parsers, whose results are promising. For both evaluation datasets, the performance of parsers increases, in terms of the standard LAS and UAS measures and of a more focused measure taking into account only relations involved in error patterns, and at the level of individual dependencies.

Assessing the Impact of Iterative Error Detection and Correction. A Case Study on the Italian Universal Dependency Treebank

Alzetta C;Dell'Orletta F;Montemagni S;Venturi G
2018

Abstract

Detection and correction of errors and inconsistencies in "gold treebanks" are becoming more and more central topics of corpus annotation. The paper illustrates a new incremental method for enhancing treebanks, with particular emphasis on the extension of error patterns across different textual genres and registers. Impact and role of corrections have been assessed in a dependency parsing experiment carried out with four different parsers, whose results are promising. For both evaluation datasets, the performance of parsers increases, in terms of the standard LAS and UAS measures and of a more focused measure taking into account only relations involved in error patterns, and at the level of individual dependencies.
Campo DC Valore Lingua
dc.authority.orgunit Istituto di linguistica computazionale "Antonio Zampolli" - ILC -
dc.authority.people Alzetta C it
dc.authority.people Dell'Orletta F it
dc.authority.people Montemagni S it
dc.authority.people Simi M it
dc.authority.people Venturi G it
dc.collection.id.s 71c7200a-7c5f-4e83-8d57-d3d2ba88f40d *
dc.collection.name 04.01 Contributo in Atti di convegno *
dc.contributor.appartenenza Istituto di linguistica computazionale "Antonio Zampolli" - ILC *
dc.contributor.appartenenza.mi 918 *
dc.date.accessioned 2024/02/17 21:37:11 -
dc.date.available 2024/02/17 21:37:11 -
dc.date.issued 2018 -
dc.description.abstracteng Detection and correction of errors and inconsistencies in "gold treebanks" are becoming more and more central topics of corpus annotation. The paper illustrates a new incremental method for enhancing treebanks, with particular emphasis on the extension of error patterns across different textual genres and registers. Impact and role of corrections have been assessed in a dependency parsing experiment carried out with four different parsers, whose results are promising. For both evaluation datasets, the performance of parsers increases, in terms of the standard LAS and UAS measures and of a more focused measure taking into account only relations involved in error patterns, and at the level of individual dependencies. -
dc.description.affiliations Università di Genova; Istituto di Linguistica Computazionale; Università di Pisa -
dc.description.allpeople Alzetta, C; Dell'Orletta, F; Montemagni, S; Simi, M; Venturi, G -
dc.description.allpeopleoriginal Alzetta C., Dell'Orletta F., Montemagni S., Simi M., Venturi G. -
dc.description.fulltext none en
dc.description.numberofauthors 5 -
dc.identifier.isbn 978-1-948087-84-1 -
dc.identifier.uri https://hdl.handle.net/20.500.14243/371344 -
dc.identifier.url http://universaldependencies.org/udw18/PDFs/39_Paper.pdf -
dc.language.iso eng -
dc.miur.last.status.update 2024-05-16T16:05:48Z *
dc.relation.conferencedate 01/11/2018 -
dc.relation.conferencename Universal Dependencies Workshop 2018 (UDW 2018) -
dc.relation.conferenceplace Brussels -
dc.relation.firstpage 1 -
dc.relation.lastpage 7 -
dc.relation.numberofpages 7 -
dc.subject.keywords Error Detection -
dc.subject.keywords Universal Dependency Treebanks -
dc.subject.keywords Syntactic parsing -
dc.subject.singlekeyword Error Detection *
dc.subject.singlekeyword Universal Dependency Treebanks *
dc.subject.singlekeyword Syntactic parsing *
dc.title Assessing the Impact of Iterative Error Detection and Correction. A Case Study on the Italian Universal Dependency Treebank en
dc.type.driver info:eu-repo/semantics/conferenceObject -
dc.type.full 04 Contributo in convegno::04.01 Contributo in Atti di convegno it
dc.type.miur 273 -
dc.type.referee Sì, ma tipo non specificato -
dc.ugov.descaux1 391617 -
iris.orcid.lastModifiedDate 2024/04/04 10:46:38 *
iris.orcid.lastModifiedMillisecond 1712220398790 *
iris.scopus.extIssued 2018 -
iris.scopus.extTitle Assessing the Impact of Incremental Error Detection and Correction. A Case Study on the Italian Universal Dependency Treebank -
iris.sitodocente.maxattempts 10 -
Appare nelle tipologie: 04.01 Contributo in Atti di convegno
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/371344
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact