Detection and correction of errors and inconsistencies in "gold treebanks" are becoming more and more central topics of corpus annotation. The paper illustrates a new incremental method for enhancing treebanks, with particular emphasis on the extension of error patterns across different textual genres and registers. Impact and role of corrections have been assessed in a dependency parsing experiment carried out with four different parsers, whose results are promising. For both evaluation datasets, the performance of parsers increases, in terms of the standard LAS and UAS measures and of a more focused measure taking into account only relations involved in error patterns, and at the level of individual dependencies.
Assessing the Impact of Iterative Error Detection and Correction. A Case Study on the Italian Universal Dependency Treebank
Alzetta C;Dell'Orletta F;Montemagni S;Venturi G
2018
Abstract
Detection and correction of errors and inconsistencies in "gold treebanks" are becoming more and more central topics of corpus annotation. The paper illustrates a new incremental method for enhancing treebanks, with particular emphasis on the extension of error patterns across different textual genres and registers. Impact and role of corrections have been assessed in a dependency parsing experiment carried out with four different parsers, whose results are promising. For both evaluation datasets, the performance of parsers increases, in terms of the standard LAS and UAS measures and of a more focused measure taking into account only relations involved in error patterns, and at the level of individual dependencies.| Campo DC | Valore | Lingua |
|---|---|---|
| dc.authority.orgunit | Istituto di linguistica computazionale "Antonio Zampolli" - ILC | - |
| dc.authority.people | Alzetta C | it |
| dc.authority.people | Dell'Orletta F | it |
| dc.authority.people | Montemagni S | it |
| dc.authority.people | Simi M | it |
| dc.authority.people | Venturi G | it |
| dc.collection.id.s | 71c7200a-7c5f-4e83-8d57-d3d2ba88f40d | * |
| dc.collection.name | 04.01 Contributo in Atti di convegno | * |
| dc.contributor.appartenenza | Istituto di linguistica computazionale "Antonio Zampolli" - ILC | * |
| dc.contributor.appartenenza.mi | 918 | * |
| dc.date.accessioned | 2024/02/17 21:37:11 | - |
| dc.date.available | 2024/02/17 21:37:11 | - |
| dc.date.issued | 2018 | - |
| dc.description.abstracteng | Detection and correction of errors and inconsistencies in "gold treebanks" are becoming more and more central topics of corpus annotation. The paper illustrates a new incremental method for enhancing treebanks, with particular emphasis on the extension of error patterns across different textual genres and registers. Impact and role of corrections have been assessed in a dependency parsing experiment carried out with four different parsers, whose results are promising. For both evaluation datasets, the performance of parsers increases, in terms of the standard LAS and UAS measures and of a more focused measure taking into account only relations involved in error patterns, and at the level of individual dependencies. | - |
| dc.description.affiliations | Università di Genova; Istituto di Linguistica Computazionale; Università di Pisa | - |
| dc.description.allpeople | Alzetta, C; Dell'Orletta, F; Montemagni, S; Simi, M; Venturi, G | - |
| dc.description.allpeopleoriginal | Alzetta C., Dell'Orletta F., Montemagni S., Simi M., Venturi G. | - |
| dc.description.fulltext | none | en |
| dc.description.numberofauthors | 5 | - |
| dc.identifier.isbn | 978-1-948087-84-1 | - |
| dc.identifier.uri | https://hdl.handle.net/20.500.14243/371344 | - |
| dc.identifier.url | http://universaldependencies.org/udw18/PDFs/39_Paper.pdf | - |
| dc.language.iso | eng | - |
| dc.miur.last.status.update | 2024-05-16T16:05:48Z | * |
| dc.relation.conferencedate | 01/11/2018 | - |
| dc.relation.conferencename | Universal Dependencies Workshop 2018 (UDW 2018) | - |
| dc.relation.conferenceplace | Brussels | - |
| dc.relation.firstpage | 1 | - |
| dc.relation.lastpage | 7 | - |
| dc.relation.numberofpages | 7 | - |
| dc.subject.keywords | Error Detection | - |
| dc.subject.keywords | Universal Dependency Treebanks | - |
| dc.subject.keywords | Syntactic parsing | - |
| dc.subject.singlekeyword | Error Detection | * |
| dc.subject.singlekeyword | Universal Dependency Treebanks | * |
| dc.subject.singlekeyword | Syntactic parsing | * |
| dc.title | Assessing the Impact of Iterative Error Detection and Correction. A Case Study on the Italian Universal Dependency Treebank | en |
| dc.type.driver | info:eu-repo/semantics/conferenceObject | - |
| dc.type.full | 04 Contributo in convegno::04.01 Contributo in Atti di convegno | it |
| dc.type.miur | 273 | - |
| dc.type.referee | Sì, ma tipo non specificato | - |
| dc.ugov.descaux1 | 391617 | - |
| iris.orcid.lastModifiedDate | 2024/04/04 10:46:38 | * |
| iris.orcid.lastModifiedMillisecond | 1712220398790 | * |
| iris.scopus.extIssued | 2018 | - |
| iris.scopus.extTitle | Assessing the Impact of Incremental Error Detection and Correction. A Case Study on the Italian Universal Dependency Treebank | - |
| iris.sitodocente.maxattempts | 10 | - |
| Appare nelle tipologie: | 04.01 Contributo in Atti di convegno | |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.


