The paper describes the methodology which is currently being defined for the construction of a "Merged Italian Dependency Treebank" (MIDT) starting from already existing resources. In particular, it reports the results of a case study carried out on two available dependency treebanks, i.e. TUT and ISST-TANL. The issues raised during the comparison of the annotation schemes underlying the two treebanks are discussed and investigated with a particular emphasis on the definition of a set of linguistic categories to be used as a "bridge" between the specific schemes. As an encoding format, the CoNLL de facto standard is used.
Harmonization and Merging of two Italian Dependency Treebanks
Simonetta Montemagni;
2012
Abstract
The paper describes the methodology which is currently being defined for the construction of a "Merged Italian Dependency Treebank" (MIDT) starting from already existing resources. In particular, it reports the results of a case study carried out on two available dependency treebanks, i.e. TUT and ISST-TANL. The issues raised during the comparison of the annotation schemes underlying the two treebanks are discussed and investigated with a particular emphasis on the definition of a set of linguistic categories to be used as a "bridge" between the specific schemes. As an encoding format, the CoNLL de facto standard is used.| Campo DC | Valore | Lingua |
|---|---|---|
| dc.authority.orgunit | Istituto di linguistica computazionale "Antonio Zampolli" - ILC | - |
| dc.authority.people | Cristina Bosco | it |
| dc.authority.people | Simonetta Montemagni | it |
| dc.authority.people | Maria Simi | it |
| dc.collection.id.s | 71c7200a-7c5f-4e83-8d57-d3d2ba88f40d | * |
| dc.collection.name | 04.01 Contributo in Atti di convegno | * |
| dc.contributor.appartenenza | Istituto di linguistica computazionale "Antonio Zampolli" - ILC | * |
| dc.contributor.appartenenza.mi | 918 | * |
| dc.date.accessioned | 2024/02/21 05:39:14 | - |
| dc.date.available | 2024/02/21 05:39:14 | - |
| dc.date.issued | 2012 | - |
| dc.description.abstracteng | The paper describes the methodology which is currently being defined for the construction of a "Merged Italian Dependency Treebank" (MIDT) starting from already existing resources. In particular, it reports the results of a case study carried out on two available dependency treebanks, i.e. TUT and ISST-TANL. The issues raised during the comparison of the annotation schemes underlying the two treebanks are discussed and investigated with a particular emphasis on the definition of a set of linguistic categories to be used as a "bridge" between the specific schemes. As an encoding format, the CoNLL de facto standard is used. | - |
| dc.description.affiliations | Università di Torino Istituto di Linguistica Computazionale "Antonio Zampolli" (ILC-CNR) - Pisa Università di Pisa | - |
| dc.description.allpeople | Bosco, Cristina; Montemagni, Simonetta; Simi, Maria | - |
| dc.description.allpeopleoriginal | Cristina Bosco; Simonetta Montemagni; Maria Simi | - |
| dc.description.fulltext | none | en |
| dc.description.numberofauthors | 3 | - |
| dc.identifier.isbn | 978-2-9517408-7-7 | - |
| dc.identifier.uri | https://hdl.handle.net/20.500.14243/297499 | - |
| dc.identifier.url | http://www.lrec-conf.org/proceedings/lrec2012/workshops/06.LREC%202012%20Merging%20Proceedings.pdf | - |
| dc.language.iso | eng | - |
| dc.publisher.country | FRA | - |
| dc.publisher.name | European Language Resources Association ELRA | - |
| dc.publisher.place | Paris | - |
| dc.relation.alleditors | Nuria Bel et al. | - |
| dc.relation.conferencedate | 22 May 2012 | - |
| dc.relation.conferencename | LREC 2012 Workshop on Language Resource Merging | - |
| dc.relation.conferenceplace | Istambul | - |
| dc.relation.firstpage | 23 | - |
| dc.relation.ispartofbook | Proceedings of the LREC 2012 Workshop on Language Resource Merging | - |
| dc.relation.lastpage | 30 | - |
| dc.subject.keywords | Syntactic Annotation | - |
| dc.subject.keywords | Merging of Resources | - |
| dc.subject.keywords | Dependency Parsing | - |
| dc.subject.singlekeyword | Syntactic Annotation | * |
| dc.subject.singlekeyword | Merging of Resources | * |
| dc.subject.singlekeyword | Dependency Parsing | * |
| dc.title | Harmonization and Merging of two Italian Dependency Treebanks | en |
| dc.type.driver | info:eu-repo/semantics/conferenceObject | - |
| dc.type.full | 04 Contributo in convegno::04.01 Contributo in Atti di convegno | it |
| dc.type.miur | 273 | - |
| dc.type.referee | Sì, ma tipo non specificato | - |
| dc.ugov.descaux1 | 330109 | - |
| iris.orcid.lastModifiedDate | 2024/04/04 12:49:41 | * |
| iris.orcid.lastModifiedMillisecond | 1712227781638 | * |
| iris.sitodocente.maxattempts | 3 | - |
| Appare nelle tipologie: | 04.01 Contributo in Atti di convegno | |
File in questo prodotto:
Non ci sono file associati a questo prodotto.
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.


