The paper describes the methodology which is currently being defined for the construction of a "Merged Italian Dependency Treebank" (MIDT) starting from already existing resources. In particular, it reports the results of a case study carried out on two available dependency treebanks, i.e. TUT and ISST-TANL. The issues raised during the comparison of the annotation schemes underlying the two treebanks are discussed and investigated with a particular emphasis on the definition of a set of linguistic categories to be used as a "bridge" between the specific schemes. As an encoding format, the CoNLL de facto standard is used.

Harmonization and Merging of two Italian Dependency Treebanks

Simonetta Montemagni;
2012

Abstract

The paper describes the methodology which is currently being defined for the construction of a "Merged Italian Dependency Treebank" (MIDT) starting from already existing resources. In particular, it reports the results of a case study carried out on two available dependency treebanks, i.e. TUT and ISST-TANL. The issues raised during the comparison of the annotation schemes underlying the two treebanks are discussed and investigated with a particular emphasis on the definition of a set of linguistic categories to be used as a "bridge" between the specific schemes. As an encoding format, the CoNLL de facto standard is used.
Campo DC Valore Lingua
dc.authority.orgunit Istituto di linguistica computazionale "Antonio Zampolli" - ILC -
dc.authority.people Cristina Bosco it
dc.authority.people Simonetta Montemagni it
dc.authority.people Maria Simi it
dc.collection.id.s 71c7200a-7c5f-4e83-8d57-d3d2ba88f40d *
dc.collection.name 04.01 Contributo in Atti di convegno *
dc.contributor.appartenenza Istituto di linguistica computazionale "Antonio Zampolli" - ILC *
dc.contributor.appartenenza.mi 918 *
dc.date.accessioned 2024/02/21 05:39:14 -
dc.date.available 2024/02/21 05:39:14 -
dc.date.issued 2012 -
dc.description.abstracteng The paper describes the methodology which is currently being defined for the construction of a "Merged Italian Dependency Treebank" (MIDT) starting from already existing resources. In particular, it reports the results of a case study carried out on two available dependency treebanks, i.e. TUT and ISST-TANL. The issues raised during the comparison of the annotation schemes underlying the two treebanks are discussed and investigated with a particular emphasis on the definition of a set of linguistic categories to be used as a "bridge" between the specific schemes. As an encoding format, the CoNLL de facto standard is used. -
dc.description.affiliations Università di Torino Istituto di Linguistica Computazionale "Antonio Zampolli" (ILC-CNR) - Pisa Università di Pisa -
dc.description.allpeople Bosco, Cristina; Montemagni, Simonetta; Simi, Maria -
dc.description.allpeopleoriginal Cristina Bosco; Simonetta Montemagni; Maria Simi -
dc.description.fulltext none en
dc.description.numberofauthors 3 -
dc.identifier.isbn 978-2-9517408-7-7 -
dc.identifier.uri https://hdl.handle.net/20.500.14243/297499 -
dc.identifier.url http://www.lrec-conf.org/proceedings/lrec2012/workshops/06.LREC%202012%20Merging%20Proceedings.pdf -
dc.language.iso eng -
dc.publisher.country FRA -
dc.publisher.name European Language Resources Association ELRA -
dc.publisher.place Paris -
dc.relation.alleditors Nuria Bel et al. -
dc.relation.conferencedate 22 May 2012 -
dc.relation.conferencename LREC 2012 Workshop on Language Resource Merging -
dc.relation.conferenceplace Istambul -
dc.relation.firstpage 23 -
dc.relation.ispartofbook Proceedings of the LREC 2012 Workshop on Language Resource Merging -
dc.relation.lastpage 30 -
dc.subject.keywords Syntactic Annotation -
dc.subject.keywords Merging of Resources -
dc.subject.keywords Dependency Parsing -
dc.subject.singlekeyword Syntactic Annotation *
dc.subject.singlekeyword Merging of Resources *
dc.subject.singlekeyword Dependency Parsing *
dc.title Harmonization and Merging of two Italian Dependency Treebanks en
dc.type.driver info:eu-repo/semantics/conferenceObject -
dc.type.full 04 Contributo in convegno::04.01 Contributo in Atti di convegno it
dc.type.miur 273 -
dc.type.referee Sì, ma tipo non specificato -
dc.ugov.descaux1 330109 -
iris.orcid.lastModifiedDate 2024/04/04 12:49:41 *
iris.orcid.lastModifiedMillisecond 1712227781638 *
iris.sitodocente.maxattempts 3 -
Appare nelle tipologie: 04.01 Contributo in Atti di convegno
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/297499
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact