The paper introduces Parallel Trees, a novel multilingual treebank collection that includes 20 treebanks for 10 languages. The distinguishing property of this resource is that the sentences of each language are annotated using two syntactic representation paradigms (SRPs), respectively based on the notions of dependency and constituency. By aligning the annotations of existing resources, Parallel Trees represents an example of exploiting pre-existing treebanks to adapt them to novel applications. To illustrate its potential, we present a case study where the resource is employed as a benchmark to investigate whether and how BERT, one of the first prominent neural language models (NLMs), is sensitive to the dependency- and constituency-based approaches for representing the syntactic structure of a sentence. The case study results indicate that the model's sensitivity fluctuates across languages and experimental settings. The unique nature of the Parallel Trees resource creates the prerequisites for innovative studies comparing dependency and phrase-structure trees, allowing for more focused investigations without the interference of lexical variation.

Parallel Trees: a novel resource with aligned dependency and constituency syntactic representations

Alzetta C.;Miaschi A.;Dell'Orletta F.;Venturi G.;Montemagni S.
2025

Abstract

The paper introduces Parallel Trees, a novel multilingual treebank collection that includes 20 treebanks for 10 languages. The distinguishing property of this resource is that the sentences of each language are annotated using two syntactic representation paradigms (SRPs), respectively based on the notions of dependency and constituency. By aligning the annotations of existing resources, Parallel Trees represents an example of exploiting pre-existing treebanks to adapt them to novel applications. To illustrate its potential, we present a case study where the resource is employed as a benchmark to investigate whether and how BERT, one of the first prominent neural language models (NLMs), is sensitive to the dependency- and constituency-based approaches for representing the syntactic structure of a sentence. The case study results indicate that the model's sensitivity fluctuates across languages and experimental settings. The unique nature of the Parallel Trees resource creates the prerequisites for innovative studies comparing dependency and phrase-structure trees, allowing for more focused investigations without the interference of lexical variation.
Campo DC Valore Lingua
dc.authority.ancejournal LANGUAGE RESOURCES AND EVALUATION en
dc.authority.orgunit Istituto di linguistica computazionale "Antonio Zampolli" - ILC en
dc.authority.people Alzetta C. en
dc.authority.people Miaschi A. en
dc.authority.people Dell'Orletta F. en
dc.authority.people Venturi G. en
dc.authority.people Montemagni S. en
dc.collection.id.s b3f88f24-048a-4e43-8ab1-6697b90e068e *
dc.collection.name 01.01 Articolo in rivista *
dc.contributor.appartenenza Istituto di linguistica computazionale "Antonio Zampolli" - ILC *
dc.contributor.appartenenza.mi 918 *
dc.contributor.area Non assegn *
dc.contributor.area Non assegn *
dc.contributor.area Non assegn *
dc.contributor.area Non assegn *
dc.contributor.area Non assegn *
dc.date.accessioned 2026/03/03 15:07:50 -
dc.date.available 2026/03/03 15:07:50 -
dc.date.firstsubmission 2026/03/02 18:03:28 *
dc.date.issued 2025 -
dc.date.submission 2026/03/02 18:03:28 *
dc.description.abstracteng The paper introduces Parallel Trees, a novel multilingual treebank collection that includes 20 treebanks for 10 languages. The distinguishing property of this resource is that the sentences of each language are annotated using two syntactic representation paradigms (SRPs), respectively based on the notions of dependency and constituency. By aligning the annotations of existing resources, Parallel Trees represents an example of exploiting pre-existing treebanks to adapt them to novel applications. To illustrate its potential, we present a case study where the resource is employed as a benchmark to investigate whether and how BERT, one of the first prominent neural language models (NLMs), is sensitive to the dependency- and constituency-based approaches for representing the syntactic structure of a sentence. The case study results indicate that the model's sensitivity fluctuates across languages and experimental settings. The unique nature of the Parallel Trees resource creates the prerequisites for innovative studies comparing dependency and phrase-structure trees, allowing for more focused investigations without the interference of lexical variation. -
dc.description.allpeople Alzetta, C.; Miaschi, A.; Dell'Orletta, F.; Venturi, G.; Montemagni, S. -
dc.description.allpeopleoriginal Alzetta C.; Miaschi A.; Dell'Orletta F.; Venturi G.; Montemagni S. en
dc.description.fulltext open en
dc.description.numberofauthors 5 -
dc.identifier.doi 10.1007/s10579-025-09826-3 en
dc.identifier.isi WOS:001518296100001 -
dc.identifier.scopus 2-s2.0-105009058572 en
dc.identifier.source scopus *
dc.identifier.uri https://hdl.handle.net/20.500.14243/570443 -
dc.language.iso eng en
dc.relation.firstpage 3445 en
dc.relation.issue 4 en
dc.relation.lastpage 3485 en
dc.relation.numberofpages 41 en
dc.relation.volume 59 en
dc.subject.keywords Parallel treebanks -
dc.subject.keywords Syntactic representation -
dc.subject.keywords Diagnostic probing paradigm -
dc.subject.keywords Neural language model -
dc.subject.singlekeyword Parallel treebanks *
dc.subject.singlekeyword Syntactic representation *
dc.subject.singlekeyword Diagnostic probing paradigm *
dc.subject.singlekeyword Neural language model *
dc.title Parallel Trees: a novel resource with aligned dependency and constituency syntactic representations en
dc.type.driver info:eu-repo/semantics/article -
dc.type.full 01 Contributo su Rivista::01.01 Articolo in rivista it
dc.type.miur 262 -
iris.isi.extIssued 2025 -
iris.isi.extTitle Parallel Trees: a novel resource with aligned dependencyand constituency syntactic representations -
iris.mediafilter.data 2026/03/04 02:52:26 *
iris.orcid.lastModifiedDate 2026/03/04 01:09:50 *
iris.orcid.lastModifiedMillisecond 1772582990571 *
iris.scopus.extIssued 2025 -
iris.scopus.extTitle Parallel Trees: a novel resource with aligned dependency and constituency syntactic representations -
iris.sitodocente.maxattempts 1 -
iris.unpaywall.bestoahost publisher *
iris.unpaywall.bestoaversion publishedVersion *
iris.unpaywall.doi 10.1007/s10579-025-09826-3 *
iris.unpaywall.hosttype publisher *
iris.unpaywall.isoa true *
iris.unpaywall.journalisindoaj false *
iris.unpaywall.landingpage https://doi.org/10.1007/s10579-025-09826-3 *
iris.unpaywall.license cc-by *
iris.unpaywall.metadataCallLastModified 04/03/2026 04:33:59 -
iris.unpaywall.metadataCallLastModifiedMillisecond 1772595239534 -
iris.unpaywall.oastatus hybrid *
iris.unpaywall.pdfurl https://link.springer.com/content/pdf/10.1007/s10579-025-09826-3.pdf *
isi.authority.ancejournal LANGUAGE RESOURCES AND EVALUATION###1574-020X *
isi.category EV *
isi.contributor.affiliation Consiglio Nazionale delle Ricerche (CNR) -
isi.contributor.affiliation Consiglio Nazionale delle Ricerche (CNR) -
isi.contributor.affiliation Consiglio Nazionale delle Ricerche (CNR) -
isi.contributor.affiliation Consiglio Nazionale delle Ricerche (CNR) -
isi.contributor.affiliation Consiglio Nazionale delle Ricerche (CNR) -
isi.contributor.country Italy -
isi.contributor.country Italy -
isi.contributor.country Italy -
isi.contributor.country Italy -
isi.contributor.country Italy -
isi.contributor.name Chiara -
isi.contributor.name Alessio -
isi.contributor.name Felice -
isi.contributor.name Giulia -
isi.contributor.name Simonetta -
isi.contributor.researcherId KVX-9760-2024 -
isi.contributor.researcherId GCD-5321-2022 -
isi.contributor.researcherId AAX-1864-2020 -
isi.contributor.researcherId AAY-3932-2020 -
isi.contributor.researcherId B-8000-2015 -
isi.contributor.subaffiliation Ist Linguist Computaz A Zampolli -
isi.contributor.subaffiliation Ist Linguist Computaz A Zampolli -
isi.contributor.subaffiliation Ist Linguist Computaz A Zampolli -
isi.contributor.subaffiliation Ist Linguist Computaz A Zampolli -
isi.contributor.subaffiliation Ist Linguist Computaz A Zampolli -
isi.contributor.surname Alzetta -
isi.contributor.surname Miaschi -
isi.contributor.surname Dell'Orletta -
isi.contributor.surname Venturi -
isi.contributor.surname Montemagni -
isi.date.issued 2025 *
isi.description.abstracteng The paper introduces Parallel Trees, a novel multilingual treebank collection that includes 20 treebanks for 10 languages. The distinguishing property of this resource is that the sentences of each language are annotated using two syntactic representation paradigms (SRPs), respectively based on the notions of dependency and constituency. By aligning the annotations of existing resources, Parallel Trees represents an example of exploiting pre-existing treebanks to adapt them to novel applications. To illustrate its potential, we present a case study where the resource is employed as a benchmark to investigate whether and how BERT, one of the first prominent neural language models (NLMs), is sensitive to the dependency- and constituency-based approaches for representing the syntactic structure of a sentence. The case study results indicate that the model's sensitivity fluctuates across languages and experimental settings. The unique nature of the Parallel Trees resource creates the prerequisites for innovative studies comparing dependency and phrase-structure trees, allowing for more focused investigations without the interference of lexical variation. *
isi.description.allpeopleoriginal Alzetta, C; Miaschi, A; Dell'Orletta, F; Venturi, G; Montemagni, S; *
isi.document.sourcetype WOS.SCI *
isi.document.type Article *
isi.document.types Article *
isi.identifier.doi 10.1007/s10579-025-09826-3 *
isi.identifier.eissn 1574-0218 *
isi.identifier.isi WOS:001518296100001 *
isi.journal.journaltitle LANGUAGE RESOURCES AND EVALUATION *
isi.journal.journaltitleabbrev LANG RESOUR EVAL *
isi.language.original English *
isi.publisher.place VAN GODEWIJCKSTRAAT 30, 3311 GZ DORDRECHT, NETHERLANDS *
isi.relation.firstpage 3445 *
isi.relation.issue 4 *
isi.relation.lastpage 3485 *
isi.relation.volume 59 *
isi.title Parallel Trees: a novel resource with aligned dependencyand constituency syntactic representations *
scopus.authority.ancejournal LANGUAGE RESOURCES AND EVALUATION###1574-020X *
scopus.category 1203 *
scopus.category 3304 *
scopus.category 3310 *
scopus.category 3309 *
scopus.contributor.affiliation ItaliaNLP Lab -
scopus.contributor.affiliation ItaliaNLP Lab -
scopus.contributor.affiliation ItaliaNLP Lab -
scopus.contributor.affiliation ItaliaNLP Lab -
scopus.contributor.affiliation ItaliaNLP Lab -
scopus.contributor.afid 60008941 -
scopus.contributor.afid 60008941 -
scopus.contributor.afid 60008941 -
scopus.contributor.afid 60008941 -
scopus.contributor.afid 60008941 -
scopus.contributor.auid 57192938832 -
scopus.contributor.auid 57211678681 -
scopus.contributor.auid 57540567000 -
scopus.contributor.auid 27568199800 -
scopus.contributor.auid 15056781100 -
scopus.contributor.country Italy -
scopus.contributor.country Italy -
scopus.contributor.country Italy -
scopus.contributor.country Italy -
scopus.contributor.country Italy -
scopus.contributor.dptid 114087935 -
scopus.contributor.dptid 114087935 -
scopus.contributor.dptid 114087935 -
scopus.contributor.dptid 114087935 -
scopus.contributor.dptid 114087935 -
scopus.contributor.name Chiara -
scopus.contributor.name Alessio -
scopus.contributor.name Felice -
scopus.contributor.name Giulia -
scopus.contributor.name Simonetta -
scopus.contributor.subaffiliation Istituto di Linguistica Computazionale “A. Zampolli”;CNR-ILC; -
scopus.contributor.subaffiliation Istituto di Linguistica Computazionale “A. Zampolli”;CNR-ILC; -
scopus.contributor.subaffiliation Istituto di Linguistica Computazionale “A. Zampolli”;CNR-ILC; -
scopus.contributor.subaffiliation Istituto di Linguistica Computazionale “A. Zampolli”;CNR-ILC; -
scopus.contributor.subaffiliation Istituto di Linguistica Computazionale “A. Zampolli”;CNR-ILC; -
scopus.contributor.surname Alzetta -
scopus.contributor.surname Miaschi -
scopus.contributor.surname Dell’Orletta -
scopus.contributor.surname Venturi -
scopus.contributor.surname Montemagni -
scopus.date.issued 2025 *
scopus.description.abstracteng The paper introduces Parallel Trees, a novel multilingual treebank collection that includes 20 treebanks for 10 languages. The distinguishing property of this resource is that the sentences of each language are annotated using two syntactic representation paradigms (SRPs), respectively based on the notions of dependency and constituency. By aligning the annotations of existing resources, Parallel Trees represents an example of exploiting pre-existing treebanks to adapt them to novel applications. To illustrate its potential, we present a case study where the resource is employed as a benchmark to investigate whether and how BERT, one of the first prominent neural language models (NLMs), is sensitive to the dependency- and constituency-based approaches for representing the syntactic structure of a sentence. The case study results indicate that the model’s sensitivity fluctuates across languages and experimental settings. The unique nature of the Parallel Trees resource creates the prerequisites for innovative studies comparing dependency and phrase-structure trees, allowing for more focused investigations without the interference of lexical variation. *
scopus.description.allpeopleoriginal Alzetta C.; Miaschi A.; Dell'Orletta F.; Venturi G.; Montemagni S. *
scopus.differences scopus.subject.keywords *
scopus.differences scopus.description.abstracteng *
scopus.document.type ar *
scopus.document.types ar *
scopus.identifier.doi 10.1007/s10579-025-09826-3 *
scopus.identifier.eissn 1574-0218 *
scopus.identifier.pui 2035098476 *
scopus.identifier.scopus 2-s2.0-105009058572 *
scopus.journal.sourceid 145663 *
scopus.language.iso eng *
scopus.publisher.name Springer Science and Business Media B.V. *
scopus.relation.firstpage 3445 *
scopus.relation.issue 4 *
scopus.relation.lastpage 3485 *
scopus.relation.volume 59 *
scopus.subject.keywords Diagnostic probing paradigm; Neural language model; Parallel treebanks; Syntactic representation; *
scopus.title Parallel Trees: a novel resource with aligned dependency and constituency syntactic representations *
scopus.titleeng Parallel Trees: a novel resource with aligned dependency and constituency syntactic representations *
Appare nelle tipologie: 01.01 Articolo in rivista
File in questo prodotto:
File Dimensione Formato  
s10579-025-09826-3.pdf

accesso aperto

Licenza: Creative commons
Dimensione 1.99 MB
Formato Adobe PDF
1.99 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/570443
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? 0
social impact