In the past decades, linguistic typology went through a renewing phase that involved a significant change in the research questions and methods of the discipline, which is now interested in fine-grained features underlying language diversity. In this paper, we propose a novel approach to address the newly defined needs of linguistic typology by extracting qualitative and quantitative information about a wide range of features from multilingual annotated corpora based on Natural Language Processing methods and techniques. We tested our method in a case study focusing on word order variation in two widely investigated constructions, VERB-SUBJ(ect) and NOUN-ADJ(ective), with a specific view to structural and functional factors underlying the preference for one or the other order, both intra- and cross-linguistically, and their interaction. Preliminary experiments have been carried out aimed at acquiring typological evidence from a selection of linguistically annotated treebanks for three different languages, namely Italian, Spanish and English. Our results show the effectiveness of the method in letting similarities and differences also emerge from typologically close languages.

INFERRING QUANTITATIVE TYPOLOGICAL TRENDS FROM MULTILINGUAL TREEBANKS. A CASE STUDY

Alzetta Chiara;Dell'Orletta Felice;Montemagni Simonetta;Venturi Giulia
2019

Abstract

In the past decades, linguistic typology went through a renewing phase that involved a significant change in the research questions and methods of the discipline, which is now interested in fine-grained features underlying language diversity. In this paper, we propose a novel approach to address the newly defined needs of linguistic typology by extracting qualitative and quantitative information about a wide range of features from multilingual annotated corpora based on Natural Language Processing methods and techniques. We tested our method in a case study focusing on word order variation in two widely investigated constructions, VERB-SUBJ(ect) and NOUN-ADJ(ective), with a specific view to structural and functional factors underlying the preference for one or the other order, both intra- and cross-linguistically, and their interaction. Preliminary experiments have been carried out aimed at acquiring typological evidence from a selection of linguistically annotated treebanks for three different languages, namely Italian, Spanish and English. Our results show the effectiveness of the method in letting similarities and differences also emerge from typologically close languages.
Campo DC Valore Lingua
dc.authority.ancejournal LINGUE E LINGUAGGIO -
dc.authority.orgunit Istituto di linguistica computazionale "Antonio Zampolli" - ILC -
dc.authority.people Alzetta Chiara it
dc.authority.people Dell'Orletta Felice it
dc.authority.people Montemagni Simonetta it
dc.authority.people Venturi Giulia it
dc.collection.id.s b3f88f24-048a-4e43-8ab1-6697b90e068e *
dc.collection.name 01.01 Articolo in rivista *
dc.contributor.appartenenza Istituto di linguistica computazionale "Antonio Zampolli" - ILC *
dc.contributor.appartenenza.mi 918 *
dc.date.accessioned 2024/02/21 06:06:38 -
dc.date.available 2024/02/21 06:06:38 -
dc.date.issued 2019 -
dc.description.abstracteng In the past decades, linguistic typology went through a renewing phase that involved a significant change in the research questions and methods of the discipline, which is now interested in fine-grained features underlying language diversity. In this paper, we propose a novel approach to address the newly defined needs of linguistic typology by extracting qualitative and quantitative information about a wide range of features from multilingual annotated corpora based on Natural Language Processing methods and techniques. We tested our method in a case study focusing on word order variation in two widely investigated constructions, VERB-SUBJ(ect) and NOUN-ADJ(ective), with a specific view to structural and functional factors underlying the preference for one or the other order, both intra- and cross-linguistically, and their interaction. Preliminary experiments have been carried out aimed at acquiring typological evidence from a selection of linguistically annotated treebanks for three different languages, namely Italian, Spanish and English. Our results show the effectiveness of the method in letting similarities and differences also emerge from typologically close languages. -
dc.description.affiliations Università di Genova; Istituto di Linguistica Computazionale "A. Zampolli" (ILC-CNR) -
dc.description.allpeople Alzetta, Chiara; Dell'Orletta, Felice; Montemagni, Simonetta; Venturi, Giulia -
dc.description.allpeopleoriginal Alzetta, Chiara; Dell'Orletta, Felice; Montemagni, Simonetta; Venturi, Giulia -
dc.description.fulltext none en
dc.description.numberofauthors 4 -
dc.identifier.doi 10.1418/95391 -
dc.identifier.isi WOS:000503425400002 -
dc.identifier.uri https://hdl.handle.net/20.500.14243/403586 -
dc.identifier.url https://www.rivisteweb.it/doi/10.1418/95391 -
dc.language.iso eng -
dc.relation.firstpage 209 -
dc.relation.issue 2 -
dc.relation.lastpage 242 -
dc.relation.numberofpages 34 -
dc.relation.volume 18 -
dc.subject.keywords language typology -
dc.subject.keywords multilingual annotated corpora -
dc.subject.keywords linguistic knowledge extraction and modelling -
dc.subject.keywords word order variation -
dc.subject.singlekeyword language typology *
dc.subject.singlekeyword multilingual annotated corpora *
dc.subject.singlekeyword linguistic knowledge extraction and modelling *
dc.subject.singlekeyword word order variation *
dc.title INFERRING QUANTITATIVE TYPOLOGICAL TRENDS FROM MULTILINGUAL TREEBANKS. A CASE STUDY en
dc.type.driver info:eu-repo/semantics/article -
dc.type.full 01 Contributo su Rivista::01.01 Articolo in rivista it
dc.type.miur 262 -
dc.type.referee Sì, ma tipo non specificato -
dc.ugov.descaux1 423880 -
iris.isi.extIssued 2019 -
iris.isi.extTitle INFERRING QUANTITATIVE TYPOLOGICAL TRENDS FROM MULTILINGUAL TREEBANKS. A CASE STUDY -
iris.orcid.lastModifiedDate 2024/03/01 14:44:16 *
iris.orcid.lastModifiedMillisecond 1709300656047 *
iris.scopus.extIssued 2019 -
iris.scopus.extTitle Inferring quantitative typological trends from multilingual treebanks. A case study -
iris.sitodocente.maxattempts 3 -
iris.unpaywall.metadataCallLastModified 15/06/2025 07:19:54 -
iris.unpaywall.metadataCallLastModifiedMillisecond 1749964794385 -
iris.unpaywall.metadataErrorDescription 0 -
iris.unpaywall.metadataErrorType ERROR_NO_MATCH -
iris.unpaywall.metadataStatus ERROR -
isi.authority.ancejournal LINGUE E LINGUAGGIO###1720-9331 *
isi.category OY *
isi.contributor.affiliation University of Genoa -
isi.contributor.affiliation Consiglio Nazionale delle Ricerche (CNR) -
isi.contributor.affiliation Consiglio Nazionale delle Ricerche (CNR) -
isi.contributor.affiliation Consiglio Nazionale delle Ricerche (CNR) -
isi.contributor.country Italy -
isi.contributor.country Italy -
isi.contributor.country Italy -
isi.contributor.country Italy -
isi.contributor.name Chiara -
isi.contributor.name Felice -
isi.contributor.name Simonetta -
isi.contributor.name Giulia -
isi.contributor.researcherId KVX-9760-2024 -
isi.contributor.researcherId AAX-1864-2020 -
isi.contributor.researcherId B-8000-2015 -
isi.contributor.researcherId AAY-3932-2020 -
isi.contributor.subaffiliation -
isi.contributor.subaffiliation ILC -
isi.contributor.subaffiliation ILC -
isi.contributor.subaffiliation ILC -
isi.contributor.surname Alzetta -
isi.contributor.surname Dell'Orletta -
isi.contributor.surname Montemagni -
isi.contributor.surname Venturi -
isi.date.issued 2019 *
isi.description.abstracteng In the past decades, linguistic typology went through a renewing phase that involved a significant change in the research questions and methods of the discipline, which is now interested in fine-grained features underlying language diversity. In this paper, we propose a novel approach to address the newly defined needs of linguistic typology by extracting qualitative and quantitative information about a wide range of features from multilingual annotated corpora based on Natural Language Processing methods and techniques. We tested our method in a case study focusing on word order variation in two widely investigated constructions, VERB-SUBJ(ect) and NOUN-ADJ(ective), with a specific view to structural and functional factors underlying the preference for one or the other order, both intra- and cross-linguistically, and their interaction. Preliminary experiments have been carried out aimed at acquiring typological evidence from a selection of linguistically annotated treebanks for three different languages, namely Italian, Spanish and English. Our results show the effectiveness of the method in letting similarities and differences also emerge from typologically close languages. *
isi.description.allpeopleoriginal Alzetta, C; Dell'Orletta, F; Montemagni, S; Venturi, G; *
isi.document.sourcetype WOS.ESCI *
isi.document.type Article *
isi.document.types Article *
isi.identifier.doi 10.1418/95391 *
isi.identifier.isi WOS:000503425400002 *
isi.journal.journaltitle LINGUE E LINGUAGGIO *
isi.journal.journaltitleabbrev LINGUE LINGUAGGIO *
isi.language.original English *
isi.publisher.place STRADA MAGGIORE 37, 40125 BOLOGNA, ITALY *
isi.relation.firstpage 209 *
isi.relation.issue 2 *
isi.relation.lastpage 242 *
isi.relation.volume 18 *
isi.title INFERRING QUANTITATIVE TYPOLOGICAL TRENDS FROM MULTILINGUAL TREEBANKS. A CASE STUDY *
Appare nelle tipologie: 01.01 Articolo in rivista
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/403586
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? 2
social impact