INFERRING QUANTITATIVE TYPOLOGICAL TRENDS FROM MULTILINGUAL TREEBANKS. A CASE STUDY
Alzetta Chiara; Dell'Orletta Felice; Montemagni Simonetta; Venturi Giulia
2019
Abstract
In the past decades, linguistic typology went through a renewing phase that involved a significant change in the research questions and methods of the discipline, which is now interested in fine-grained features underlying language diversity. In this paper, we propose a novel approach to address the newly defined needs of linguistic typology by extracting qualitative and quantitative information about a wide range of features from multilingual annotated corpora based on Natural Language Processing methods and techniques. We tested our method in a case study focusing on word order variation in two widely investigated constructions, VERB-SUBJ(ect) and NOUN-ADJ(ective), with a specific view to structural and functional factors underlying the preference for one or the other order, both intra- and cross-linguistically, and their interaction. Preliminary experiments have been carried out aimed at acquiring typological evidence from a selection of linguistically annotated treebanks for three different languages, namely Italian, Spanish and English. Our results show the effectiveness of the method in letting similarities and differences also emerge from typologically close languages.

| DC field | Value | Language |
|---|---|---|
| dc.authority.ancejournal | LINGUE E LINGUAGGIO | - |
| dc.authority.orgunit | Istituto di linguistica computazionale "Antonio Zampolli" - ILC | - |
| dc.authority.people | Alzetta Chiara | it |
| dc.authority.people | Dell'Orletta Felice | it |
| dc.authority.people | Montemagni Simonetta | it |
| dc.authority.people | Venturi Giulia | it |
| dc.collection.id.s | b3f88f24-048a-4e43-8ab1-6697b90e068e | * |
| dc.collection.name | 01.01 Articolo in rivista | * |
| dc.contributor.appartenenza | Istituto di linguistica computazionale "Antonio Zampolli" - ILC | * |
| dc.contributor.appartenenza.mi | 918 | * |
| dc.date.accessioned | 2024/02/21 06:06:38 | - |
| dc.date.available | 2024/02/21 06:06:38 | - |
| dc.date.issued | 2019 | - |
| dc.description.abstracteng | In the past decades, linguistic typology went through a renewing phase that involved a significant change in the research questions and methods of the discipline, which is now interested in fine-grained features underlying language diversity. In this paper, we propose a novel approach to address the newly defined needs of linguistic typology by extracting qualitative and quantitative information about a wide range of features from multilingual annotated corpora based on Natural Language Processing methods and techniques. We tested our method in a case study focusing on word order variation in two widely investigated constructions, VERB-SUBJ(ect) and NOUN-ADJ(ective), with a specific view to structural and functional factors underlying the preference for one or the other order, both intra- and cross-linguistically, and their interaction. Preliminary experiments have been carried out aimed at acquiring typological evidence from a selection of linguistically annotated treebanks for three different languages, namely Italian, Spanish and English. Our results show the effectiveness of the method in letting similarities and differences also emerge from typologically close languages. | - |
| dc.description.affiliations | Università di Genova; Istituto di Linguistica Computazionale "A. Zampolli" (ILC-CNR) | - |
| dc.description.allpeople | Alzetta, Chiara; Dell'Orletta, Felice; Montemagni, Simonetta; Venturi, Giulia | - |
| dc.description.allpeopleoriginal | Alzetta, Chiara; Dell'Orletta, Felice; Montemagni, Simonetta; Venturi, Giulia | - |
| dc.description.fulltext | none | en |
| dc.description.numberofauthors | 4 | - |
| dc.identifier.doi | 10.1418/95391 | - |
| dc.identifier.isi | WOS:000503425400002 | - |
| dc.identifier.uri | https://hdl.handle.net/20.500.14243/403586 | - |
| dc.identifier.url | https://www.rivisteweb.it/doi/10.1418/95391 | - |
| dc.language.iso | eng | - |
| dc.relation.firstpage | 209 | - |
| dc.relation.issue | 2 | - |
| dc.relation.lastpage | 242 | - |
| dc.relation.numberofpages | 34 | - |
| dc.relation.volume | 18 | - |
| dc.subject.keywords | language typology | - |
| dc.subject.keywords | multilingual annotated corpora | - |
| dc.subject.keywords | linguistic knowledge extraction and modelling | - |
| dc.subject.keywords | word order variation | - |
| dc.subject.singlekeyword | language typology | * |
| dc.subject.singlekeyword | multilingual annotated corpora | * |
| dc.subject.singlekeyword | linguistic knowledge extraction and modelling | * |
| dc.subject.singlekeyword | word order variation | * |
| dc.title | INFERRING QUANTITATIVE TYPOLOGICAL TRENDS FROM MULTILINGUAL TREEBANKS. A CASE STUDY | en |
| dc.type.driver | info:eu-repo/semantics/article | - |
| dc.type.full | 01 Contributo su Rivista::01.01 Articolo in rivista | it |
| dc.type.miur | 262 | - |
| dc.type.referee | Yes, but type not specified | - |
| dc.ugov.descaux1 | 423880 | - |
| iris.isi.extIssued | 2019 | - |
| iris.isi.extTitle | INFERRING QUANTITATIVE TYPOLOGICAL TRENDS FROM MULTILINGUAL TREEBANKS. A CASE STUDY | - |
| iris.orcid.lastModifiedDate | 2024/03/01 14:44:16 | * |
| iris.orcid.lastModifiedMillisecond | 1709300656047 | * |
| iris.scopus.extIssued | 2019 | - |
| iris.scopus.extTitle | Inferring quantitative typological trends from multilingual treebanks. A case study | - |
| iris.sitodocente.maxattempts | 3 | - |
| iris.unpaywall.metadataCallLastModified | 15/06/2025 07:19:54 | - |
| iris.unpaywall.metadataCallLastModifiedMillisecond | 1749964794385 | - |
| iris.unpaywall.metadataErrorDescription | 0 | - |
| iris.unpaywall.metadataErrorType | ERROR_NO_MATCH | - |
| iris.unpaywall.metadataStatus | ERROR | - |
| isi.authority.ancejournal | LINGUE E LINGUAGGIO###1720-9331 | * |
| isi.category | OY | * |
| isi.contributor.affiliation | University of Genoa | - |
| isi.contributor.affiliation | Consiglio Nazionale delle Ricerche (CNR) | - |
| isi.contributor.affiliation | Consiglio Nazionale delle Ricerche (CNR) | - |
| isi.contributor.affiliation | Consiglio Nazionale delle Ricerche (CNR) | - |
| isi.contributor.country | Italy | - |
| isi.contributor.country | Italy | - |
| isi.contributor.country | Italy | - |
| isi.contributor.country | Italy | - |
| isi.contributor.name | Chiara | - |
| isi.contributor.name | Felice | - |
| isi.contributor.name | Simonetta | - |
| isi.contributor.name | Giulia | - |
| isi.contributor.researcherId | KVX-9760-2024 | - |
| isi.contributor.researcherId | AAX-1864-2020 | - |
| isi.contributor.researcherId | B-8000-2015 | - |
| isi.contributor.researcherId | AAY-3932-2020 | - |
| isi.contributor.subaffiliation | - | |
| isi.contributor.subaffiliation | ILC | - |
| isi.contributor.subaffiliation | ILC | - |
| isi.contributor.subaffiliation | ILC | - |
| isi.contributor.surname | Alzetta | - |
| isi.contributor.surname | Dell'Orletta | - |
| isi.contributor.surname | Montemagni | - |
| isi.contributor.surname | Venturi | - |
| isi.date.issued | 2019 | * |
| isi.description.abstracteng | In the past decades, linguistic typology went through a renewing phase that involved a significant change in the research questions and methods of the discipline, which is now interested in fine-grained features underlying language diversity. In this paper, we propose a novel approach to address the newly defined needs of linguistic typology by extracting qualitative and quantitative information about a wide range of features from multilingual annotated corpora based on Natural Language Processing methods and techniques. We tested our method in a case study focusing on word order variation in two widely investigated constructions, VERB-SUBJ(ect) and NOUN-ADJ(ective), with a specific view to structural and functional factors underlying the preference for one or the other order, both intra- and cross-linguistically, and their interaction. Preliminary experiments have been carried out aimed at acquiring typological evidence from a selection of linguistically annotated treebanks for three different languages, namely Italian, Spanish and English. Our results show the effectiveness of the method in letting similarities and differences also emerge from typologically close languages. | * |
| isi.description.allpeopleoriginal | Alzetta, C; Dell'Orletta, F; Montemagni, S; Venturi, G; | * |
| isi.document.sourcetype | WOS.ESCI | * |
| isi.document.type | Article | * |
| isi.document.types | Article | * |
| isi.identifier.doi | 10.1418/95391 | * |
| isi.identifier.isi | WOS:000503425400002 | * |
| isi.journal.journaltitle | LINGUE E LINGUAGGIO | * |
| isi.journal.journaltitleabbrev | LINGUE LINGUAGGIO | * |
| isi.language.original | English | * |
| isi.publisher.place | STRADA MAGGIORE 37, 40125 BOLOGNA, ITALY | * |
| isi.relation.firstpage | 209 | * |
| isi.relation.issue | 2 | * |
| isi.relation.lastpage | 242 | * |
| isi.relation.volume | 18 | * |
| isi.title | INFERRING QUANTITATIVE TYPOLOGICAL TRENDS FROM MULTILINGUAL TREEBANKS. A CASE STUDY | * |
| Appears in types: | 01.01 Articolo in rivista | |
Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.


