This article provides a comprehensive and up-to-date survey of models and vocabularies for creating linguistic linked data (LLD) focusing on the latest developments in the area and both building upon and complementing previous works covering similar territory. The article begins with an overview of some recent trends which have had a significant impact on linked data models and vocabularies. Next, we give a general overview of existing vocabularies and models for different categories of LLD resource. After which we look at some of the latest developments in community standards and initiatives including descriptions of recent work on the OntoLex-Lemon model, a survey of recent initiatives in linguistic annotation and LLD, and a discussion of the LLD metadata vocabularies META-SHARE and lime. In the next part of the paper, we focus on the influence of projects on LLD models and vocabularies, starting with a general survey of relevant projects, before dedicating individual sections to a number of recent projects and their impact on LLD vocabularies and models. Finally, in the conclusion, we look ahead at some future challenges for LLD models and vocabularies. The appendix to the paper consists of a brief introduction to the OntoLex-Lemon model.

When linguistics meets web technologies. Recent advances in modelling linguistic linked data

Anas Fahad Khan
Primo
;
2022

Abstract

This article provides a comprehensive and up-to-date survey of models and vocabularies for creating linguistic linked data (LLD) focusing on the latest developments in the area and both building upon and complementing previous works covering similar territory. The article begins with an overview of some recent trends which have had a significant impact on linked data models and vocabularies. Next, we give a general overview of existing vocabularies and models for different categories of LLD resource. After which we look at some of the latest developments in community standards and initiatives including descriptions of recent work on the OntoLex-Lemon model, a survey of recent initiatives in linguistic annotation and LLD, and a discussion of the LLD metadata vocabularies META-SHARE and lime. In the next part of the paper, we focus on the influence of projects on LLD models and vocabularies, starting with a general survey of relevant projects, before dedicating individual sections to a number of recent projects and their impact on LLD vocabularies and models. Finally, in the conclusion, we look ahead at some future challenges for LLD models and vocabularies. The appendix to the paper consists of a brief introduction to the OntoLex-Lemon model.
Campo DC Valore Lingua
dc.authority.ancejournal SEMANTIC WEB (ONLINE) en
dc.authority.orgunit Istituto di linguistica computazionale "Antonio Zampolli" - ILC en
dc.authority.people Anas Fahad Khan en
dc.authority.people Christian Chiarcos en
dc.authority.people Thierry Declerck en
dc.authority.people Daniela Gifu en
dc.authority.people Elena González-Blanco García en
dc.authority.people Jorge Gracia en
dc.authority.people Maxim Ionov en
dc.authority.people Penny Labropoulou en
dc.authority.people Francesco Mambrini en
dc.authority.people John P. McCrae en
dc.authority.people Émilie Pagé-Perron en
dc.authority.people Marco Passarotti en
dc.authority.people Salvador Ros Muñoz en
dc.authority.people Ciprian-Octavian Truic en
dc.collection.id.s b3f88f24-048a-4e43-8ab1-6697b90e068e *
dc.collection.name 01.01 Articolo in rivista *
dc.contributor.appartenenza Istituto di linguistica computazionale "Antonio Zampolli" - ILC *
dc.contributor.appartenenza.mi 918 *
dc.contributor.area Non assegn *
dc.date.accessioned 2024/02/20 20:52:48 -
dc.date.available 2024/02/20 20:52:48 -
dc.date.firstsubmission 2025/01/22 22:30:47 *
dc.date.issued 2022 -
dc.date.submission 2025/03/05 16:29:48 *
dc.description.abstracteng This article provides a comprehensive and up-to-date survey of models and vocabularies for creating linguistic linked data (LLD) focusing on the latest developments in the area and both building upon and complementing previous works covering similar territory. The article begins with an overview of some recent trends which have had a significant impact on linked data models and vocabularies. Next, we give a general overview of existing vocabularies and models for different categories of LLD resource. After which we look at some of the latest developments in community standards and initiatives including descriptions of recent work on the OntoLex-Lemon model, a survey of recent initiatives in linguistic annotation and LLD, and a discussion of the LLD metadata vocabularies META-SHARE and lime. In the next part of the paper, we focus on the influence of projects on LLD models and vocabularies, starting with a general survey of relevant projects, before dedicating individual sections to a number of recent projects and their impact on LLD vocabularies and models. Finally, in the conclusion, we look ahead at some future challenges for LLD models and vocabularies. The appendix to the paper consists of a brief introduction to the OntoLex-Lemon model. -
dc.description.affiliations Istituto di Linguistica Computazionale <>, Consiglio Nazionale delle Ricerche, Italy, Applied Computational Linguistics Lab, Goethe-Universität Frankfurt am Main, Germany, DFKI GmbH, Multilinguality and Language Technology, Saarbrücken, Germany, Faculty of Computer Science, Alexandru Ioan Cuza University of Iasi, Romania, Institute of Computer Science, Romanian Academy - Ia?i Branch, Romania, Laboratory of Innovation on Digital Humanities, IE University, Spain, Aragon Institute of Engineering Research, University of Zaragoza, Spain, Institute for Language and Speech Processing, Athena Research Center, Greece, CIRCSE Research Centre, Università Cattolica del Sacro Cuore, Milan, Italy, Insight SFI Research Centre for Data Analytics, Data Science Institute, National University of Ireland Galway, Ireland, Wolfson College, University of Oxford, United Kingdom, Laboratory of Innovation on Digital Humanities, National Distance Education University UNED, Spain, Computer Science and Engineering Department , Faculty of Automatic Control and Computers, University Politehnica of Bucharest, Romania -
dc.description.allpeople Khan, ANAS FAHAD ASLAM; Chiarcos, Christian; Declerck, Thierry; Gifu, Daniela; González-Blanco García, Elena; Gracia, Jorge; Ionov, Maxim; Labropoulou, Penny; Mambrini, Francesco; Mccrae, John P.; Pagé-Perron, Émilie; Passarotti, Marco; Ros Muñoz, Salvador; Truic, Ciprian-Octavian -
dc.description.allpeopleoriginal Anas Fahad Khan, Christian Chiarcos, Thierry Declerck, Daniela Gifu, Elena González-Blanco García, Jorge Gracia, Maxim Ionov, Penny Labropoulou, Francesco Mambrini, John P. McCrae, Émilie Pagé-Perron, Marco Passarotti, Salvador Ros Muñoz, Ciprian-Octavian Truic en
dc.description.fulltext open en
dc.description.numberofauthors 14 -
dc.identifier.doi 10.3233/SW-222859 en
dc.identifier.isi WOS:000862910800005 -
dc.identifier.uri https://hdl.handle.net/20.500.14243/444090 -
dc.language.iso eng en
dc.subject.keywords Linguistic linked data -
dc.subject.keywords FAIR -
dc.subject.keywords corpora -
dc.subject.keywords annotation -
dc.subject.keywords language resources -
dc.subject.keywords OntoLex-Lemon -
dc.subject.keywords Digital Humanities -
dc.subject.keywords metadata -
dc.subject.keywords models -
dc.subject.keywords lexicon -
dc.subject.keywords language identification -
dc.subject.singlekeyword Linguistic linked data *
dc.subject.singlekeyword FAIR *
dc.subject.singlekeyword corpora *
dc.subject.singlekeyword annotation *
dc.subject.singlekeyword language resources *
dc.subject.singlekeyword OntoLex-Lemon *
dc.subject.singlekeyword Digital Humanities *
dc.subject.singlekeyword metadata *
dc.subject.singlekeyword models *
dc.subject.singlekeyword lexicon *
dc.subject.singlekeyword language identification *
dc.title When linguistics meets web technologies. Recent advances in modelling linguistic linked data en
dc.type.driver info:eu-repo/semantics/article -
dc.type.full 01 Contributo su Rivista::01.01 Articolo in rivista it
dc.type.miur 262 -
dc.type.referee Sì, ma tipo non specificato en
dc.ugov.descaux1 472142 -
iris.isi.extIssued 2022 -
iris.isi.extTitle When linguistics meets web technologies. Recent advances in modelling linguistic linked data -
iris.mediafilter.data 2025/04/03 04:16:32 *
iris.orcid.lastModifiedDate 2025/03/06 07:37:31 *
iris.orcid.lastModifiedMillisecond 1741243051705 *
iris.sitodocente.maxattempts 6 -
iris.unpaywall.bestoahost publisher *
iris.unpaywall.bestoaversion publishedVersion *
iris.unpaywall.doi 10.3233/sw-222859 *
iris.unpaywall.hosttype publisher *
iris.unpaywall.isoa true *
iris.unpaywall.journalisindoaj false *
iris.unpaywall.landingpage https://doi.org/10.3233/sw-222859 *
iris.unpaywall.license cc-by *
iris.unpaywall.metadataCallLastModified 26/04/2026 07:13:08 -
iris.unpaywall.metadataCallLastModifiedMillisecond 1777180388276 -
iris.unpaywall.oastatus hybrid *
iris.unpaywall.pdfurl https://content.iospress.com:443/download/semantic-web/sw222859?id=semantic-web%2Fsw222859 *
isi.authority.ancejournal SEMANTIC WEB###1570-0844 *
isi.category EX *
isi.category EP *
isi.category ET *
isi.contributor.affiliation Consiglio Nazionale delle Ricerche (CNR) -
isi.contributor.affiliation Goethe University Frankfurt -
isi.contributor.affiliation German Research Center for Artificial Intelligence (DFKI) -
isi.contributor.affiliation Alexandru Ioan Cuza University -
isi.contributor.affiliation IE University -
isi.contributor.affiliation University of Zaragoza -
isi.contributor.affiliation Goethe University Frankfurt -
isi.contributor.affiliation Institute for Language & Speech Processing (ILSP) -
isi.contributor.affiliation Catholic University of the Sacred Heart -
isi.contributor.affiliation Ollscoil na Gaillimhe-University of Galway -
isi.contributor.affiliation University of Oxford -
isi.contributor.affiliation Catholic University of the Sacred Heart -
isi.contributor.affiliation Universidad Nacional de Educacion a Distancia (UNED) -
isi.contributor.affiliation National University of Science & Technology POLITEHNICA Bucharest -
isi.contributor.country Italy -
isi.contributor.country Germany -
isi.contributor.country Germany -
isi.contributor.country Romania -
isi.contributor.country Spain -
isi.contributor.country Spain -
isi.contributor.country Germany -
isi.contributor.country Greece -
isi.contributor.country Italy -
isi.contributor.country Ireland -
isi.contributor.country England -
isi.contributor.country Italy -
isi.contributor.country Spain -
isi.contributor.country Romania -
isi.contributor.name Anas Fahad -
isi.contributor.name Christian -
isi.contributor.name Thierry -
isi.contributor.name Daniela -
isi.contributor.name Elena -
isi.contributor.name Jorge -
isi.contributor.name Maxim -
isi.contributor.name Penny -
isi.contributor.name Francesco -
isi.contributor.name John P. -
isi.contributor.name Emilie -
isi.contributor.name Marco -
isi.contributor.name Salvador -
isi.contributor.name Ciprian-Octavian -
isi.contributor.researcherId P-3751-2018 -
isi.contributor.researcherId FYU-1528-2022 -
isi.contributor.researcherId CNG-8934-2022 -
isi.contributor.researcherId D-1805-2015 -
isi.contributor.researcherId MJH-7842-2025 -
isi.contributor.researcherId J-5230-2013 -
isi.contributor.researcherId DXF-3797-2022 -
isi.contributor.researcherId GDI-6531-2022 -
isi.contributor.researcherId FYY-2827-2022 -
isi.contributor.researcherId P-8625-2016 -
isi.contributor.researcherId AAB-7440-2020 -
isi.contributor.researcherId KQJ-4882-2024 -
isi.contributor.researcherId C-4829-2015 -
isi.contributor.researcherId J-9536-2014 -
isi.contributor.subaffiliation Ist Linguist Computaz A Zampolli -
isi.contributor.subaffiliation Appl Computat Linguist Lab -
isi.contributor.subaffiliation Multilingual & Language Technol -
isi.contributor.subaffiliation Fac Comp Sci -
isi.contributor.subaffiliation Lab Innovat Digital Humanities -
isi.contributor.subaffiliation Aragon Inst Engn Res -
isi.contributor.subaffiliation Appl Computat Linguist Lab -
isi.contributor.subaffiliation Inst Language & Speech Proc -
isi.contributor.subaffiliation CIRCSE Res Ctr -
isi.contributor.subaffiliation Data Sci Inst -
isi.contributor.subaffiliation Wolfson Coll -
isi.contributor.subaffiliation CIRCSE Res Ctr -
isi.contributor.subaffiliation Lab Innovat Digital Humanities -
isi.contributor.subaffiliation Fac Automat Control & Comp -
isi.contributor.surname Khan -
isi.contributor.surname Chiarcos -
isi.contributor.surname Declerck -
isi.contributor.surname Gifu -
isi.contributor.surname Gonzalez-Blanco Garcia -
isi.contributor.surname Gracia -
isi.contributor.surname Ionov -
isi.contributor.surname Labropoulou -
isi.contributor.surname Mambrini -
isi.contributor.surname McCrae -
isi.contributor.surname Page-Perron -
isi.contributor.surname Passarotti -
isi.contributor.surname Ros Munoz -
isi.contributor.surname Truica -
isi.date.issued 2022 *
isi.description.abstracteng This article provides a comprehensive and up-to-date survey of models and vocabularies for creating linguistic linked data (LLD) focusing on the latest developments in the area and both building upon and complementing previous works covering similar territory. The article begins with an overview of some recent trends which have had a significant impact on linked data models and vocabularies. Next, we give a general overview of existing vocabularies and models for different categories of LLD resource. After which we look at some of the latest developments in community standards and initiatives including descriptions of recent work on the OntoLex-Lemon model, a survey of recent initiatives in linguistic annotation and LLD, and a discussion of the LLD metadata vocabularies META-SHARE and lime. In the next part of the paper, we focus on the influence of projects on LLD models and vocabularies, starting with a general survey of relevant projects, before dedicating individual sections to a number of recent projects and their impact on LLD vocabularies and models. Finally, in the conclusion, we look ahead at some future challenges for LLD models and vocabularies. The appendix to the paper consists of a brief introduction to the OntoLex-Lemon model. *
isi.description.allpeopleoriginal Khan, AF; Chiarcos, C; Declerck, T; Gifu, D; Garcia, EGB; Gracia, J; Ionov, M; Labropoulou, P; Mambrini, F; McCrae, JP; Pagé-Perron, É; Passarotti, M; Munoz, SR; Truica, CO; *
isi.document.sourcetype WOS.SCI *
isi.document.type Article *
isi.document.types Article *
isi.identifier.doi 10.3233/SW-222859 *
isi.identifier.eissn 2210-4968 *
isi.identifier.isi WOS:000862910800005 *
isi.journal.journaltitle SEMANTIC WEB *
isi.journal.journaltitleabbrev SEMANT WEB *
isi.language.original English *
isi.publisher.place 2455 TELLER RD, THOUSAND OAKS, CA 91320 USA *
isi.relation.firstpage 987 *
isi.relation.issue 6 *
isi.relation.lastpage 1050 *
isi.relation.volume 13 *
isi.title When linguistics meets web technologies. Recent advances in modelling linguistic linked data *
Appare nelle tipologie: 01.01 Articolo in rivista
File in questo prodotto:
File Dimensione Formato  
swj2859.pdf

accesso aperto

Licenza: Creative commons
Dimensione 1.02 MB
Formato Adobe PDF
1.02 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/444090
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? 11
social impact