This paper presents the integration of CompL-it, a Linked Open Data (LOD) computational lexicon for contemporary Italian, into LiITA (Linking Italian), a Knowledge Base (KB) designed for linguistic interoperability. CompL-it contains over 101k lexical entries enriched with detailed morphological and semantic information, derived from multiple authoritative sources and modelled using the OntoLex-Lemon vocabulary. The linking process involved aligning lexical entries with lemmas in the LiITA’s Lemma Bank (LB), addressing both exact and ambiguous matches through systematic and semantically informed strategies. Moreover, 12,739 new lemmas were added to the LiITA LB. This integration enhances the expressiveness and interoperability of LiITA, enabling complex SPARQL queries that exploit the semantic network encoded in CompL-it. Examples are provided to demonstrate the advantages of querying interlinked resources.

Linking CompL-it to the LiITA Knowledge Base

Emiliano Giovannetti
;
Simone Marchi;Andrea Bellandi;Flavia Sciolette
2025

Abstract

This paper presents the integration of CompL-it, a Linked Open Data (LOD) computational lexicon for contemporary Italian, into LiITA (Linking Italian), a Knowledge Base (KB) designed for linguistic interoperability. CompL-it contains over 101k lexical entries enriched with detailed morphological and semantic information, derived from multiple authoritative sources and modelled using the OntoLex-Lemon vocabulary. The linking process involved aligning lexical entries with lemmas in the LiITA’s Lemma Bank (LB), addressing both exact and ambiguous matches through systematic and semantically informed strategies. Moreover, 12,739 new lemmas were added to the LiITA LB. This integration enhances the expressiveness and interoperability of LiITA, enabling complex SPARQL queries that exploit the semantic network encoded in CompL-it. Examples are provided to demonstrate the advantages of querying interlinked resources.
Campo DC Valore Lingua
dc.authority.orgunit Istituto di linguistica computazionale "Antonio Zampolli" - ILC en
dc.authority.people Eleonora Litta en
dc.authority.people Marco Passarotti en
dc.authority.people Giovanni Moretti en
dc.authority.people Paolo Brasolin en
dc.authority.people Francesco Mambrini en
dc.authority.people Valerio Basile en
dc.authority.people Cristina Bosco en
dc.authority.people Andrea Di Fabio en
dc.authority.people Eliana Di Palma en
dc.authority.people Emiliano Giovannetti en
dc.authority.people Simone Marchi en
dc.authority.people Andrea Bellandi en
dc.authority.people Flavia Sciolette en
dc.collection.id.s 71c7200a-7c5f-4e83-8d57-d3d2ba88f40d *
dc.collection.name 04.01 Contributo in Atti di convegno *
dc.contributor.appartenenza Istituto di Scienze e Tecnologie della Cognizione - ISTC *
dc.contributor.appartenenza Istituto di linguistica computazionale "Antonio Zampolli" - ILC *
dc.contributor.appartenenza.mi 918 *
dc.contributor.appartenenza.mi 986 *
dc.contributor.area Non assegn *
dc.contributor.area Non assegn *
dc.contributor.area Non assegn *
dc.date.accessioned 2026/03/03 15:25:40 -
dc.date.available 2026/03/03 15:25:40 -
dc.date.firstsubmission 2025/10/15 18:14:14 *
dc.date.issued 2025 -
dc.date.submission 2026/02/13 15:54:30 *
dc.description.abstracteng This paper presents the integration of CompL-it, a Linked Open Data (LOD) computational lexicon for contemporary Italian, into LiITA (Linking Italian), a Knowledge Base (KB) designed for linguistic interoperability. CompL-it contains over 101k lexical entries enriched with detailed morphological and semantic information, derived from multiple authoritative sources and modelled using the OntoLex-Lemon vocabulary. The linking process involved aligning lexical entries with lemmas in the LiITA’s Lemma Bank (LB), addressing both exact and ambiguous matches through systematic and semantically informed strategies. Moreover, 12,739 new lemmas were added to the LiITA LB. This integration enhances the expressiveness and interoperability of LiITA, enabling complex SPARQL queries that exploit the semantic network encoded in CompL-it. Examples are provided to demonstrate the advantages of querying interlinked resources. -
dc.description.allpeople Litta, Eleonora; Passarotti, Marco; Moretti, Giovanni; Brasolin, Paolo; Mambrini, Francesco; Basile, Valerio; Bosco, Cristina; Di Fabio, Andrea; Di Palma, Eliana; Giovannetti, Emiliano; Marchi, Simone; Bellandi, Andrea; Sciolette, Flavia -
dc.description.allpeopleoriginal Eleonora Litta, Marco Passarotti, Giovanni Moretti, Paolo Brasolin, Francesco Mambrini, Valerio Basile, Cristina Bosco, Andrea Di Fabio, Eliana Di Palma, Emiliano Giovannetti, Simone Marchi, Andrea Bellandi, Flavia Sciolette en
dc.description.fulltext open en
dc.description.numberofauthors 13 -
dc.identifier.source manual *
dc.identifier.uri https://hdl.handle.net/20.500.14243/555245 -
dc.identifier.url https://aclanthology.org/2025.clicit-1.57.pdf en
dc.language.iso eng en
dc.publisher.name CEUR Workshop Proceedings en
dc.relation.allauthors Cristina Bosco, Elisabetta Jezek, Marco Polignano, Manuela Sanguinetti en
dc.relation.conferencedate 24-26 settembre 2025 en
dc.relation.conferencename CLiC-it 2025 Italian Conference on Computational Linguistics en
dc.relation.conferenceplace Cagliari en
dc.relation.ispartofbook Proceedings of the Eleventh Italian Conference on Computational Linguistics (CLiC-it 2025) en
dc.relation.medium ELETTRONICO en
dc.subject.keywordseng Linked Open Data, Italian, language resources -
dc.subject.singlekeyword Linked Open Data *
dc.subject.singlekeyword Italian *
dc.subject.singlekeyword language resources *
dc.title Linking CompL-it to the LiITA Knowledge Base en
dc.type.circulation Internazionale en
dc.type.driver info:eu-repo/semantics/conferenceObject -
dc.type.full 04 Contributo in convegno::04.01 Contributo in Atti di convegno it
dc.type.miur 273 -
iris.mediafilter.data 2026/03/04 02:52:28 *
iris.orcid.lastModifiedDate 2026/03/03 15:25:40 *
iris.orcid.lastModifiedMillisecond 1772547940541 *
iris.sitodocente.maxattempts 1 -
Appare nelle tipologie: 04.01 Contributo in Atti di convegno
File in questo prodotto:
File Dimensione Formato  
56_main_long.pdf

accesso aperto

Descrizione: articolo
Tipologia: Documento in Pre-print
Licenza: Creative commons
Dimensione 784.83 kB
Formato Adobe PDF
784.83 kB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/555245
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact