Extending the Semantic Layer of the CompL-it Italian Lexicon: Traits, Semantic Types, and Definitions

Giovannetti, Emiliano (first author); Bellandi, Andrea; Marchi, Simone; Papini, Mafalda
2026

Abstract

The growing impact of Large Language Models has highlighted the need for explicit, interpretable linguistic knowledge. Lexical resources respond to this need by offering structured representations that complement and constrain the implicit semantics of neural models. This paper presents an extension of CompL-it, currently the most comprehensive open computational lexicon of Italian. Building on the semantic layer inherited from LexicO—itself derived from the PAROLE-SIMPLE-CLIPS resource—the work enriches CompL-it with semantic traits and references to semantic types. Moreover, an experiment was conducted to generate missing definitions through an automatic process supported by LLMs. The resulting resource thus combines human-curated and machine-extended knowledge, ensuring both linguistic precision and scalability. This enriched semantic layer enhances CompL-it’s interoperability within the Linguistic Linked Data framework and strengthens its usability for NLP tasks such as word sense disambiguation, semantic role labelling, and knowledge grounding.
DC Field Value Language
dc.authority.orgunit Istituto di linguistica computazionale "Antonio Zampolli" - ILC en
dc.authority.people Giovannetti, Emiliano en
dc.authority.people Bellandi, Andrea en
dc.authority.people Marchi, Simone en
dc.authority.people Papini, Mafalda en
dc.collection.id.s 71c7200a-7c5f-4e83-8d57-d3d2ba88f40d *
dc.collection.name 04.01 Contributo in Atti di convegno *
dc.contributor.appartenenza Istituto di linguistica computazionale "Antonio Zampolli" - ILC *
dc.contributor.appartenenza.mi 918 *
dc.contributor.area Non assegn *
dc.contributor.area Non assegn *
dc.contributor.area Non assegn *
dc.contributor.area Non assegn *
dc.date.firstsubmission 2026/05/03 12:06:11 *
dc.date.issued 2026 -
dc.date.submission 2026/05/21 09:49:25 *
dc.description.abstracteng The growing impact of Large Language Models has highlighted the need for explicit, interpretable linguistic knowledge. Lexical resources respond to this need by offering structured representations that complement and constrain the implicit semantics of neural models. This paper presents an extension of CompL-it, currently the most comprehensive open computational lexicon of Italian. Building on the semantic layer inherited from LexicO—itself derived from the PAROLE-SIMPLE-CLIPS resource—the work enriches CompL-it with semantic traits and references to semantic types. Moreover, an experiment was conducted to generate missing definitions through an automatic process supported by LLMs. The resulting resource thus combines human-curated and machine-extended knowledge, ensuring both linguistic precision and scalability. This enriched semantic layer enhances CompL-it’s interoperability within the Linguistic Linked Data framework and strengthens its usability for NLP tasks such as word sense disambiguation, semantic role labelling, and knowledge grounding. -
dc.description.allpeople Giovannetti, Emiliano; Bellandi, Andrea; Marchi, Simone; Papini, Mafalda -
dc.description.allpeopleoriginal Giovannetti, Emiliano; Bellandi, Andrea; Marchi, Simone; Papini, Mafalda en
dc.description.fulltext none en
dc.description.numberofauthors 4 -
dc.identifier.doi 10.63317/3rvf2vbt4ier en
dc.identifier.isbn 978-2-493814-49-4 en
dc.identifier.source orcid *
dc.identifier.uri https://hdl.handle.net/20.500.14243/579201 -
dc.language.iso eng en
dc.relation.conferencedate 11-16 May 2026 en
dc.relation.conferencename Fifteenth Language Resources and Evaluation Conference (LREC 2026) en
dc.relation.conferenceplace Palma, Mallorca, Spain en
dc.relation.firstpage 7857 en
dc.relation.ispartofbook Proceedings of the Fifteenth Language Resources and Evaluation Conference (LREC 2026) en
dc.relation.lastpage 7866 en
dc.relation.numberofpages 10 en
dc.subject.keywordseng computational lexicon, Linguistic Linked Open Data, OntoLex Lemon, Large Language Models, Word sense definitions, semantic enrichment -
dc.subject.singlekeyword computational lexicon *
dc.subject.singlekeyword Linguistic Linked Open Data *
dc.subject.singlekeyword OntoLex Lemon *
dc.subject.singlekeyword Large Language Models *
dc.subject.singlekeyword Word sense definitions *
dc.subject.singlekeyword semantic enrichment *
dc.title Extending the Semantic Layer of the CompL-it Italian Lexicon: Traits, Semantic Types, and Definitions en
dc.type.circulation International en
dc.type.driver info:eu-repo/semantics/conferenceObject -
dc.type.full 04 Contributo in convegno::04.01 Contributo in Atti di convegno it
dc.type.miur 273 -
iris.orcid.lastModifiedDate 2026/05/21 09:49:25 *
iris.orcid.lastModifiedMillisecond 1779349765840 *
iris.sitodocente.maxattempts 1 -
iris.unpaywall.doi 10.63317/3rvf2vbt4ier *
iris.unpaywall.isoa false *
iris.unpaywall.journalisindoaj false *
iris.unpaywall.metadataCallLastModified 22/05/2026 04:47:08 -
iris.unpaywall.metadataCallLastModifiedMillisecond 1779418028679 -
iris.unpaywall.oastatus closed *
Files in this item:
No files are associated with this item.

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/20.500.14243/579201