This paper describes CompL-it, a new open computational lexicon for contemporary Italian. The resource was constructed from three sources: an already available Italian lexicon, a lemmatized list of inflected forms obtained from a morphological analyzer, and a set of treebanks. Integrating these resources required a standardisation process in accordance with the standards of the Linguistic Linked Open Data community, which was necessary for the subsequent conversion into the OntoLex-Lemon model. The resulting computational lexicon comprises approximately 100,000 lexical entries, 790,000 forms, 57,000 senses, and 86,000 semantic relations. The lexicon, thanks to its rich and articulated linguistic structure, can be used, as shown, to enhance information retrieval in the context of full-text search tasks.

CompL-it: a Computational Lexicon of Italian Language

Flavia Sciolette
Primo
;
Andrea Bellandi;Emiliano Giovannetti
;
Simone Marchi
2024

Abstract

This paper describes CompL-it, a new open computational lexicon for contemporary Italian. The resource was constructed from three sources: an already available Italian lexicon, a lemmatized list of inflected forms obtained from a morphological analyzer, and a set of treebanks. Integrating these resources required a standardisation process in accordance with the standards of the Linguistic Linked Open Data community, which was necessary for the subsequent conversion into the OntoLex-Lemon model. The resulting computational lexicon comprises approximately 100,000 lexical entries, 790,000 forms, 57,000 senses, and 86,000 semantic relations. The lexicon, thanks to its rich and articulated linguistic structure, can be used, as shown, to enhance information retrieval in the context of full-text search tasks.
Campo DC Valore Lingua
dc.authority.ancejournal AIDA INFORMAZIONI en
dc.authority.orgunit Istituto di linguistica computazionale "Antonio Zampolli" - ILC en
dc.authority.people Flavia Sciolette en
dc.authority.people Andrea Bellandi en
dc.authority.people Emiliano Giovannetti en
dc.authority.people Simone Marchi en
dc.collection.id.s b3f88f24-048a-4e43-8ab1-6697b90e068e *
dc.collection.name 01.01 Articolo in rivista *
dc.contributor.appartenenza Istituto di linguistica computazionale "Antonio Zampolli" - ILC *
dc.contributor.appartenenza.mi 918 *
dc.contributor.area Non assegn *
dc.contributor.area Non assegn *
dc.contributor.area Non assegn *
dc.contributor.area Non assegn *
dc.date.accessioned 2024/12/18 16:37:25 -
dc.date.available 2024/12/18 16:37:25 -
dc.date.firstsubmission 2024/12/18 16:02:35 *
dc.date.issued 2024 -
dc.date.submission 2025/03/03 16:22:19 *
dc.description.abstracteng This paper describes CompL-it, a new open computational lexicon for contemporary Italian. The resource was constructed from three sources: an already available Italian lexicon, a lemmatized list of inflected forms obtained from a morphological analyzer, and a set of treebanks. Integrating these resources required a standardisation process in accordance with the standards of the Linguistic Linked Open Data community, which was necessary for the subsequent conversion into the OntoLex-Lemon model. The resulting computational lexicon comprises approximately 100,000 lexical entries, 790,000 forms, 57,000 senses, and 86,000 semantic relations. The lexicon, thanks to its rich and articulated linguistic structure, can be used, as shown, to enhance information retrieval in the context of full-text search tasks. -
dc.description.allpeople Sciolette, Flavia; Bellandi, Andrea; Giovannetti, Emiliano; Marchi, Simone -
dc.description.allpeopleoriginal Flavia Sciolette, Andrea Bellandi, Emiliano Giovannetti, Simone Marchi en
dc.description.fulltext open en
dc.description.note codice DOI: 10.57574/596545646 (il sistema IRIS non riesce a validarlo) en
dc.description.numberofauthors 4 -
dc.identifier.source manual *
dc.identifier.uri https://hdl.handle.net/20.500.14243/519970 -
dc.identifier.url https://www.aidainformazioni.it/index.php/aidainformazioni/article/view/315 en
dc.language.iso eng en
dc.relation.firstpage 119 en
dc.relation.issue 3-4 en
dc.relation.lastpage 148 en
dc.relation.medium STAMPA en
dc.relation.numberofpages 30 en
dc.relation.volume 42 en
dc.subject.keywordseng Computational Lexicon, Linguistic Resources, Linguistic Linked Open Data, OntoLex-Lemon, Information Retrieval -
dc.subject.singlekeyword Computational Lexicon *
dc.subject.singlekeyword Linguistic Resources *
dc.subject.singlekeyword Linguistic Linked Open Data *
dc.subject.singlekeyword OntoLex-Lemon *
dc.subject.singlekeyword Information Retrieval *
dc.title CompL-it: a Computational Lexicon of Italian Language en
dc.type.driver info:eu-repo/semantics/article -
dc.type.full 01 Contributo su Rivista::01.01 Articolo in rivista it
dc.type.impactfactor si en
dc.type.miur 262 -
dc.type.referee Esperti anonimi en
iris.mediafilter.data 2025/03/30 03:18:13 *
iris.orcid.lastModifiedDate 2025/05/23 10:03:00 *
iris.orcid.lastModifiedMillisecond 1747987380434 *
iris.sitodocente.maxattempts 1 -
Appare nelle tipologie: 01.01 Articolo in rivista
File in questo prodotto:
File Dimensione Formato  
CompL-it_ a Computational Lexicon of Italian Language.pdf

accesso aperto

Tipologia: Versione Editoriale (PDF)
Licenza: Creative commons
Dimensione 1.31 MB
Formato Adobe PDF
1.31 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/519970
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact