LexicO: an Italian Computational Lexicon derived from Parole-Simple-Clips

Sciolette Flavia; Marchi Simone; Giovannetti Emiliano
2023

Abstract

Parole-Simple-Clips (PSC) is a computational lexicon of the Italian language, developed from 1996 to 2003 by the Institute of Computational Linguistics of the Italian National Research Council (ILC-CNR) in the context of national and European projects. The PSC resource is strongly structured and rich in data, and its features may provide an edge in support of text retrieval tasks such as full-text search. However, the lexicon still appears incomplete, with some data redundant, erroneous, or missing. This paper documents the first steps undertaken to create LexicO, an Italian computational lexicon built upon PSC, starting from an in-depth analysis of the four linguistic layers (semantic, syntactic, morphological, and phonological) in which it is structured. As a result of this work, LexicO has been released and made freely available.
DC field Value Language
dc.authority.ancejournal UMANISTICA DIGITALE en
dc.authority.orgunit Istituto di linguistica computazionale "Antonio Zampolli" - ILC en
dc.authority.people Sciolette Flavia en
dc.authority.people Marchi Simone en
dc.authority.people Giovannetti Emiliano en
dc.collection.id.s b3f88f24-048a-4e43-8ab1-6697b90e068e *
dc.collection.name 01.01 Journal article *
dc.contributor.appartenenza Istituto di linguistica computazionale "Antonio Zampolli" - ILC *
dc.contributor.appartenenza.mi 918 *
dc.contributor.area Not assigned *
dc.contributor.area Not assigned *
dc.date.accessioned 2024/02/21 08:30:13 -
dc.date.available 2024/02/21 08:30:13 -
dc.date.firstsubmission 2025/01/28 11:03:33 *
dc.date.issued 2023 -
dc.date.submission 2025/01/28 11:03:33 *
dc.description.affiliations Istituto di Linguistica Computazionale "A. Zampolli", CNR, Pisa, Italia -
dc.description.allpeople Sciolette, Flavia; Marchi, Simone; Giovannetti, Emiliano -
dc.description.allpeopleoriginal Sciolette Flavia, Marchi Simone, Giovannetti Emiliano, en
dc.description.fulltext open en
dc.description.numberofauthors 3 -
dc.identifier.doi 10.6092/issn.2532-8816/15176 en
dc.identifier.scopus 2-s2.0-85165875733 -
dc.identifier.uri https://hdl.handle.net/20.500.14243/462840 -
dc.language.iso eng en
dc.miur.last.status.update 2024-10-10T12:19:30Z *
dc.relation.numberofpages 24 en
dc.relation.volume 15 en
dc.subject.keywords Computational Lexicon -
dc.subject.keywords Parole-Simple-Clips -
dc.subject.keywords Linguistic Resources -
dc.subject.keywords Full-text Search -
dc.subject.keywords LexicO -
dc.subject.singlekeyword Computational Lexicon *
dc.subject.singlekeyword Parole-Simple-Clips *
dc.subject.singlekeyword Linguistic Resources *
dc.subject.singlekeyword Full-text Search *
dc.subject.singlekeyword LexicO *
dc.title LexicO: an Italian Computational Lexicon derived from Parole-Simple-Clips en
dc.type.driver info:eu-repo/semantics/article -
dc.type.full 01 Contributo su Rivista::01.01 Articolo in rivista it
dc.type.miur 262 -
dc.type.referee Yes, but type not specified en
dc.ugov.descaux1 484301 -
scopus.authority.ancejournal UMANISTICA DIGITALE###2532-8816 *
scopus.category 1701 *
scopus.category 1200 *
scopus.category 1706 *
scopus.category 3309 *
scopus.contributor.affiliation CNR -
scopus.contributor.affiliation CNR -
scopus.contributor.affiliation CNR -
scopus.contributor.afid 60008941 -
scopus.contributor.afid 60008941 -
scopus.contributor.afid 60008941 -
scopus.contributor.auid 57373961400 -
scopus.contributor.auid 27567818000 -
scopus.contributor.auid 55604835100 -
scopus.contributor.country Italy -
scopus.contributor.country Italy -
scopus.contributor.country Italy -
scopus.contributor.dptid -
scopus.contributor.dptid -
scopus.contributor.dptid -
scopus.contributor.name Flavia -
scopus.contributor.name Simone -
scopus.contributor.name Emiliano -
scopus.contributor.subaffiliation Istituto di Linguistica Computazionale “A. Zampolli”; -
scopus.contributor.subaffiliation Istituto di Linguistica Computazionale “A. Zampolli”; -
scopus.contributor.subaffiliation Istituto di Linguistica Computazionale “A. Zampolli”; -
scopus.contributor.surname Sciolette -
scopus.contributor.surname Marchi -
scopus.contributor.surname Giovannetti -
scopus.date.issued 2023 *
scopus.description.allpeopleoriginal Sciolette F.; Marchi S.; Giovannetti E. *
scopus.document.type ar *
scopus.document.types ar *
scopus.identifier.doi 10.6092/issn.2532-8816/15176 *
scopus.identifier.eissn 2532-8816 *
scopus.identifier.pui 2024899155 *
scopus.identifier.scopus 2-s2.0-85165875733 *
scopus.journal.sourceid 21101060465 *
scopus.language.iso eng *
scopus.publisher.name University of Bologna Department of Classical and Italian Philology, Alma Mater Studiorum *
scopus.relation.firstpage 169 *
scopus.relation.issue 15 *
scopus.relation.lastpage 193 *
scopus.relation.volume 2023 *
scopus.subject.keywords Computational Lexicon; Full-text Search; LexicO; Linguistic Resources; Parole-Simple-Clips; *
scopus.title LexicO: an Italian Computational Lexicon derived from Parole-Simple-Clips *
scopus.titleeng LexicO: an Italian Computational Lexicon derived from Parole-Simple-Clips *
Appears in types: 01.01 Journal article
Files in this item:
15176-sciolette.pdf

open access

Type: Publisher's version (PDF)
License: Creative Commons
Size: 697.4 kB
Format: Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/20.500.14243/462840