The present work illustrates the first steps towards the construction of a new computational lexicon for the Italian language. Following an analysis of existing lexical resources, it was decided to use LexicO as the reference base. In this first phase a resource of nearly 800,000 inflected forms was produced, accompanied by lemmas and morphological traits, obtained by integrating the available data in LexicO with those coming from two support sources: the tool MAGIC and a selection of Italian treebanks.
Towards a New Computational Lexicon for Italian: building the morphological layer by harmonizing and merging existing resources
Flavia Sciolette;Simone Marchi;Emiliano Giovannetti
2023
Abstract
The present work illustrates the first steps towards the construction of a new computational lexicon for the Italian language. Following an analysis of existing lexical resources, it was decided to use LexicO as the reference base. In this first phase a resource of nearly 800,000 inflected forms was produced, accompanied by lemmas and morphological traits, obtained by integrating the available data in LexicO with those coming from two support sources: the tool MAGIC and a selection of Italian treebanks.File in questo prodotto:
File | Dimensione | Formato | |
---|---|---|---|
prod_491771-doc_205137.pdf
accesso aperto
Descrizione: Towards a New Computational Lexicon for Italian: building the morphological layer by harmonizing and merging existing resources
Tipologia:
Versione Editoriale (PDF)
Licenza:
Creative commons
Dimensione
256.23 kB
Formato
Adobe PDF
|
256.23 kB | Adobe PDF | Visualizza/Apri |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.