This article aims at formally grouping lexical and morphological information in order to obtain an electronic resource with respect to the Arabic language starting from the classical dictionary al=q?m?s al=mu???. This contribution examines practical aspects about the adoption of the guidelines provided by the Text Encoding Initiative (TEI) to encode the Arabic dictionary as a primary source. Moreover, the contribution points out a possible way to integrate semantic, morphological and syntactic information characterizing word patterns within the same TEI document. Specifically, the formalization of word patterns allows us to emphasize additional morphosyntactic regularities mainly concerning word distribution within sentences. Consequently, the obtained digital object represents both the medieval Arabic dictionary and a suitable resource that can be exploited for a number of Natural Language Processing tasks.

Creating Arabic Lexical Resources in TEI; A Schema for Discontinuous Morphology Encoding

Ouafae Nahli;Angelo Mario Del Grosso
2020

Abstract

This article aims at formally grouping lexical and morphological information in order to obtain an electronic resource with respect to the Arabic language starting from the classical dictionary al=q?m?s al=mu???. This contribution examines practical aspects about the adoption of the guidelines provided by the Text Encoding Initiative (TEI) to encode the Arabic dictionary as a primary source. Moreover, the contribution points out a possible way to integrate semantic, morphological and syntactic information characterizing word patterns within the same TEI document. Specifically, the formalization of word patterns allows us to emphasize additional morphosyntactic regularities mainly concerning word distribution within sentences. Consequently, the obtained digital object represents both the medieval Arabic dictionary and a suitable resource that can be exploited for a number of Natural Language Processing tasks.
2020
Istituto di linguistica computazionale "Antonio Zampolli" - ILC
978-1-7281-6646-9
classical Arabic dictionary
digital lexicography
al=q?m?s al=mu???
word patterns
TEI
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/394192
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact