<?xml version="1.0" encoding="UTF-8"?><?xml-stylesheet type="text/xsl" href="static/CINECAstyle.xsl"?><OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://www.openarchives.org/OAI/2.0/ http://www.openarchives.org/OAI/2.0/OAI-PMH.xsd"><responseDate>2026-06-13T00:30:05Z</responseDate><request verb="GetRecord" identifier="oai:iris.cnr.it:20.500.14243/566042" metadataPrefix="oai_dc">https://iris.cnr.it/oai/request</request><GetRecord><record><header><identifier>oai:iris.cnr.it:20.500.14243/566042</identifier><datestamp>2026-05-26T14:06:32Z</datestamp><setSpec>com_20.500.14243_46</setSpec><setSpec>com_20.500.14243_21</setSpec><setSpec>col_20.500.14243_47</setSpec><setSpec>ou_ou239</setSpec></header><metadata><oai_dc:dc xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/" xmlns:doc="http://www.lyncode.com/xoai" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:dc="http://purl.org/dc/elements/1.1/" xsi:schemaLocation="http://www.openarchives.org/OAI/2.0/oai_dc/ http://www.openarchives.org/OAI/2.0/oai_dc.xsd">
<dc:title>A Robust Morphological Analysis System for the Moroccan Dialect</dc:title>
<dc:creator>Khlif, Nadia</dc:creator>
<dc:creator>Mazroui, Azzedine</dc:creator>
<dc:creator>Nahli, Ouafae</dc:creator>
<dc:contributor>Azrour, Mourade</dc:contributor>
<dc:contributor>Guezzaz, Azidine</dc:contributor>
<dc:contributor>Jabbour, Said</dc:contributor>
<dc:contributor>Khlif, Nadia</dc:contributor>
<dc:contributor>Mazroui, Azzedine</dc:contributor>
<dc:contributor>Nahli, Ouafae</dc:contributor>
<dc:subject>Morphological engine, DiMorph, Moroccan dialect, Multiword expressions, Darija, Text processing</dc:subject>
<dc:description>This work presents DiMorph, a morphological engine for Moroccan Arabic (Darija) that integrates custom pre- and post-processing techniques to address orthographic inconsistency and the lack of standardization. A key feature of DiMorph is its multiword expression (MWE) recognition module, which detects and processes MWEs against a predefined lexicon, leading to more accurate gloss generation. Tested on a Facebook corpus of 11,085 tokens, DiMorph achieved 97.84% in-vocabulary (INV) coverage, with an out-of-vocabulary (OOV) rate of 2.16% consisting mostly of foreign terms, proper names and emerging words. Overall, 40.48% of tokens had a single interpretation, while 59.52% were ambiguous; this ambiguity was attributable to homography (89.71%), polysemy (9.31%) and morphological syncretism (0.98%). By combining robust morphological analysis with MWE handling, DiMorph improves Darija text processing. Its linguistic resources will be released as open source to foster further advances in natural language processing (NLP) for Arabic dialects.</dc:description>
<dc:date>2026</dc:date>
<dc:type>info:eu-repo/semantics/conferenceObject</dc:type>
<dc:identifier>https://hdl.handle.net/20.500.14243/566042</dc:identifier>
<dc:identifier>10.1201/9781003671602</dc:identifier>
<dc:relation>info:eu-repo/semantics/altIdentifier/isbn/9781003671602</dc:relation>
<dc:identifier>https://doi.org/10.1201/9781003671602</dc:identifier>
<dc:language>eng</dc:language>
<dc:relation>ispartofbook:Smart Technologies for a Sustainable Environment</dc:relation>
<dc:publisher>CRC Press – Taylor &amp; Francis Group</dc:publisher>
<dc:publisher>country:USA</dc:publisher>
<dc:publisher>place:Boca Raton</dc:publisher>
</oai_dc:dc></metadata></record></GetRecord></OAI-PMH>