
Updates to the Database of the AraMorph Morphological Engine (Aggiornamenti banca dati del Motore morfologico Aramorph)

Ouafae Nahli
2015

Abstract

AraMorph has two main components: a rule engine for morphological analysis and a repository of linguistic resources consisting mainly of three lexicons: i) the dictStems lexicon, which contains 38,600 lemmas; ii) the dictPrefixes lexicon, which consists of sequences of proclitics and inflectional prefixes; iii) the dictSuffixes lexicon, which consists of sequences of inflectional suffixes and enclitics. These lexicons are accompanied by three compatibility tables used to check combinations of A (proclitics + prefixes), B (stems) and C (suffixes + enclitics). To reduce overgeneration of Arabic parses, further restrictions must be enforced in the compatibility tables, e.g. on a verb's ability to accept nominative and accusative pronouns and to select a rational subject. We therefore augmented the verb entries with subcategorization information, such as case assignment and the restriction to rational subjects, and updated the compatibility tables accordingly.
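The compatibility check described in the abstract can be illustrated with a minimal Python sketch. It is not the AraMorph implementation: the lexicon entries, the category labels (`PV-trans`, `PV-intr`, `PVSuff-subj+obj`, and so on) and the table names are hypothetical, chosen only to show how a finer-grained verb category in the stem/suffix table could block an analysis that a coarser table would let through.

```python
from itertools import product
from typing import NamedTuple


class Entry(NamedTuple):
    form: str  # surface string in a Buckwalter-style transliteration
    cat: str   # category label looked up in the compatibility tables


# Toy lexicons: every entry and category name here is a made-up example.
PREFIXES = [Entry("", "Pref-0"), Entry("wa", "Pref-wa")]
STEMS = [Entry("katab", "PV-trans"), Entry("nAm", "PV-intr")]
SUFFIXES = [Entry("a", "PVSuff-subj"), Entry("ahu", "PVSuff-subj+obj")]

# Compatibility tables stored as sets of allowed category pairs.
# A-B and A-C allow everything in this toy setup; B-C carries the restriction.
TABLE_AB = {(a.cat, b.cat) for a, b in product(PREFIXES, STEMS)}
TABLE_AC = {(a.cat, c.cat) for a, c in product(PREFIXES, SUFFIXES)}
TABLE_BC = {
    ("PV-trans", "PVSuff-subj"),
    ("PV-trans", "PVSuff-subj+obj"),
    ("PV-intr", "PVSuff-subj"),  # no object pronoun on an intransitive verb
}


def analyze(word: str) -> list[tuple[Entry, Entry, Entry]]:
    """Return every prefix + stem + suffix split licensed by all three tables."""
    analyses = []
    for a, b, c in product(PREFIXES, STEMS, SUFFIXES):
        if a.form + b.form + c.form != word:
            continue  # this combination does not spell the input word
        if ((a.cat, b.cat) in TABLE_AB
                and (a.cat, c.cat) in TABLE_AC
                and (b.cat, c.cat) in TABLE_BC):
            analyses.append((a, b, c))
    return analyses
```

With these toy tables, `analyze("wakatabahu")` yields the single split `wa + katab + ahu`, while `analyze("nAmahu")` returns an empty list because the pair `("PV-intr", "PVSuff-subj+obj")` is absent from `TABLE_BC`. A restriction such as the rational-subject requirement could be encoded the same way, by splitting a category and listing only the licensed pairs.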
DC field Value Language
dc.authority.orgunit Istituto di linguistica computazionale "Antonio Zampolli" - ILC -
dc.authority.people Ouafae Nahli it
dc.authority.project Greek into Arabic: Philosophical Concepts and Linguistic Bridges -
dc.collection.id.s 9b78cb77-0866-4cb5-8ca6-af14a97a08ef *
dc.collection.name 11.04 Database *
dc.contributor.appartenenza Istituto di linguistica computazionale "Antonio Zampolli" - ILC *
dc.contributor.appartenenza.mi 918 *
dc.date.accessioned 2024/02/17 19:05:27 -
dc.date.available 2024/02/17 19:05:27 -
dc.date.issued 2015 -
dc.description.affiliations Istituto Linguistica Computazionale "A. Zampolli" (ILC-CNR) -
dc.description.allpeople Ouafae Nahli -
dc.description.allpeopleoriginal Ouafae Nahli -
dc.description.fulltext none en
dc.description.numberofauthors 1 -
dc.identifier.uri https://hdl.handle.net/20.500.14243/370088 -
dc.identifier.url http://hdl.handle.net/20.500.11752/ILC-94 -
dc.language.iso ara -
dc.relation.projectAcronym GREEK INTO ARABIC -
dc.relation.projectAwardNumber 249431 -
dc.relation.projectAwardTitle Greek into Arabic: Philosophical Concepts and Linguistic Bridges -
dc.relation.projectFunderName - en
dc.relation.projectFundingStream FP7 -
dc.subject.keywords morpho-syntactic analysis -
dc.subject.keywords Arabic language -
dc.subject.keywords AraMorph -
dc.subject.singlekeyword morpho-syntactic analysis *
dc.subject.singlekeyword Arabic language *
dc.subject.singlekeyword AraMorph *
dc.title Aggiornamenti banca dati del Motore morfologico Aramorph en
dc.type.driver info:eu-repo/semantics/other -
dc.type.full 11 Application or multimedia product::11.04 Database it
dc.type.miur 295 -
dc.ugov.descaux1 390727 -
iris.orcid.lastModifiedDate 2024/03/01 12:45:47 *
iris.orcid.lastModifiedMillisecond 1709293547611 *
iris.sitodocente.maxattempts 1 -
Appears in types: 11.04 Database
Files in this product:
There are no files associated with this product.

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/20.500.14243/370088