AraMorph's components are essentially two: the rule engine for morphological analysis and a repository of linguistic resources mainly composed of three lexicons: i) the dictStems lexicon, which contains 38.600 lemmas; ii) the dictPrefixes lexicon, which consists of sequences of proclitics and inflectional prefixes; iii) the dictSuffixes lexicon, which consists of sequences of inflectional suffixes and enclitics. These lexica are accompanied by three compatibility tables used for checking combinations of A (proclitics+prefixes), B (stems) and C (suffixes+enclitics). To cut down on arabic parse overgeneration, one has to enforce further restrictions in compatibility tables, e.g. the verb's ability to accept nominative and accusative pronouns, and to select a rational subject. We then augmented verb entries with subcategorization information such as case assignment and the restriction on rational subjects. At the same time, it was necessary to update compatibility tables.
Aggiornamenti banca dati del Motore morfologico Aramorph
Ouafae Nahli
2015
Abstract
AraMorph's components are essentially two: the rule engine for morphological analysis and a repository of linguistic resources mainly composed of three lexicons: i) the dictStems lexicon, which contains 38.600 lemmas; ii) the dictPrefixes lexicon, which consists of sequences of proclitics and inflectional prefixes; iii) the dictSuffixes lexicon, which consists of sequences of inflectional suffixes and enclitics. These lexica are accompanied by three compatibility tables used for checking combinations of A (proclitics+prefixes), B (stems) and C (suffixes+enclitics). To cut down on arabic parse overgeneration, one has to enforce further restrictions in compatibility tables, e.g. the verb's ability to accept nominative and accusative pronouns, and to select a rational subject. We then augmented verb entries with subcategorization information such as case assignment and the restriction on rational subjects. At the same time, it was necessary to update compatibility tables.| Campo DC | Valore | Lingua |
|---|---|---|
| dc.authority.orgunit | Istituto di linguistica computazionale "Antonio Zampolli" - ILC | - |
| dc.authority.people | Ouafae Nahli | it |
| dc.authority.project | Greek into Arabic: Philosophical Concepts and Linguistic Bridges | - |
| dc.collection.id.s | 9b78cb77-0866-4cb5-8ca6-af14a97a08ef | * |
| dc.collection.name | 11.04 Banca dati | * |
| dc.contributor.appartenenza | Istituto di linguistica computazionale "Antonio Zampolli" - ILC | * |
| dc.contributor.appartenenza.mi | 918 | * |
| dc.date.accessioned | 2024/02/17 19:05:27 | - |
| dc.date.available | 2024/02/17 19:05:27 | - |
| dc.date.issued | 2015 | - |
| dc.description.abstracteng | AraMorph's components are essentially two: the rule engine for morphological analysis and a repository of linguistic resources mainly composed of three lexicons: i) the dictStems lexicon, which contains 38.600 lemmas; ii) the dictPrefixes lexicon, which consists of sequences of proclitics and inflectional prefixes; iii) the dictSuffixes lexicon, which consists of sequences of inflectional suffixes and enclitics. These lexica are accompanied by three compatibility tables used for checking combinations of A (proclitics+prefixes), B (stems) and C (suffixes+enclitics). To cut down on arabic parse overgeneration, one has to enforce further restrictions in compatibility tables, e.g. the verb's ability to accept nominative and accusative pronouns, and to select a rational subject. We then augmented verb entries with subcategorization information such as case assignment and the restriction on rational subjects. At the same time, it was necessary to update compatibility tables. | - |
| dc.description.affiliations | Istituto Linguistica Computazionale "A. Zampolli" (ILC-CNR) | - |
| dc.description.allpeople | Ouafae Nahli | - |
| dc.description.allpeopleoriginal | Ouafae Nahli | - |
| dc.description.fulltext | none | en |
| dc.description.numberofauthors | 1 | - |
| dc.identifier.uri | https://hdl.handle.net/20.500.14243/370088 | - |
| dc.identifier.url | http://hdl.handle.net/20.500.11752/ILC-94 | - |
| dc.language.iso | ara | - |
| dc.relation.projectAcronym | GREEK INTO ARABIC | - |
| dc.relation.projectAwardNumber | 249431 | - |
| dc.relation.projectAwardTitle | Greek into Arabic: Philosophical Concepts and Linguistic Bridges | - |
| dc.relation.projectFunderName | - | en |
| dc.relation.projectFundingStream | FP7 | - |
| dc.subject.keywords | analisi morfo-sintattica | - |
| dc.subject.keywords | Lingua araba | - |
| dc.subject.keywords | Aramorph | - |
| dc.subject.singlekeyword | analisi morfo-sintattica | * |
| dc.subject.singlekeyword | Lingua araba | * |
| dc.subject.singlekeyword | Aramorph | * |
| dc.title | Aggiornamenti banca dati del Motore morfologico Aramorph | en |
| dc.type.driver | info:eu-repo/semantics/other | - |
| dc.type.full | 11 Applicazione o prodotto multimediale::11.04 Banca dati | it |
| dc.type.miur | 295 | - |
| dc.ugov.descaux1 | 390727 | - |
| iris.orcid.lastModifiedDate | 2024/03/01 12:45:47 | * |
| iris.orcid.lastModifiedMillisecond | 1709293547611 | * |
| iris.sitodocente.maxattempts | 1 | - |
| Appare nelle tipologie: | 11.04 Banca dati | |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.


