Modelling Etymology in LMF/TEI: The Grande Dicionário Houaiss da Língua Portuguesa Dictionary as a Use Case
2020
Abstract
In this article, we introduce two parts of the new multi-part version of the Lexical Markup Framework (LMF) ISO standard: Part 3 (ISO 24613-3), which deals with etymological and diachronic data, and Part 4 (ISO 24613-4), which consists of a TEI serialisation of all of the prior parts of the model. We demonstrate the use of both standards by describing the LMF encoding of a small number of examples taken from a sample conversion of the reference Portuguese dictionary Grande Dicionário Houaiss da Língua Portuguesa, part of a broader experiment comprising the analysis of different, heterogeneously encoded Portuguese lexical resources. We present the examples in the Unified Modelling Language (UML) and, in a couple of cases, also in TEI.

| DC field | Value | Language |
|---|---|---|
| dc.authority.people | Fahad Khan | it |
| dc.authority.people | Laurent Romary | it |
| dc.authority.people | Ana Salgado | it |
| dc.authority.people | Jack Bowers | it |
| dc.authority.people | Mohamed Khemakhem | it |
| dc.authority.people | Toma Tasovac | it |
| dc.collection.name | 04.01 Contribution in conference proceedings | * |
| dc.contributor.appartenenza | Istituto di linguistica computazionale "Antonio Zampolli" - ILC | * |
| dc.contributor.appartenenza.mi | 918 | * |
| dc.date.accessioned | 2024/02/19 10:12:52 | - |
| dc.date.available | 2024/02/19 10:12:52 | - |
| dc.date.issued | 2020 | - |
| dc.description.abstracteng | In this article, we introduce two parts of the new multi-part version of the Lexical Markup Framework (LMF) ISO standard: Part 3 (ISO 24613-3), which deals with etymological and diachronic data, and Part 4 (ISO 24613-4), which consists of a TEI serialisation of all of the prior parts of the model. We demonstrate the use of both standards by describing the LMF encoding of a small number of examples taken from a sample conversion of the reference Portuguese dictionary Grande Dicionário Houaiss da Língua Portuguesa, part of a broader experiment comprising the analysis of different, heterogeneously encoded Portuguese lexical resources. We present the examples in the Unified Modelling Language (UML) and, in a couple of cases, also in TEI. | - |
| dc.description.affiliations | Istituto di Linguistica Computazionale "A. Zampolli" - CNR, Pisa, Italy; Inria-ALMAnaCH - Automatic Language Modelling and ANAlysis, Computational Humanities, Paris, France; NOVA CLUNL, Universidade NOVA de Lisboa, Lisbon, Portugal; Academia das Ciências de Lisboa, Lisbon, Portugal; ACDH-CH - Austrian Center for Digital Humanities and Cultural Heritage, Vienna, Austria; Litt & Arts - UMR 5316, Grenoble; Université Paris Diderot, Paris, France; Centre Marc Bloch, Berlin, Germany; Belgrade Center for Digital Humanities, Belgrade, Serbia | - |
| dc.description.allpeople | Fahad Khan; Laurent Romary; Ana Salgado; Jack Bowers; Mohamed Khemakhem; Toma Tasovac | - |
| dc.description.allpeopleoriginal | Fahad Khan, Laurent Romary, Ana Salgado, Jack Bowers, Mohamed Khemakhem, Toma Tasovac | - |
| dc.description.fulltext | none | en |
| dc.description.numberofauthors | 1 | - |
| dc.identifier.isbn | 979-10-95546-36-8 | - |
| dc.identifier.uri | https://hdl.handle.net/20.500.14243/404921 | - |
| dc.identifier.url | https://aclanthology.org/2020.lrec-1.388.pdf | - |
| dc.language.iso | eng | - |
| dc.relation.conferencedate | 11-16/05/2020 | - |
| dc.relation.conferencename | Proceedings of the 12th Conference on Language Resources and Evaluation (LREC 2020) | - |
| dc.subject.keywords | LMF | - |
| dc.subject.keywords | TEI | - |
| dc.subject.keywords | Portuguese Language Resources | - |
| dc.subject.keywords | Dictionaries | - |
| dc.subject.singlekeyword | LMF | * |
| dc.subject.singlekeyword | TEI | * |
| dc.subject.singlekeyword | Portuguese Language Resources | * |
| dc.subject.singlekeyword | Dictionaries | * |
| dc.title | Modelling Etymology in LMF/TEI: The Grande Dicionário Houaiss da Língua Portuguesa Dictionary as a Use Case | en |
| dc.type.driver | info:eu-repo/semantics/conferenceObject | - |
| dc.type.full | 04 Conference contribution::04.01 Contribution in conference proceedings | it |
| dc.type.miur | 273 | - |
| dc.ugov.descaux1 | 429351 | - |
| Appears in types: | 04.01 Contribution in conference proceedings | |


