In this article, we will introduce two of the new parts of the new multi-part version of the Lexical Markup Framework (LMF) ISO standard, namely Part 3 of the standard (ISO 24613-3), which deals with etymological and diachronic data, and Part 4 (ISO 24613-4), which consists of a TEI serialisation of all of the prior parts of the model. We will demonstrate the use of both standards by describing the LMF encoding of a small number of examples taken from a sample conversion of the reference Portuguese dictionary Grande Dicion´ario Houaiss da L´?ngua Portuguesa, part of a broader experiment comprising the analysis of different, heterogeneously encoded, Portuguese lexical resources. We present the examples in the Unified Modelling Language (UML) and also in a couple of cases in TEI.

Modelling Etymology in LMF/TEI: The Grande Dicionário Houaiss da Língua Portuguesa Dictionary as a Use Case

2020

Abstract

In this article, we will introduce two of the new parts of the new multi-part version of the Lexical Markup Framework (LMF) ISO standard, namely Part 3 of the standard (ISO 24613-3), which deals with etymological and diachronic data, and Part 4 (ISO 24613-4), which consists of a TEI serialisation of all of the prior parts of the model. We will demonstrate the use of both standards by describing the LMF encoding of a small number of examples taken from a sample conversion of the reference Portuguese dictionary Grande Dicion´ario Houaiss da L´?ngua Portuguesa, part of a broader experiment comprising the analysis of different, heterogeneously encoded, Portuguese lexical resources. We present the examples in the Unified Modelling Language (UML) and also in a couple of cases in TEI.
Campo DC Valore Lingua
dc.authority.people Fahad Khan it
dc.authority.people Laurent Romary it
dc.authority.people Ana Salgado it
dc.authority.people Jack Bowers it
dc.authority.people Mohamed Khemakhem it
dc.authority.people Toma Tasovac it
dc.collection.id.s 71c7200a-7c5f-4e83-8d57-d3d2ba88f40d *
dc.collection.name 04.01 Contributo in Atti di convegno *
dc.contributor.appartenenza Istituto di linguistica computazionale "Antonio Zampolli" - ILC *
dc.contributor.appartenenza.mi 918 *
dc.date.accessioned 2024/02/19 10:12:52 -
dc.date.available 2024/02/19 10:12:52 -
dc.date.issued 2020 -
dc.description.abstracteng In this article, we will introduce two of the new parts of the new multi-part version of the Lexical Markup Framework (LMF) ISO standard, namely Part 3 of the standard (ISO 24613-3), which deals with etymological and diachronic data, and Part 4 (ISO 24613-4), which consists of a TEI serialisation of all of the prior parts of the model. We will demonstrate the use of both standards by describing the LMF encoding of a small number of examples taken from a sample conversion of the reference Portuguese dictionary Grande Dicion´ario Houaiss da L´?ngua Portuguesa, part of a broader experiment comprising the analysis of different, heterogeneously encoded, Portuguese lexical resources. We present the examples in the Unified Modelling Language (UML) and also in a couple of cases in TEI. -
dc.description.affiliations Istituto di Linguistica Computazionale "A. Zampolli- CNR", Pisa, Italy, Inria-ALMAnaCH - Automatic Language Modelling and ANAlysis, Computational Humanities, Paris, France, NOVA CLUNL, Universidade NOVA de Lisboa, Lisbon, Portugal, Academia das Ciencias de Lisboa, Lisbon, Portugal , ACDH-CH - Austrian Center for Digital Humanities and Cultural Heritage, Vienna, Austria, Litt & Arts - UMR 5316, Grenoble, Universite Paris Diderot, Paris, France ´, Centre Marc Bloch, Berlin, Germany, Belgrade Center for Digital Humanities, Belgrade, Serbia -
dc.description.allpeople Fahad Khan; Laurent Romary; Ana Salgado; Jack Bowers; Mohamed Khemakhem; Toma Tasovac -
dc.description.allpeopleoriginal Fahad Khan, Laurent Romary, Ana Salgado, Jack Bowers, Mohamed Khemakhem, Toma Tasovac -
dc.description.fulltext none en
dc.description.numberofauthors 1 -
dc.identifier.isbn 979-10-95546-36-8 -
dc.identifier.uri https://hdl.handle.net/20.500.14243/404921 -
dc.identifier.url https://aclanthology.org/2020.lrec-1.388.pdf -
dc.language.iso eng -
dc.relation.conferencedate 11-16/05/2020 -
dc.relation.conferencename Proceedings of the 12th Conference on Language Resources and Evaluation (LREC 2020) -
dc.subject.keywords LMF -
dc.subject.keywords TEI -
dc.subject.keywords Portuguese Language Resources -
dc.subject.keywords Dictionaries -
dc.subject.singlekeyword LMF *
dc.subject.singlekeyword TEI *
dc.subject.singlekeyword Portuguese Language Resources *
dc.subject.singlekeyword Dictionaries *
dc.title Modelling Etymology in LMF/TEI: The Grande Dicionário Houaiss da Língua Portuguesa Dictionary as a Use Case en
dc.type.driver info:eu-repo/semantics/conferenceObject -
dc.type.full 04 Contributo in convegno::04.01 Contributo in Atti di convegno it
dc.type.miur 273 -
dc.ugov.descaux1 429351 -
iris.orcid.lastModifiedDate 2024/03/02 05:34:24 *
iris.orcid.lastModifiedMillisecond 1709354064631 *
iris.sitodocente.maxattempts 1 -
Appare nelle tipologie: 04.01 Contributo in Atti di convegno
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/404921
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact