The present paper describes a large-scale lexical resource for the biology domain designed both for human and for machine use. This lexicon aims at semantic interoperability and extendability, through the adoption of ISO-LMF standard for lexical representation and through a granular and distributed encoding of relevant information. The first part of this contribution focuses on three aspects of the model that are of particular interest to the biology community: the treatment of term variants, the representation on bio events and the alignment with a domain ontology. The second part of the paper describes the physical implementation of the model: a relational database equipped with a set of automatic uploading procedures. Peculiarity of the BioLexicon is that it combines features of both terminologies and lexicons. A set verbs relevant for the domain is also represented with full details on their syntactic and semantic argument structure.

Toward a Standard Lexical Resource in the Bio Domain

Quochi V;Del Gratta R;Sassolini E;Monachini M;
2007

Abstract

The present paper describes a large-scale lexical resource for the biology domain designed both for human and for machine use. This lexicon aims at semantic interoperability and extendability, through the adoption of ISO-LMF standard for lexical representation and through a granular and distributed encoding of relevant information. The first part of this contribution focuses on three aspects of the model that are of particular interest to the biology community: the treatment of term variants, the representation on bio events and the alignment with a domain ontology. The second part of the paper describes the physical implementation of the model: a relational database equipped with a set of automatic uploading procedures. Peculiarity of the BioLexicon is that it combines features of both terminologies and lexicons. A set verbs relevant for the domain is also represented with full details on their syntactic and semantic argument structure.
Campo DC Valore Lingua
dc.authority.orgunit Istituto di linguistica computazionale "Antonio Zampolli" - ILC -
dc.authority.people Quochi V it
dc.authority.people Del Gratta R it
dc.authority.people Sassolini E it
dc.authority.people Monachini M it
dc.authority.people Calzolari N it
dc.collection.id.s 71c7200a-7c5f-4e83-8d57-d3d2ba88f40d *
dc.collection.name 04.01 Contributo in Atti di convegno *
dc.date.accessioned 2024/02/19 19:49:40 -
dc.date.available 2024/02/19 19:49:40 -
dc.date.issued 2007 -
dc.description.abstract The present paper describes a large-scale lexical resource for the biology domain designed both for human and for machine use. This lexicon aims at semantic interoperability and extendability, through the adoption of ISO-LMF standard for lexical representation and through a granular and distributed encoding of relevant information. The first part of this contribution focuses on three aspects of the model that are of particular interest to the biology community: the treatment of term variants, the representation on bio events and the alignment with a domain ontology. The second part of the paper describes the physical implementation of the model: a relational database equipped with a set of automatic uploading procedures. Peculiarity of the BioLexicon is that it combines features of both terminologies and lexicons. A set verbs relevant for the domain is also represented with full details on their syntactic and semantic argument structure. -
dc.description.affiliations CNR-ILC, Pisa -
dc.description.allpeople Quochi, V; Del Gratta, R; Sassolini, E; Monachini, M; Calzolari, N -
dc.description.allpeopleoriginal Quochi V.; Del Gratta R.; Sassolini E.; Monachini M.; Calzolari N. -
dc.description.fulltext none en
dc.description.numberofauthors 5 -
dc.identifier.isbn 978-83-7177-413-3 -
dc.identifier.uri https://hdl.handle.net/20.500.14243/65109 -
dc.language.iso eng -
dc.miur.last.status.update 2024-10-02T14:23:45Z *
dc.publisher.country POL -
dc.publisher.name Fundacja Uniwersytetu im A. Mickiewicza -
dc.publisher.place Poznan -
dc.relation.conferencedate 5-7 Ottobre 2007 -
dc.relation.conferencename LTC07 - 3rd Language and Technology Conference: Human Language Technology. Challenges of the Information Society: -
dc.relation.conferenceplace Poznan, Poland -
dc.relation.firstpage 295 -
dc.relation.lastpage 299 -
dc.subject.keywords Lexical representation model -
dc.subject.keywords Lexical Database -
dc.subject.keywords Computational Lexicography -
dc.subject.keywords Special Domains -
dc.subject.keywords Standards -
dc.subject.singlekeyword Lexical representation model *
dc.subject.singlekeyword Lexical Database *
dc.subject.singlekeyword Computational Lexicography *
dc.subject.singlekeyword Special Domains *
dc.subject.singlekeyword Standards *
dc.title Toward a Standard Lexical Resource in the Bio Domain en
dc.type.driver info:eu-repo/semantics/conferenceObject -
dc.type.full 04 Contributo in convegno::04.01 Contributo in Atti di convegno it
dc.type.miur 273 -
dc.type.referee Sì, ma tipo non specificato -
dc.ugov.descaux1 84735 -
iris.orcid.lastModifiedDate 2024/10/02 16:24:39 *
iris.orcid.lastModifiedMillisecond 1727879079222 *
iris.sitodocente.maxattempts 3 -
Appare nelle tipologie: 04.01 Contributo in Atti di convegno
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/65109
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact