The extraction of information from texts requires resources that contain both syntactic and semantic properties of lexical units. As the use Of language in specialized domains, such as biology, can be very different to the general domain, there is a need for domain-specific resources to ensure that the information extracted is as accurate as possible. We are building a large-scale lexical resource for the biology domain. providing information about predicate-argument structure that has been bootstrapped from a biomedical corpus on the subject of E. Coli. The lexicon is currently focussed on verbs, and includes both automatically-extracted syntactic subcategorization frames, as well as semantic event frames that are based on annotation by domain experts. In addition, the lexicon contains manually-added explicit links between semantic and syntactic slots in corresponding frames. To Our knowledge, this lexicon currently represents a unique resource within in the biomedical domain.

Bootstrapping a Verb Lexicon for Biomedical Information Extraction

Venturi Giulia;Montemagni Simonetta;Marchi Simone;
2009

Abstract

The extraction of information from texts requires resources that contain both syntactic and semantic properties of lexical units. As the use Of language in specialized domains, such as biology, can be very different to the general domain, there is a need for domain-specific resources to ensure that the information extracted is as accurate as possible. We are building a large-scale lexical resource for the biology domain. providing information about predicate-argument structure that has been bootstrapped from a biomedical corpus on the subject of E. Coli. The lexicon is currently focussed on verbs, and includes both automatically-extracted syntactic subcategorization frames, as well as semantic event frames that are based on annotation by domain experts. In addition, the lexicon contains manually-added explicit links between semantic and syntactic slots in corresponding frames. To Our knowledge, this lexicon currently represents a unique resource within in the biomedical domain.
Campo DC Valore Lingua
dc.authority.orgunit Istituto di linguistica computazionale "Antonio Zampolli" - ILC -
dc.authority.people Venturi Giulia it
dc.authority.people Montemagni Simonetta it
dc.authority.people Marchi Simone it
dc.authority.people Sasaki Yutaka it
dc.authority.people Thompson Paul it
dc.authority.people McNaught John it
dc.authority.people Ananiadou Sophia it
dc.collection.id.s 33fc2b58-b895-438b-9d2a-2c5bc86a83a6 *
dc.collection.name 04.04 Presentazione/Comunicazione non pubblicata in atti di convegno *
dc.contributor.appartenenza Istituto di linguistica computazionale "Antonio Zampolli" - ILC *
dc.contributor.appartenenza.mi 918 *
dc.date.accessioned 2024/02/19 19:34:32 -
dc.date.available 2024/02/19 19:34:32 -
dc.date.issued 2009 -
dc.description.abstract The extraction of information from texts requires resources that contain both syntactic and semantic properties of lexical units. As the use Of language in specialized domains, such as biology, can be very different to the general domain, there is a need for domain-specific resources to ensure that the information extracted is as accurate as possible. We are building a large-scale lexical resource for the biology domain. providing information about predicate-argument structure that has been bootstrapped from a biomedical corpus on the subject of E. Coli. The lexicon is currently focussed on verbs, and includes both automatically-extracted syntactic subcategorization frames, as well as semantic event frames that are based on annotation by domain experts. In addition, the lexicon contains manually-added explicit links between semantic and syntactic slots in corresponding frames. To Our knowledge, this lexicon currently represents a unique resource within in the biomedical domain. -
dc.description.affiliations Consiglio Nazionale delle Ricerche (CNR); Nactem (Manchester, UK) -
dc.description.allpeople Venturi, Giulia; Montemagni, Simonetta; Marchi, Simone; Sasaki, Yutaka; Thompson, Paul; Mcnaught, John; Ananiadou, Sophia -
dc.description.allpeopleoriginal Venturi, Giulia; Montemagni, Simonetta; Marchi, Simone; Sasaki, Yutaka; Thompson, Paul; McNaught, John; Ananiadou, Sophia -
dc.description.fulltext none en
dc.description.numberofauthors 7 -
dc.identifier.isbn 978-3-642-00381-3 -
dc.identifier.isi WOS:000265681200011 -
dc.identifier.uri https://hdl.handle.net/20.500.14243/106756 -
dc.language.iso eng -
dc.relation.alleditors Alexander Gelbukh -
dc.relation.conferencedate March 1-7, 2009 -
dc.relation.conferencename International Conference on Intelligent Text Processing and Computational Linguistics (CICLing 2009) -
dc.relation.conferenceplace Mexico City, Mexico -
dc.relation.firstpage 137 -
dc.relation.ispartofbook Proceedings of the 10th International Conference on Intelligent Text Processing and Computational Linguistics (CICLing 2009) -
dc.relation.lastpage 148 -
dc.relation.numberofpages 12 -
dc.relation.volume 5449 -
dc.subject.keywords domain-specific lexical resources -
dc.subject.keywords lexical acquisition -
dc.subject.keywords syntax-semantics linking -
dc.subject.keywords Information Extraction -
dc.subject.keywords Biological Language Processing -
dc.subject.singlekeyword domain-specific lexical resources *
dc.subject.singlekeyword lexical acquisition *
dc.subject.singlekeyword syntax-semantics linking *
dc.subject.singlekeyword Information Extraction *
dc.subject.singlekeyword Biological Language Processing *
dc.title Bootstrapping a Verb Lexicon for Biomedical Information Extraction en
dc.type.driver info:eu-repo/semantics/conferenceObject -
dc.type.full 04 Contributo in convegno::04.04 Presentazione/Comunicazione non pubblicata in atti di convegno it
dc.type.miur -2.0 -
dc.type.referee Sì, ma tipo non specificato -
dc.ugov.descaux1 112956 -
iris.isi.extIssued 2009 -
iris.isi.extTitle Bootstrapping a Verb Lexicon for Biomedical Information Extraction -
iris.isi.metadataErrorDescription 0 -
iris.isi.metadataErrorType ERROR_NO_MATCH -
iris.isi.metadataStatus ERROR -
iris.orcid.lastModifiedDate 2025/04/05 01:21:02 *
iris.orcid.lastModifiedMillisecond 1743808862341 *
iris.sitodocente.maxattempts 4 -
isi.authority.anceserie LECTURE NOTES IN COMPUTER SCIENCE###0302-9743 *
isi.authority.sdg Goal 3: Good health and well-being###12083 *
isi.category EX *
isi.contributor.affiliation Consiglio Nazionale delle Ricerche (CNR) -
isi.contributor.affiliation Consiglio Nazionale delle Ricerche (CNR) -
isi.contributor.affiliation Consiglio Nazionale delle Ricerche (CNR) -
isi.contributor.affiliation University of Manchester -
isi.contributor.affiliation University of Manchester -
isi.contributor.affiliation University of Manchester -
isi.contributor.affiliation University of Manchester -
isi.contributor.country Italy -
isi.contributor.country Italy -
isi.contributor.country Italy -
isi.contributor.country England -
isi.contributor.country England -
isi.contributor.country England -
isi.contributor.country England -
isi.contributor.name Giulia -
isi.contributor.name Simonetta -
isi.contributor.name Simone -
isi.contributor.name Yutaka -
isi.contributor.name Paul -
isi.contributor.name John -
isi.contributor.name Sophia -
isi.contributor.researcherId AAY-3932-2020 -
isi.contributor.researcherId B-8000-2015 -
isi.contributor.researcherId A-4098-2016 -
isi.contributor.researcherId MSG-9150-2025 -
isi.contributor.researcherId MLV-9755-2025 -
isi.contributor.researcherId DHA-6073-2022 -
isi.contributor.researcherId GBF-3762-2022 -
isi.contributor.subaffiliation Ist Linguist Computaz -
isi.contributor.subaffiliation Ist Linguist Computaz -
isi.contributor.subaffiliation Ist Linguist Computaz -
isi.contributor.subaffiliation Sch Comp Sci -
isi.contributor.subaffiliation Sch Comp Sci -
isi.contributor.subaffiliation Sch Comp Sci -
isi.contributor.subaffiliation Sch Comp Sci -
isi.contributor.surname Venturi -
isi.contributor.surname Montemagni -
isi.contributor.surname Marchi -
isi.contributor.surname Sasaki -
isi.contributor.surname Thompson -
isi.contributor.surname McNaught -
isi.contributor.surname Ananiadou -
isi.date.issued 2009 *
isi.description.allpeopleoriginal Venturi, G; Montemagni, S; Marchi, S; Sasaki, Y; Thompson, P; McNaught, J; Ananiadou, S; *
isi.document.sourcetype WOS.ISTP *
isi.document.type Proceedings Paper *
isi.document.types Proceedings Paper *
isi.identifier.eissn 1611-3349 *
isi.identifier.isi WOS:000265681200011 *
isi.journal.journaltitle COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING *
isi.journal.journaltitleabbrev LECT NOTES COMPUT SC *
isi.language.original English *
isi.publisher.place HEIDELBERGER PLATZ 3, D-14197 BERLIN, GERMANY *
isi.relation.firstpage 137 *
isi.relation.lastpage + *
isi.relation.volume 5449 *
isi.title Bootstrapping a Verb Lexicon for Biomedical Information Extraction *
Appare nelle tipologie: 04.04 Presentazione/Comunicazione non pubblicata (convegno, evento, webinar...)
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/106756
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? 4
social impact