The extraction of information from texts requires resources that contain both syntactic and semantic properties of lexical units. As the use Of language in specialized domains, such as biology, can be very different to the general domain, there is a need for domain-specific resources to ensure that the information extracted is as accurate as possible. We are building a large-scale lexical resource for the biology domain. providing information about predicate-argument structure that has been bootstrapped from a biomedical corpus on the subject of E. Coli. The lexicon is currently focussed on verbs, and includes both automatically-extracted syntactic subcategorization frames, as well as semantic event frames that are based on annotation by domain experts. In addition, the lexicon contains manually-added explicit links between semantic and syntactic slots in corresponding frames. To Our knowledge, this lexicon currently represents a unique resource within in the biomedical domain.
Bootstrapping a Verb Lexicon for Biomedical Information Extraction
Venturi Giulia;Montemagni Simonetta;Marchi Simone;
2009
Abstract
The extraction of information from texts requires resources that contain both syntactic and semantic properties of lexical units. As the use Of language in specialized domains, such as biology, can be very different to the general domain, there is a need for domain-specific resources to ensure that the information extracted is as accurate as possible. We are building a large-scale lexical resource for the biology domain. providing information about predicate-argument structure that has been bootstrapped from a biomedical corpus on the subject of E. Coli. The lexicon is currently focussed on verbs, and includes both automatically-extracted syntactic subcategorization frames, as well as semantic event frames that are based on annotation by domain experts. In addition, the lexicon contains manually-added explicit links between semantic and syntactic slots in corresponding frames. To Our knowledge, this lexicon currently represents a unique resource within in the biomedical domain.| Campo DC | Valore | Lingua |
|---|---|---|
| dc.authority.orgunit | Istituto di linguistica computazionale "Antonio Zampolli" - ILC | - |
| dc.authority.people | Venturi Giulia | it |
| dc.authority.people | Montemagni Simonetta | it |
| dc.authority.people | Marchi Simone | it |
| dc.authority.people | Sasaki Yutaka | it |
| dc.authority.people | Thompson Paul | it |
| dc.authority.people | McNaught John | it |
| dc.authority.people | Ananiadou Sophia | it |
| dc.collection.id.s | 33fc2b58-b895-438b-9d2a-2c5bc86a83a6 | * |
| dc.collection.name | 04.04 Presentazione/Comunicazione non pubblicata in atti di convegno | * |
| dc.contributor.appartenenza | Istituto di linguistica computazionale "Antonio Zampolli" - ILC | * |
| dc.contributor.appartenenza.mi | 918 | * |
| dc.date.accessioned | 2024/02/19 19:34:32 | - |
| dc.date.available | 2024/02/19 19:34:32 | - |
| dc.date.issued | 2009 | - |
| dc.description.abstract | The extraction of information from texts requires resources that contain both syntactic and semantic properties of lexical units. As the use Of language in specialized domains, such as biology, can be very different to the general domain, there is a need for domain-specific resources to ensure that the information extracted is as accurate as possible. We are building a large-scale lexical resource for the biology domain. providing information about predicate-argument structure that has been bootstrapped from a biomedical corpus on the subject of E. Coli. The lexicon is currently focussed on verbs, and includes both automatically-extracted syntactic subcategorization frames, as well as semantic event frames that are based on annotation by domain experts. In addition, the lexicon contains manually-added explicit links between semantic and syntactic slots in corresponding frames. To Our knowledge, this lexicon currently represents a unique resource within in the biomedical domain. | - |
| dc.description.affiliations | Consiglio Nazionale delle Ricerche (CNR); Nactem (Manchester, UK) | - |
| dc.description.allpeople | Venturi, Giulia; Montemagni, Simonetta; Marchi, Simone; Sasaki, Yutaka; Thompson, Paul; Mcnaught, John; Ananiadou, Sophia | - |
| dc.description.allpeopleoriginal | Venturi, Giulia; Montemagni, Simonetta; Marchi, Simone; Sasaki, Yutaka; Thompson, Paul; McNaught, John; Ananiadou, Sophia | - |
| dc.description.fulltext | none | en |
| dc.description.numberofauthors | 7 | - |
| dc.identifier.isbn | 978-3-642-00381-3 | - |
| dc.identifier.isi | WOS:000265681200011 | - |
| dc.identifier.uri | https://hdl.handle.net/20.500.14243/106756 | - |
| dc.language.iso | eng | - |
| dc.relation.alleditors | Alexander Gelbukh | - |
| dc.relation.conferencedate | March 1-7, 2009 | - |
| dc.relation.conferencename | International Conference on Intelligent Text Processing and Computational Linguistics (CICLing 2009) | - |
| dc.relation.conferenceplace | Mexico City, Mexico | - |
| dc.relation.firstpage | 137 | - |
| dc.relation.ispartofbook | Proceedings of the 10th International Conference on Intelligent Text Processing and Computational Linguistics (CICLing 2009) | - |
| dc.relation.lastpage | 148 | - |
| dc.relation.numberofpages | 12 | - |
| dc.relation.volume | 5449 | - |
| dc.subject.keywords | domain-specific lexical resources | - |
| dc.subject.keywords | lexical acquisition | - |
| dc.subject.keywords | syntax-semantics linking | - |
| dc.subject.keywords | Information Extraction | - |
| dc.subject.keywords | Biological Language Processing | - |
| dc.subject.singlekeyword | domain-specific lexical resources | * |
| dc.subject.singlekeyword | lexical acquisition | * |
| dc.subject.singlekeyword | syntax-semantics linking | * |
| dc.subject.singlekeyword | Information Extraction | * |
| dc.subject.singlekeyword | Biological Language Processing | * |
| dc.title | Bootstrapping a Verb Lexicon for Biomedical Information Extraction | en |
| dc.type.driver | info:eu-repo/semantics/conferenceObject | - |
| dc.type.full | 04 Contributo in convegno::04.04 Presentazione/Comunicazione non pubblicata in atti di convegno | it |
| dc.type.miur | -2.0 | - |
| dc.type.referee | Sì, ma tipo non specificato | - |
| dc.ugov.descaux1 | 112956 | - |
| iris.isi.extIssued | 2009 | - |
| iris.isi.extTitle | Bootstrapping a Verb Lexicon for Biomedical Information Extraction | - |
| iris.isi.metadataErrorDescription | 0 | - |
| iris.isi.metadataErrorType | ERROR_NO_MATCH | - |
| iris.isi.metadataStatus | ERROR | - |
| iris.orcid.lastModifiedDate | 2025/04/05 01:21:02 | * |
| iris.orcid.lastModifiedMillisecond | 1743808862341 | * |
| iris.sitodocente.maxattempts | 4 | - |
| isi.authority.anceserie | LECTURE NOTES IN COMPUTER SCIENCE###0302-9743 | * |
| isi.authority.sdg | Goal 3: Good health and well-being###12083 | * |
| isi.category | EX | * |
| isi.contributor.affiliation | Consiglio Nazionale delle Ricerche (CNR) | - |
| isi.contributor.affiliation | Consiglio Nazionale delle Ricerche (CNR) | - |
| isi.contributor.affiliation | Consiglio Nazionale delle Ricerche (CNR) | - |
| isi.contributor.affiliation | University of Manchester | - |
| isi.contributor.affiliation | University of Manchester | - |
| isi.contributor.affiliation | University of Manchester | - |
| isi.contributor.affiliation | University of Manchester | - |
| isi.contributor.country | Italy | - |
| isi.contributor.country | Italy | - |
| isi.contributor.country | Italy | - |
| isi.contributor.country | England | - |
| isi.contributor.country | England | - |
| isi.contributor.country | England | - |
| isi.contributor.country | England | - |
| isi.contributor.name | Giulia | - |
| isi.contributor.name | Simonetta | - |
| isi.contributor.name | Simone | - |
| isi.contributor.name | Yutaka | - |
| isi.contributor.name | Paul | - |
| isi.contributor.name | John | - |
| isi.contributor.name | Sophia | - |
| isi.contributor.researcherId | AAY-3932-2020 | - |
| isi.contributor.researcherId | B-8000-2015 | - |
| isi.contributor.researcherId | A-4098-2016 | - |
| isi.contributor.researcherId | MSG-9150-2025 | - |
| isi.contributor.researcherId | MLV-9755-2025 | - |
| isi.contributor.researcherId | DHA-6073-2022 | - |
| isi.contributor.researcherId | GBF-3762-2022 | - |
| isi.contributor.subaffiliation | Ist Linguist Computaz | - |
| isi.contributor.subaffiliation | Ist Linguist Computaz | - |
| isi.contributor.subaffiliation | Ist Linguist Computaz | - |
| isi.contributor.subaffiliation | Sch Comp Sci | - |
| isi.contributor.subaffiliation | Sch Comp Sci | - |
| isi.contributor.subaffiliation | Sch Comp Sci | - |
| isi.contributor.subaffiliation | Sch Comp Sci | - |
| isi.contributor.surname | Venturi | - |
| isi.contributor.surname | Montemagni | - |
| isi.contributor.surname | Marchi | - |
| isi.contributor.surname | Sasaki | - |
| isi.contributor.surname | Thompson | - |
| isi.contributor.surname | McNaught | - |
| isi.contributor.surname | Ananiadou | - |
| isi.date.issued | 2009 | * |
| isi.description.allpeopleoriginal | Venturi, G; Montemagni, S; Marchi, S; Sasaki, Y; Thompson, P; McNaught, J; Ananiadou, S; | * |
| isi.document.sourcetype | WOS.ISTP | * |
| isi.document.type | Proceedings Paper | * |
| isi.document.types | Proceedings Paper | * |
| isi.identifier.eissn | 1611-3349 | * |
| isi.identifier.isi | WOS:000265681200011 | * |
| isi.journal.journaltitle | COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING | * |
| isi.journal.journaltitleabbrev | LECT NOTES COMPUT SC | * |
| isi.language.original | English | * |
| isi.publisher.place | HEIDELBERGER PLATZ 3, D-14197 BERLIN, GERMANY | * |
| isi.relation.firstpage | 137 | * |
| isi.relation.lastpage | + | * |
| isi.relation.volume | 5449 | * |
| isi.title | Bootstrapping a Verb Lexicon for Biomedical Information Extraction | * |
| Appare nelle tipologie: | 04.04 Presentazione/Comunicazione non pubblicata (convegno, evento, webinar...) | |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.


