From medical language processing to BioNLP domain

Pardelli G; Sassi M; Goggi S; Biagioni S
2012

Abstract

This paper presents the results of terminological work on a reference corpus in the domain of biomedicine. In particular, the research analyses the use of certain biomedical terms in order to trace how they change over time, with the aim of retrieving from the web the very essence of documentation. The terminological sample contains words used in BioNLP and biomedicine and identifies which terms are passing from scientific publications to the daily press and which remain reserved to scientific production. The final aim of this work is to determine how scientific dissemination to an ever larger part of society enables the general public to approach communication on biomedical research and development. Its main source is a reference corpus made up of three main repositories from which information related to BioNLP and biomedicine is extracted. The paper is divided into three sections: 1) an introduction dedicated to data extracted from scientific documentation; 2) a section devoted to methodology and data description; 3) a section containing a statistical representation of terms extracted from the archive. Indexes and concordances make it possible to reflect on the use of certain terms in this field and offer possible keys for accessing knowledge extraction in the digital era.
DC Field Value Language
dc.authority.orgunit Istituto di linguistica computazionale "Antonio Zampolli" - ILC -
dc.authority.orgunit Istituto di Scienza e Tecnologie dell'Informazione "Alessandro Faedo" - ISTI -
dc.authority.people Pardelli G it
dc.authority.people Sassi M it
dc.authority.people Goggi S it
dc.authority.people Biagioni S it
dc.collection.id.s 71c7200a-7c5f-4e83-8d57-d3d2ba88f40d *
dc.collection.name 04.01 Contributo in Atti di convegno *
dc.contributor.appartenenza Istituto di Scienza e Tecnologie dell'Informazione "Alessandro Faedo" - ISTI *
dc.contributor.appartenenza Istituto di linguistica computazionale "Antonio Zampolli" - ILC *
dc.contributor.appartenenza.mi 918 *
dc.contributor.appartenenza.mi 973 *
dc.date.accessioned 2024/02/16 04:46:07 -
dc.date.available 2024/02/16 04:46:07 -
dc.date.issued 2012 -
dc.description.abstracteng This paper presents the results of a terminological work on a reference corpus in the domain of Biomedicine. In particular, the research tends to analyse the use of certain terms in Biomedicine in order to verify their change over the time with the aim of retrieving from the net the very essence of documentation. The terminological sample contains words used in BioNLP and biomedicine and identifies which terms are passing from scientific publications to the daily press and which are rather reserved to scientific production. The final scope of this work is to determine how scientific dissemination to an ever larger part of the society enables a public of common citizens to approach communication on biomedical research and development; and its main source is a reference corpus made up of three main repositories from which information related to BioNLP and Biomedicine is extracted. The paper is divided in three sections: 1) an introduction dedicated to data extracted from scientific documentation; 2) the second section devoted to methodology and data description; 3) the third part containing a statistical representation of terms extracted from the archive: indexes and concordances allow to reflect on the use of certain terms in this field and give possible keys for having access to the extraction of knowledge in the digital era. -
dc.description.affiliations CNR-ILC, Pisa, Italy; CNR-ILC, Pisa, Italy; CNR-ILC, Pisa, Italy; CNR-ISTI, Pisa, Italy; -
dc.description.allpeople Pardelli, G; Sassi, M; Goggi, S; Biagioni, S -
dc.description.allpeopleoriginal Pardelli G.; Sassi M.; Goggi S.; Biagioni S. -
dc.description.fulltext restricted en
dc.description.note ISBN 978-2-9517408-7-7; Publisher: European Language Resources Association (ELRA); ACL Anthology - A Digital Archive of Research Papers in Computational Linguistics (http://www.aclweb.org/anthology/L/L12/) -
dc.description.numberofauthors 4 -
dc.identifier.isbn 978-2-9517408-7-7 -
dc.identifier.isi WOS:000323927702020 -
dc.identifier.scopus 2-s2.0-85037356940 -
dc.identifier.uri https://hdl.handle.net/20.500.14243/6009 -
dc.identifier.url http://www.lrec-conf.org/proceedings/lrec2012/pdf/687_Paper.pdf -
dc.language.iso eng -
dc.publisher.country FRA -
dc.publisher.name European Language Resources Association (ELRA) - Evaluations and Language resources Distribution Agency (ELDA) -
dc.publisher.place Paris -
dc.relation.alleditors Nicoletta Calzolari (Conference Chair) and Khalid Choukri and Thierry Declerck and Mehmet Uğur Doğan and Bente Maegaard and Joseph Mariani and Jan Odijk and Stelios Piperidis -
dc.relation.conferencedate 21-27 May 2012 -
dc.relation.conferencename Eighth International Conference on Language Resources and Evaluation. LREC'12 -
dc.relation.conferenceplace Istanbul, Turkey -
dc.relation.firstpage 2049 -
dc.relation.ispartofbook Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12) -
dc.relation.lastpage 2055 -
dc.relation.numberofpages 7 -
dc.subject.keywords Information Extraction -
dc.subject.keywords Information Retrieval -
dc.subject.keywords Text mining -
dc.subject.keywords Digital Libraries -
dc.subject.singlekeyword Information Extraction *
dc.subject.singlekeyword Information Retrieval *
dc.subject.singlekeyword Text mining *
dc.subject.singlekeyword Digital Libraries *
dc.title From medical language processing to BioNLP domain en
dc.type.driver info:eu-repo/semantics/conferenceObject -
dc.type.full 04 Contributo in convegno::04.01 Contributo in Atti di convegno it
dc.type.miur 273 -
dc.type.referee Sì, ma tipo non specificato -
dc.ugov.descaux1 217173 -
iris.isi.extIssued 2012 -
iris.isi.extTitle From medical language processing to BioNLP domain -
iris.mediafilter.data 2025/04/06 03:14:36 *
iris.orcid.lastModifiedDate 2024/07/04 16:07:15 *
iris.orcid.lastModifiedMillisecond 1720102035171 *
iris.scopus.extIssued 2012 -
iris.scopus.extTitle From medical language processing to BioNLP domain -
iris.scopus.ideLinkStatusDate 2024/07/04 16:07:15 *
iris.scopus.ideLinkStatusMillisecond 1720102035177 *
iris.sitodocente.maxattempts 3 -
isi.authority.sdg Goal 3: Good health and well-being###12083 *
isi.category OT *
isi.category OY *
isi.contributor.affiliation Consiglio Nazionale delle Ricerche (CNR) -
isi.contributor.affiliation Consiglio Nazionale delle Ricerche (CNR) -
isi.contributor.affiliation Consiglio Nazionale delle Ricerche (CNR) -
isi.contributor.affiliation -
isi.contributor.country Italy -
isi.contributor.country Italy -
isi.contributor.country Italy -
isi.contributor.country -
isi.contributor.name Gabriella -
isi.contributor.name Manuela -
isi.contributor.name Sara -
isi.contributor.name Stefania -
isi.contributor.researcherId Z-4698-2019 -
isi.contributor.researcherId GGQ-1037-2022 -
isi.contributor.researcherId EZY-9908-2022 -
isi.contributor.researcherId FZF-8568-2022 -
isi.contributor.subaffiliation -
isi.contributor.subaffiliation -
isi.contributor.subaffiliation -
isi.contributor.subaffiliation -
isi.contributor.surname Pardelli -
isi.contributor.surname Sassi -
isi.contributor.surname Goggi -
isi.contributor.surname Biagioni -
isi.date.issued 2012 *
isi.description.abstracteng This paper presents the results of a terminological work on a reference corpus in the domain of Biomedicine. In particular, the research tends to analyse the use of certain terms in Biomedicine in order to verify their change over the time with the aim of retrieving from the net the very essence of documentation. The terminological sample contains words used in BioNLP and biomedicine and identifies which terms are passing from scientific publications to the daily press and which are rather reserved to scientific production. The final scope of this work is to determine how scientific dissemination to an ever larger part of the society enables a public of common citizens to approach communication on biomedical research and development; and its main source is a reference corpus made up of three main repositories from which information related to BioNLP and Biomedicine is extracted. This study is divided in three sections: 1) an introduction dedicated to data extracted from scientific documentation; 2) the second section devoted to methodology and data description; 3) the third part containing a statistical representation of terms extracted from the archive: indexes and concordances allow to reflect on the use of certain terms in this field and give possible keys for having access to the extraction of knowledge in the digital era. *
isi.description.allpeopleoriginal Pardelli, G; Sassi, M; Goggi, S; Biagioni, S; *
isi.document.sourcetype WOS.ISSHP *
isi.document.type Proceedings Paper *
isi.document.types Proceedings Paper *
isi.identifier.isi WOS:000323927702020 *
isi.journal.journaltitle LREC 2012 - EIGHTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION *
isi.language.original English *
isi.publisher.place 55-57, RUE BRILLAT-SAVARIN, PARIS, 75013, FRANCE *
isi.relation.firstpage 2049 *
isi.relation.lastpage 2055 *
isi.title From medical language processing to BioNLP domain *
scopus.category 1203 *
scopus.category 3304 *
scopus.category 3310 *
scopus.category 3309 *
scopus.contributor.affiliation Istituto di Linguistica Computazionale Antonio Zampolli -
scopus.contributor.affiliation Istituto di Linguistica Computazionale Antonio Zampolli -
scopus.contributor.affiliation Istituto di Linguistica Computazionale Antonio Zampolli -
scopus.contributor.affiliation Istituto di Scienza e Tecnologie Dell'Informazione Alessandro Faedo -
scopus.contributor.afid 60008941 -
scopus.contributor.afid 60008941 -
scopus.contributor.afid 60008941 -
scopus.contributor.afid 60085207 -
scopus.contributor.auid 36621334400 -
scopus.contributor.auid 36621402800 -
scopus.contributor.auid 56538507900 -
scopus.contributor.auid 7004056440 -
scopus.contributor.country Italy -
scopus.contributor.country Italy -
scopus.contributor.country Italy -
scopus.contributor.country Italy -
scopus.contributor.dptid -
scopus.contributor.dptid -
scopus.contributor.dptid -
scopus.contributor.dptid -
scopus.contributor.name Gabriella -
scopus.contributor.name Manuela -
scopus.contributor.name Sara -
scopus.contributor.name Stefania -
scopus.contributor.subaffiliation -
scopus.contributor.subaffiliation -
scopus.contributor.subaffiliation -
scopus.contributor.subaffiliation -
scopus.contributor.surname Pardelli -
scopus.contributor.surname Sassi -
scopus.contributor.surname Goggi -
scopus.contributor.surname Biagioni -
scopus.date.issued 2012 *
scopus.description.abstracteng This paper presents the results of a terminological work on a reference corpus in the domain of Biomedicine. In particular, the research tends to analyse the use of certain terms in Biomedicine in order to verify their change over the time with the aim of retrieving from the net the very essence of documentation. The terminological sample contains words used in BioNLP and biomedicine and identifies which terms are passing from scientific publications to the daily press and which are rather reserved to scientific production. The final scope of this work is to determine how scientific dissemination to an ever larger part of the society enables a public of common citizens to approach communication on biomedical research and development; and its main source is a reference corpus made up of three main repositories from which information related to BioNLP and Biomedicine is extracted. This study is divided in three sections: 1) an introduction dedicated to data extracted from scientific documentation; 2) the second section devoted to methodology and data description; 3) the third part containing a statistical representation of terms extracted from the archive: indexes and concordances allow to reflect on the use of certain terms in this field and give possible keys for having access to the extraction of knowledge in the digital era. *
scopus.description.allpeopleoriginal Pardelli G.; Sassi M.; Goggi S.; Biagioni S. *
scopus.differences scopus.relation.conferencename *
scopus.differences scopus.publisher.name *
scopus.differences scopus.subject.keywords *
scopus.differences scopus.relation.conferencedate *
scopus.differences scopus.identifier.isbn *
scopus.differences scopus.description.abstracteng *
scopus.differences scopus.relation.conferenceplace *
scopus.document.type cp *
scopus.document.types cp *
scopus.identifier.isbn 9782951740877 *
scopus.identifier.pui 619588816 *
scopus.identifier.scopus 2-s2.0-85037356940 *
scopus.journal.sourceid 21100842151 *
scopus.language.iso eng *
scopus.publisher.name European Language Resources Association (ELRA) *
scopus.relation.conferencedate 2012 *
scopus.relation.conferencename 8th International Conference on Language Resources and Evaluation, LREC 2012 *
scopus.relation.conferenceplace Istanbul Lufti Kirdar Convention and Exhibition Centre, tur *
scopus.relation.firstpage 2049 *
scopus.relation.lastpage 2055 *
scopus.subject.keywords Biomedicine; Natural language processing; Terminology; *
scopus.title From medical language processing to BioNLP domain *
scopus.titleeng From medical language processing to BioNLP domain *
Appears in types: 04.01 Contributo in Atti di convegno (conference proceedings contribution)
Files in this item:
File: prod_217173-doc_50569.pdf (authorized users only)
Description: From medical language processing to BioNLP domain
Type: Publisher's version (PDF)
Size: 555.9 kB
Format: Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/20.500.14243/6009
Citations
  • PMC: ND
  • Scopus: 2
  • Web of Science: 1