An open archive of scientific communication

Pardelli G; Sassi M; Orsolini P; Biagioni S; Giannini S
2011

Abstract

This paper presents the results of a terminological study conducted by the authors on a digital archive network of the Italian National Research Council (CNR) in the field of Computer Science. In particular, the research analyses the use of certain Computer Science terms in order to trace how they have changed over time, with the aim of retrieving from the network the very essence of the documentation. Its main source is a reference corpus of 13,500 documents that collects the scientific output of three CNR research institutes: ISTI (Institute of Information Science and Technologies), IIT (Institute of Informatics and Telematics) and ILC (Institute of Computational Linguistics), all of which originated from the "Centro Studi sulle Calcolatrici Elettroniche (CSCE)" and now belong to the CNR Department of Information & Communication Technologies and Cultural Identity. The study is divided into three sections. The first, introductory section is dedicated to the data extracted from the scientific documentation; these data share some terms of the Computer Science lexicon, although the terms belong to different branches (Linguistics, Informatics and Telematics). The second section describes the contents managed by PUMA (Publication Management System). The third section gives a statistical representation of the terms extracted from the archive: comparison tables of the occurrences of the most used terms in the scientific documentation produced by the three institutes, together with diagrams showing the percentages of the most frequently used terms. Lastly, indexes and concordances make it possible to reflect on the use of certain terms in this field and offer possible keys for accessing knowledge extraction in the digital era.
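The comparison of term occurrences across the three institutes, and the percentages derived from them, can be illustrated with a minimal sketch. It is not the authors' procedure: the toy documents, the term list and the function names below are all assumptions made for illustration, since the record does not state which terms were tracked or how the archive was queried.

```python
from collections import Counter
import re

# Hypothetical toy corpus: (institute, text) pairs standing in for archive records.
SAMPLE_DOCS = [
    ("ISTI", "Digital library services and information retrieval for digital archives."),
    ("IIT", "Network protocols and information security on the network."),
    ("ILC", "A computational lexicon for information extraction and the lexicon of corpora."),
]

# Hypothetical term list; the record does not list the terms actually studied.
TERMS = ["information", "network", "lexicon", "digital"]


def term_counts_by_institute(docs, terms):
    """Return {institute: Counter(term -> occurrences)} for single-word terms."""
    wanted = set(terms)
    counts = {}
    for institute, text in docs:
        # Lowercase and keep alphabetic tokens only (a deliberately crude tokenizer).
        tokens = re.findall(r"[a-z]+", text.lower())
        counter = counts.setdefault(institute, Counter())
        counter.update(t for t in tokens if t in wanted)
    return counts


def percentages(counter):
    """Share of each term among all tracked-term occurrences, in percent."""
    total = sum(counter.values())
    if total == 0:
        return {}
    return {term: 100.0 * n / total for term, n in counter.items()}


counts = term_counts_by_institute(SAMPLE_DOCS, TERMS)
for institute, counter in counts.items():
    print(institute, dict(counter), percentages(counter))
```

A real analysis would presumably also need multi-word terms, lemmatization and a breakdown by publication year to follow changes over time; none of that is attempted here.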
DC Field Value Language
dc.authority.orgunit Istituto di linguistica computazionale "Antonio Zampolli" - ILC -
dc.authority.orgunit Istituto di Scienza e Tecnologie dell'Informazione "Alessandro Faedo" - ISTI -
dc.authority.people Pardelli G it
dc.authority.people Sassi M it
dc.authority.people Orsolini P it
dc.authority.people Biagioni S it
dc.authority.people Giannini S it
dc.collection.id.s 71c7200a-7c5f-4e83-8d57-d3d2ba88f40d *
dc.collection.name 04.01 Contributo in Atti di convegno *
dc.contributor.appartenenza Istituto di Scienza e Tecnologie dell'Informazione "Alessandro Faedo" - ISTI *
dc.contributor.appartenenza Istituto di linguistica computazionale "Antonio Zampolli" - ILC *
dc.contributor.appartenenza.mi 918 *
dc.contributor.appartenenza.mi 973 *
dc.date.accessioned 2024/02/17 19:24:38 -
dc.date.available 2024/02/17 19:24:38 -
dc.date.issued 2011 -
dc.description.affiliations CNR-ILC, Pisa, Italy; CNR-ILC, Pisa, Italy; CNR-ILC, Pisa, Italy; CNR-ISTI, Pisa, Italy; CNR-ISTI, Pisa, Italy; -
dc.description.allpeople Pardelli, G; Sassi, M; Orsolini, P; Biagioni, S; Giannini, S -
dc.description.allpeopleoriginal Pardelli G., Sassi M., Orsolini P., Biagioni S., Giannini S. -
dc.description.fulltext restricted en
dc.description.note ISBN 978-959-7174-19-6 PUMA code: cnr.ilc/2011-A2-002; PUMA code: cnr.isti/2011-A2-001; Publisher: Centro de linguística aplicada, Ministerio de ciencia, tecnología y medio ambiente, Santiago de Cuba (Cuba). -
dc.description.numberofauthors 5 -
dc.identifier.isbn 978-959-7174-19-6 -
dc.identifier.uri https://hdl.handle.net/20.500.14243/21492 -
dc.identifier.url http://www.santiago.cu/hosting/linguistica/simposios.php?s=XII -
dc.language.iso eng -
dc.publisher.country CUB -
dc.publisher.name Centro de linguística aplicada, Ministerio de ciencia, tecnología y medio ambiente -
dc.publisher.place Santiago de Cuba -
dc.relation.alleditors Leonel Ruiz Miyares, María Rosa Álvarez Silva -
dc.relation.conferencedate 17-21 January 2011 -
dc.relation.conferencename Comunicación Social en el Siglo XXI. XII Simposio Internacional de Comunicacion Social -
dc.relation.conferenceplace Santiago de Cuba -
dc.relation.firstpage 914 -
dc.relation.ispartofbook Comunicacion social en el siglo XXI, vol. II -
dc.relation.lastpage 918 -
dc.subject.keywords Digital Archives -
dc.subject.keywords Communication -
dc.subject.keywords Terminology -
dc.subject.keywords Open Access -
dc.subject.singlekeyword Digital Archives *
dc.subject.singlekeyword Communication *
dc.subject.singlekeyword Terminology *
dc.subject.singlekeyword Open Access *
dc.title An open archive of scientific communication en
dc.type.driver info:eu-repo/semantics/conferenceObject -
dc.type.full 04 Contributo in convegno::04.01 Contributo in Atti di convegno it
dc.type.miur 273 -
dc.type.referee Yes, but type not specified -
dc.ugov.descaux1 199282 -
iris.mediafilter.data 2025/04/18 03:21:38 *
iris.orcid.lastModifiedDate 2024/04/04 17:47:31 *
iris.orcid.lastModifiedMillisecond 1712245651581 *
iris.sitodocente.maxattempts 1 -
Appears in types: 04.01 Contributo in Atti di convegno
Files in this item:
File: prod_199282-doc_113416.pdf
Access: authorized users only
Description: An Open Archive of Scientific Communication
Type: Publisher's version (PDF)
Size: 632.08 kB
Format: Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/20.500.14243/21492