The global scientific data infrastructures: the big data challenges

Thanos C
2010

Abstract

The current data tsunami, produced by new advanced instruments and sensors, by simulations, and by the progress of science itself, is revolutionizing the way research is conducted and poses new challenges to existing e-infrastructures on both the data and the application sides. Science is becoming data-dominated, and a new data-centric way of thinking about, organizing, and carrying out research activities is gaining ground. It needs to be supported by a new type of e-infrastructure: the Scientific Data Infrastructure. Scientific Data Infrastructures can be defined as managed, networked digital data environments consisting of services and tools that support the full life cycle of data (capture, collection, curation, documentation, analysis, visualization, preservation, and publication) for the benefit of the different communities of researchers involved in data-intensive activities. A Scientific Data Infrastructure should thus add, to the computational capacity provided by e-infrastructures, the capacity to handle and publish the current huge volumes of data effectively and efficiently. The next generation of scientific data infrastructures faces two main challenges: (i) to effectively and efficiently support data-intensive science, and (ii) to effectively and efficiently support multidisciplinary and interdisciplinary science. Developing such data infrastructures requires successfully tackling several data, application, system, and organization/policy challenges. The talk will address the main data challenges.
Istituto di Scienza e Tecnologie dell'Informazione "Alessandro Faedo" - ISTI
Digital Libraries
Scientific Data Infrastructures
Data-intensive activities
File: prod_120708-doc_132416.pdf (open access; publisher's version; Adobe PDF, 56.39 kB)


Use this identifier to cite or link to this document: https://hdl.handle.net/20.500.14243/86032