Cloud computing in a distributed e-infrastructure using the Web processing service standard

Coro G; Panichi G; Scarponi P; Pagano P
2017

Abstract

New Science paradigms have recently evolved to promote open publication of scientific findings as well as multi-disciplinary collaborative approaches to scientific experimentation. These approaches can address modern scientific challenges but must deal with the large quantities of data produced by industrial and scientific experiments. These data, so-called "Big Data", require new Computer Science systems that help scientists cooperate, extract information, and possibly produce new knowledge from the data. E-Infrastructures are distributed computer systems that foster collaboration between users and can embed distributed and parallel processing systems to manage Big Data. However, to meet the requirements of modern Science, e-Infrastructures in turn impose several requirements on computational systems, e.g. being economically sustainable; managing community-provided processes; using standard representations for processes and data; managing Big Data size and heterogeneous representations; supporting reproducible Science, collaborative experimentation, and cooperative online environments; and managing security and privacy for data and services. In this paper, we present a Cloud computing system (gCube DataMiner) that meets these requirements and operates in an e-Infrastructure, while sharing characteristics with state-of-the-art Cloud computing systems. To this end, DataMiner uses the Web Processing Service standard of the Open Geospatial Consortium and introduces features such as collaborative experimental spaces and automatic installation of processes and services, on top of a flexible and sustainable Cloud computing architecture. We compare DataMiner with another mature Cloud computing system and highlight the benefits our system brings, the new-paradigm requirements it satisfies, and the applications that can be developed on top of it.
Istituto di Scienza e Tecnologie dell'Informazione "Alessandro Faedo" - ISTI
Data Mining
Parallel Processing
Cloud Computing
Big Data Processing
Distributed Systems
e-Infrastructures
WPS
Science 2.0
Files in this record:

prod_374316-doc_125533.pdf
Open Access from 10/07/2018
Description: Cloud computing in a distributed e-infrastructure using the Web processing service standard
Type: Publisher's version (PDF)
Size: 1.83 MB
Format: Adobe PDF

prod_374316-doc_125534.pdf
Open Access from 10/07/2018
Description: Cloud computing in a distributed e-infrastructure using the Web processing service standard
Type: Publisher's version (PDF)
Size: 2.06 MB
Format: Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/20.500.14243/337334