The public resource computing paradigm is often used as a successful and low cost mechanism for the management of several classes of scientific and commercial applications that require the execution of a large number of independent tasks. Public computing frameworks, also known as "Desktop Grids", exploit the computational power and storage facilities of private computers, or "workers". Despite the inherent decentralized nature of the applications for which they are devoted, these systems often adopt a centralized mechanism for the assignment of jobs and distribution of input data, as is the case for BOINC, the most popular framework in this realm. We present a decentralized framework that aims at increasing the flexibility and robustness of public computing applications, thanks to two basic features: (i) the adoption of a P2P protocol for dynamically matching the job specifications with the worker characteristics, without relying on centralized resources; (ii) the use of distributed cache servers for an efficient dissemination and reutilization of data files. This framework is exploitable for a wide set of applications. In this work, we describe how a Java prototype of the framework was used to tackle the problem of mining frequent itemsets from a transactional dataset, and show some preliminary yet interesting performance results that prove the efficiency improvements that can derive from the presented architecture.

Distributed Data Mining Using a Public Resource Computing Framework

Eugenio Cesario;Carlo Mastroianni;Domenico Talia
2009

Abstract

The public resource computing paradigm is often used as a successful and low cost mechanism for the management of several classes of scientific and commercial applications that require the execution of a large number of independent tasks. Public computing frameworks, also known as "Desktop Grids", exploit the computational power and storage facilities of private computers, or "workers". Despite the inherent decentralized nature of the applications for which they are devoted, these systems often adopt a centralized mechanism for the assignment of jobs and distribution of input data, as is the case for BOINC, the most popular framework in this realm. We present a decentralized framework that aims at increasing the flexibility and robustness of public computing applications, thanks to two basic features: (i) the adoption of a P2P protocol for dynamically matching the job specifications with the worker characteristics, without relying on centralized resources; (ii) the use of distributed cache servers for an efficient dissemination and reutilization of data files. This framework is exploitable for a wide set of applications. In this work, we describe how a Java prototype of the framework was used to tackle the problem of mining frequent itemsets from a transactional dataset, and show some preliminary yet interesting performance results that prove the efficiency improvements that can derive from the presented architecture.
2009
Istituto di Calcolo e Reti ad Alte Prestazioni - ICAR
Inglese
Frédéric Desprez; Vladimir Getov; Thierry Priol; Ramin Yahyapour
Grids, P2P and Services Computing
CoreGRID ERCIM Working Group Workshop on Grids, P2P and Service computing, in conjunction with EuroPar 2009
33
44
12
978-1-4419-6793-0
http://www.springerlink.com/content/m517u0634rq5u4xw/?MUD=MP
SPRINGER, 233 SPRING ST
NEW YORK, NY 10013
STATI UNITI D'AMERICA
Sì, ma tipo non specificato
Delft
Volunteer computing; Distributed Data Mining
4
none
Cesario, Eugenio; De Caria, Nicola; Mastroianni, Carlo; Talia, Domenico
273
info:eu-repo/semantics/conferenceObject
04 Contributo in convegno::04.01 Contributo in Atti di convegno
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/70956
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? 3
social impact