The identification of the most central nodes of a graph is a fundamental task of data analysis. The current flow betweenness is a centrality index which considers how the information flows along all the paths of a graph, not only on the shortest ones. Finding the exact value of the current flow betweenness is computationally expensive for large graphs, so the definition of algorithms returning an approximation of this measure is mandatory. In this paper we propose a solution, based on the Gather Apply Scatter model, that estimates the current flow betweenness in a distributed setting using the Apache Spark framework. The experimental evaluation shows that the algorithm achieves high correlation with the exact value of the index and outperforms other algorithms.

Current flow betweenness centrality with Apache Spark

2016

Abstract

The identification of the most central nodes of a graph is a fundamental task of data analysis. The current flow betweenness is a centrality index which considers how the information flows along all the paths of a graph, not only on the shortest ones. Finding the exact value of the current flow betweenness is computationally expensive for large graphs, so the definition of algorithms returning an approximation of this measure is mandatory. In this paper we propose a solution, based on the Gather Apply Scatter model, that estimates the current flow betweenness in a distributed setting using the Apache Spark framework. The experimental evaluation shows that the algorithm achieves high correlation with the exact value of the index and outperforms other algorithms.
2016
Istituto di Scienza e Tecnologie dell'Informazione "Alessandro Faedo" - ISTI
Apache Spark
Centrality measure
Thinking like a vertex
File in questo prodotto:
File Dimensione Formato  
prod_367414-doc_121549.pdf

solo utenti autorizzati

Descrizione: Current flow betweenness centrality with Apache Spark
Tipologia: Versione Editoriale (PDF)
Dimensione 455.12 kB
Formato Adobe PDF
455.12 kB Adobe PDF   Visualizza/Apri   Richiedi una copia
prod_367414-doc_157585.pdf

accesso aperto

Descrizione: postprint version
Tipologia: Versione Editoriale (PDF)
Dimensione 555.17 kB
Formato Adobe PDF
555.17 kB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/355298
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 1
  • ???jsp.display-item.citation.isi??? 1
social impact