The identification of the most central nodes of a graph is a fundamental task of data analysis. The current flow betweenness is a centrality index which considers how the information flows along all the paths of a graph, not only on the shortest ones. Finding the exact value of the current flow betweenness is computationally expensive for large graphs, so the definition of algorithms returning an approximation of this measure is mandatory. In this paper we propose a solution, based on the Gather Apply Scatter model, that estimates the current flow betweenness in a distributed setting using the Apache Spark framework. The experimental evaluation shows that the algorithm achieves high correlation with the exact value of the index and outperforms other algorithms.
Current flow betweenness centrality with Apache Spark
2016
Abstract
The identification of the most central nodes of a graph is a fundamental task of data analysis. The current flow betweenness is a centrality index which considers how the information flows along all the paths of a graph, not only on the shortest ones. Finding the exact value of the current flow betweenness is computationally expensive for large graphs, so the definition of algorithms returning an approximation of this measure is mandatory. In this paper we propose a solution, based on the Gather Apply Scatter model, that estimates the current flow betweenness in a distributed setting using the Apache Spark framework. The experimental evaluation shows that the algorithm achieves high correlation with the exact value of the index and outperforms other algorithms.File | Dimensione | Formato | |
---|---|---|---|
prod_367414-doc_121549.pdf
solo utenti autorizzati
Descrizione: Current flow betweenness centrality with Apache Spark
Tipologia:
Versione Editoriale (PDF)
Dimensione
455.12 kB
Formato
Adobe PDF
|
455.12 kB | Adobe PDF | Visualizza/Apri Richiedi una copia |
prod_367414-doc_157585.pdf
accesso aperto
Descrizione: postprint version
Tipologia:
Versione Editoriale (PDF)
Dimensione
555.17 kB
Formato
Adobe PDF
|
555.17 kB | Adobe PDF | Visualizza/Apri |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.