Evaluation of the trade-off between performance and communication costs in federated learning scenario

Paragliola G.
First author
2022

Abstract

Background and Objective: In traditional Machine Learning (ML) approaches, data are collected and stored by a single node and subsequently used for training and testing. However, acquiring and managing large amounts of data in some domains, such as healthcare, can be problematic because centralized architectures entail security and privacy risks. Federated Learning (FL) has recently emerged as a technological solution to these problems, although communication efficiency may be a significant issue. Methods: This paper presents a novel learning strategy aimed at reducing the total number of parameters shared during the FL process and, therefore, at evaluating the trade-off between the need to bring down communication costs and the need to guarantee the highest classification performance. Results: The results demonstrate the effectiveness of the solution in comparison with the traditional FedAvg algorithm: the accuracy of the proposed approach ranges from 89.25% to 96.6% and, in addition, the reduction in communication overhead ranges from 95.64% to 6%. Conclusion: The analysis of the proposed approach shows promising results in terms of performance and communication costs, especially with respect to the total amount of data moved, since the challenge addressed by the paper concerns communication efficiency during the training process.
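The abstract does not describe the proposed strategy in detail, so the following is only a generic sketch of the two quantities it compares: the standard FedAvg aggregation (a sample-size-weighted average of client parameters) and the number of parameters communicated when only some layers are shared. The layer names, client sizes, round count and the idea of sharing a subset of layers are illustrative assumptions, not the paper's method or results.

```python
from typing import Dict, List

# Hypothetical model representation: layer name -> flat list of parameters.
Weights = Dict[str, List[float]]


def fedavg(client_weights: List[Weights], client_sizes: List[int]) -> Weights:
    """Standard FedAvg aggregation: average weighted by local sample counts."""
    total = sum(client_sizes)
    aggregated: Weights = {}
    for layer in client_weights[0]:
        n_params = len(client_weights[0][layer])
        aggregated[layer] = [
            sum(w[layer][i] * n for w, n in zip(client_weights, client_sizes)) / total
            for i in range(n_params)
        ]
    return aggregated


def shared_parameter_count(
    weights: Weights, shared_layers: List[str], rounds: int, clients: int
) -> int:
    """Parameters uploaded over all rounds when only `shared_layers` are sent."""
    per_client = sum(len(weights[name]) for name in shared_layers)
    return per_client * clients * rounds


def reduction_percent(baseline: int, reduced: int) -> float:
    """Relative saving in communicated parameters, as a percentage."""
    return 100.0 * (baseline - reduced) / baseline


# Toy data (assumed): two clients, a 4-parameter "conv" layer, a 2-parameter "head".
clients = [
    {"conv": [1.0, 2.0, 3.0, 4.0], "head": [0.0, 1.0]},
    {"conv": [3.0, 4.0, 5.0, 6.0], "head": [2.0, 3.0]},
]
sizes = [100, 300]

global_weights = fedavg(clients, sizes)
full = shared_parameter_count(clients[0], ["conv", "head"], rounds=10, clients=2)
partial = shared_parameter_count(clients[0], ["head"], rounds=10, clients=2)
saving = reduction_percent(full, partial)
```

In this toy setting, sharing only the `head` layer cuts the communicated parameters from 120 to 40. The accuracy cost of such a reduction is what a performance versus communication trade-off has to weigh, and it cannot be inferred from a sketch like this one.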
Istituto di Calcolo e Reti ad Alte Prestazioni - ICAR - Sede Secondaria Napoli
Communication costs
Federated learning
Healthcare informatics
Self-adaptive systems
Time series analysis and classification

Use this identifier to cite or link to this document: https://hdl.handle.net/20.500.14243/517913