Background and Objective: Contemporary Machine Learning approaches (e.g., Deep Learning) need huge volumes of data to build accurate and robust statistical models. Nowadays, very often, such data are collected by distinct and geographically distributed entities and successively transmitted to and stored by centralized nodes that implement the learning process. This practice, however, exposes data to security and privacy risks that may be even unacceptable in those environments regulated by the General Data Protection Regulation (GDPR). Methods: This paper defines a novel Federated Learning approach that avoids the transmission of sensitive data over the network and improves over the classic federated learning schemes by reducing the communication costs. This approach has been validated concerning a healthcare case study that aimed at building a Time-Series based predictive model to identify the level of risk for patients suffering from hypertension. Results: Experimental validation has shown that the proposed approach achieves excellent results both in terms of classification accuracy, superior to the state-of-the-art models with an improvement ranging from 3.01% to 11.09%, and in terms of communication costs with a reduction of about 34%. Conclusion: The analysis of the proposed approach shows promising results in terms of performance and communication cost.

Definition of a novel federated learning approach to reduce communication costs

Paragliola G;Coronato A
2022

Abstract

Background and Objective: Contemporary Machine Learning approaches (e.g., Deep Learning) need huge volumes of data to build accurate and robust statistical models. Nowadays, very often, such data are collected by distinct and geographically distributed entities and successively transmitted to and stored by centralized nodes that implement the learning process. This practice, however, exposes data to security and privacy risks that may be even unacceptable in those environments regulated by the General Data Protection Regulation (GDPR). Methods: This paper defines a novel Federated Learning approach that avoids the transmission of sensitive data over the network and improves over the classic federated learning schemes by reducing the communication costs. This approach has been validated concerning a healthcare case study that aimed at building a Time-Series based predictive model to identify the level of risk for patients suffering from hypertension. Results: Experimental validation has shown that the proposed approach achieves excellent results both in terms of classification accuracy, superior to the state-of-the-art models with an improvement ranging from 3.01% to 11.09%, and in terms of communication costs with a reduction of about 34%. Conclusion: The analysis of the proposed approach shows promising results in terms of performance and communication cost.
2022
Istituto di Calcolo e Reti ad Alte Prestazioni - ICAR
Federated learning
Self-adaptive systems
Time series analysis
classification
Communication costs
Healthcare informatics
File in questo prodotto:
File Dimensione Formato  
prod_458808-doc_178470.pdf

solo utenti autorizzati

Descrizione: manuscript
Tipologia: Versione Editoriale (PDF)
Licenza: Nessuna licenza dichiarata (non attribuibile a prodotti successivi al 2023)
Dimensione 3.38 MB
Formato Adobe PDF
3.38 MB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/429200
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 39
  • ???jsp.display-item.citation.isi??? 32
social impact