CNR Institutional Research Information System

Centralised vs decentralised anomaly detection: when local and imbalanced data are beneficial

M. Nardi; L. Valerio; A. Passarella

2021

Abstract

In this paper, we address the problem of anomaly detection in decentralised settings. We take inspiration from the current edge computing trend, which pushes towards the development of decentralised ML algorithms, i.e., the devices that collect or generate data are in charge of collaborating to train the ML models without sharing raw data. The challenges connected to this scenario are that (i) the data distributions of local datasets might differ, (ii) data is very often unlabelled, and (iii) devices have limited computational resources. We address them by proposing an unsupervised ensemble method for decentralised anomaly detection in which the base learners are lightweight autoencoders. We aim to investigate whether an ensemble of lightweight models trained in isolation on non-IID and unlabelled local data can compete with heavier models trained in centralised settings. In a multi-category anomaly detection task, our results show that our method successfully exploits the data imbalance to make accurate predictions.
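The idea of an ensemble of lightweight autoencoders trained in isolation on local data can be illustrated with a minimal sketch. It is not the authors' implementation. Each base learner is assumed here to be a rank-1 linear autoencoder fitted by power iteration (equivalent to one-component PCA), standing in for whatever lightweight architecture the paper uses. The two synthetic "devices", the function names, and the choice of taking the minimum reconstruction error across members as the ensemble score are all illustrative assumptions, since the abstract does not specify them.

```python
import math
import random

def fit_linear_autoencoder(data, iters=50):
    """Fit a rank-1 linear autoencoder with tied weights via power iteration.
    Assumption: this stands in for a lightweight autoencoder base learner."""
    dim = len(data[0])
    mean = [sum(row[j] for row in data) / len(data) for j in range(dim)]
    centred = [[row[j] - mean[j] for j in range(dim)] for row in data]
    w = [1.0] + [0.0] * (dim - 1)  # initial direction
    for _ in range(iters):
        # One power-iteration step: w <- X^T (X w), then normalise.
        proj = [sum(r[j] * w[j] for j in range(dim)) for r in centred]
        new_w = [sum(p * r[j] for p, r in zip(proj, centred)) for j in range(dim)]
        norm = math.sqrt(sum(v * v for v in new_w)) or 1.0
        w = [v / norm for v in new_w]
    return mean, w

def reconstruction_error(model, x):
    """Squared error between x and its encode-decode reconstruction."""
    mean, w = model
    c = [xi - mi for xi, mi in zip(x, mean)]
    code = sum(ci * wi for ci, wi in zip(c, w))   # encoder: 1-D code
    recon = [code * wi for wi in w]               # decoder: back to input space
    return sum((ci - ri) ** 2 for ci, ri in zip(c, recon))

def ensemble_score(models, x):
    """Assumed aggregation rule: a point is normal if at least one local
    model reconstructs it well, so take the minimum error over members."""
    return min(reconstruction_error(m, x) for m in models)

random.seed(0)
# Two hypothetical devices with different (non-IID) unlabelled local data.
device_a = [(t, t + random.gauss(0, 0.05))
            for t in (random.uniform(-1, 1) for _ in range(200))]
device_b = [(5 + t, -t + random.gauss(0, 0.05))
            for t in (random.uniform(-1, 1) for _ in range(200))]

# Each model is trained in isolation; no raw data is shared between devices.
models = [fit_linear_autoencoder(device_a), fit_linear_autoencoder(device_b)]

score_normal = ensemble_score(models, (0.5, 0.5))     # resembles device A data
score_anomaly = ensemble_score(models, (0.5, -0.5))   # resembles neither device
print(score_normal, score_anomaly)
```

In this toy setting the point that matches one device's local distribution receives a much lower score than the point that matches neither, which is the behaviour an ensemble of locally specialised models is meant to exploit.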

Year: 2021
Organisational units: Istituto di informatica e telematica - IIT
Keywords: centralised vs decentralised; unsupervised anomaly detection; data imbalance; autoencoders ensemble
Appears in types: 04.01 Conference proceedings contribution

Files in this record:

File	Size	Format
prod_460319-doc_179432.pdf (authorised users only)	364.3 kB	Adobe PDF

Description: Centralised vs decentralised anomaly detection
Type: Publisher's version (PDF)
Licence: No licence declared (not attributable to records after 2023)

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/20.500.14243/449385
