The widespread diffusion of connected smart devices has greatly contributed to the rapid expansion and evolution of the Internet at its edge, where personal mobile devices follow the behavior of their human users and interact with other smart objects located in the surroundings. In such a scenario, the user context is represented by a large variety of information that can rapidly change, and the ability of personal mobile devices to locally process this data is fundamental to make the system able to quickly adapt its behavior to the current situation. This ability, in practice, can be represented by a single elaboration process integrated in the final user application, or by a middleware platform aimed at implementing different context processing and reasoning to support third-party applications. However, the lack of public datasets that take into account the complexity of the user context in the mobile environment strongly limits the advance of the research in this field. In this paper, we present MyDigitalFootprint, a novel large-scale dataset composed of smartphone embedded sensors data, physical proximity information, and Online Social Networks interactions aimed at supporting multimodal context-recognition and social relationships modeling. The dataset includes two months of measurements and information collected from the personal mobile devices of 31 volunteer users, in their natural environment, without limiting their usual behavior. Existing public datasets generally consist of a limited set of context data, aimed at optimizing specific application domains (human activity recognition is the most common example). On the contrary, our dataset contains a comprehensive set of information describing the user context in the mobile environment. In order to demonstrate the efficacy of the proposed dataset, we present three context-aware applications based on different machine learning tasks: (i) a social link prediction algorithm based on physical proximity data, (ii) the recognition of daily-life activities based on smartphone-embedded sensors data, and (iii) a pervasive context-aware recommender system. To the best of our knowledge, this is the first large-scale dataset containing such heterogeneity of information, representing an invaluable source of data to validate new research in mobile and edge computing.

MyDigitalFootprint: An extensive context dataset for pervasive computing applications at the edge

Campana MG;Delmastro F
2021

Abstract

The widespread diffusion of connected smart devices has greatly contributed to the rapid expansion and evolution of the Internet at its edge, where personal mobile devices follow the behavior of their human users and interact with other smart objects located in the surroundings. In such a scenario, the user context is represented by a large variety of information that can rapidly change, and the ability of personal mobile devices to locally process this data is fundamental to make the system able to quickly adapt its behavior to the current situation. This ability, in practice, can be represented by a single elaboration process integrated in the final user application, or by a middleware platform aimed at implementing different context processing and reasoning to support third-party applications. However, the lack of public datasets that take into account the complexity of the user context in the mobile environment strongly limits the advance of the research in this field. In this paper, we present MyDigitalFootprint, a novel large-scale dataset composed of smartphone embedded sensors data, physical proximity information, and Online Social Networks interactions aimed at supporting multimodal context-recognition and social relationships modeling. The dataset includes two months of measurements and information collected from the personal mobile devices of 31 volunteer users, in their natural environment, without limiting their usual behavior. Existing public datasets generally consist of a limited set of context data, aimed at optimizing specific application domains (human activity recognition is the most common example). On the contrary, our dataset contains a comprehensive set of information describing the user context in the mobile environment. In order to demonstrate the efficacy of the proposed dataset, we present three context-aware applications based on different machine learning tasks: (i) a social link prediction algorithm based on physical proximity data, (ii) the recognition of daily-life activities based on smartphone-embedded sensors data, and (iii) a pervasive context-aware recommender system. To the best of our knowledge, this is the first large-scale dataset containing such heterogeneity of information, representing an invaluable source of data to validate new research in mobile and edge computing.
2021
Istituto di informatica e telematica - IIT
Real-world dataset
Edge computing
Phone-embedded sensors
Online social network
Pervasive mobile computing
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/399377
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 3
  • ???jsp.display-item.citation.isi??? ND
social impact