The management of thermal comfort in a building is a challenging and multi-faced problem because it requires considering subjective parameters, such as human preferences and behaviors, and also objective parameters, which can be related to other environmental aspects like the reduction of energy consumption. This chapter exploits cognitive technologies, based on deep reinforcement learning (DRL), for automatically learning how to control the HVAC system in an office. The goal is to develop a cyber-controller able to minimize both the perceived thermal discomfort and the needed energy. The learning process is driven through the definition of a cumulative reward, which includes and combines two reward components that consider, respectively, user comfort and energy consumption. Moreover, a human reward, inferred by the frequency of user interactions with the HVAC system, helps the DRL controller learn the requirements of users and readily adapt to them. Simulation experiments are performed to assess the impact that the two components of the reward have on the behavior of the DRL controller and on the learning process.

Cognitive Systems for Energy Efficiency and Thermal Comfort in Smart Buildings

Scarcello L;Mastroianni C
2023

Abstract

The management of thermal comfort in a building is a challenging and multi-faced problem because it requires considering subjective parameters, such as human preferences and behaviors, and also objective parameters, which can be related to other environmental aspects like the reduction of energy consumption. This chapter exploits cognitive technologies, based on deep reinforcement learning (DRL), for automatically learning how to control the HVAC system in an office. The goal is to develop a cyber-controller able to minimize both the perceived thermal discomfort and the needed energy. The learning process is driven through the definition of a cumulative reward, which includes and combines two reward components that consider, respectively, user comfort and energy consumption. Moreover, a human reward, inferred by the frequency of user interactions with the HVAC system, helps the DRL controller learn the requirements of users and readily adapt to them. Simulation experiments are performed to assess the impact that the two components of the reward have on the behavior of the DRL controller and on the learning process.
2023
Istituto di Calcolo e Reti ad Alte Prestazioni - ICAR
Energy management systems
Deep reinforcement learning
User comfort
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/437183
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact