Pursuing Energy Saving and Thermal Comfort With a Human-Driven DRL Approach
Luigi Scarcello;Franco Cicirelli;Antonio Guerrieri;Carlo Mastroianni;Giandomenico Spezzano;Andrea Vinci
2023
Abstract
The management of thermal comfort in a building is a challenging and multifaceted problem, because objective parameters, such as energy consumption, must be combined with subjective requirements related to human profiles and preferences. This article exploits cognitive technologies, based on deep reinforcement learning (DRL), for the automatic control of the heating, ventilation, and air conditioning system in an office. The learning process is driven by a reward that includes multiple components, related to energy consumption, indoor temperature, and user perceptions, which are inferred from human interactions with the system. This approach is inspired by the human-in-the-loop paradigm, which in our case helps the DRL controller to learn the requirements of users and readily adapt to them. Experimental results show that an appropriate balance of the reward components can be efficiently exploited to give the desired importance to the different objectives.

File | Size | Format | License |
---|---|---|---|
Pursuing_Energy_Saving_and_Thermal_Comfort_With_a_Human-Driven_DRL_Approach.pdf (authorized users only) | 6.04 MB | Adobe PDF | Not public: private/restricted access |
Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.