Deep neural networks have been regarded as accurate models to predict complex fermentation processes due to their capacity to learn from a high number of data sets via artificial intelligence. To enhance the performance of these models the main challenge is to select the appropriate hyperparameters, such as neural network architecture (number of hidden layers and hidden units), activation function, optimizer, learning rate, and others. This study aims to use a hybrid Bayesian optimization with tree-structured Parzen estimator (BO-TPE) to predict biogas production from real wastewater treatment plant data using deep neural network machine learning (DNN) with optimized hyperparameters. Due to the high number of missing sample measurement records, the data preprocessing process has been performed in three sequential steps: the first step is to remove columns with a high percentage of missing values. The second step concerns removing rows with a high number of missing values. The remaining missing values in the dataset were removed in the third step using the dropna function from Pandas' library. Then, on the remaining data, three normalization techniques (MinMaxScaler, RobustScaler, and StandardScaler) were used for 16 relevant features of the anaerobic digestion process (AD) and compared with Non-Normalized data. The RobustScaler technique demonstrated good prediction capabilities of biogas volume produced. The maximum predicted volume was 2236.105 Nm3/d, with the coefficient of determination (R2), the mean absolute error (MAE), and the root mean square error (RMSE) values of 0.712, 164.610, and 223.429, respectively. This value remains slightly higher than the actual biogas volume produced in the wastewater treatment plant, which was 2131 Nm3/d. Adopting StandardScaler, MinMaxScaler, and Non-Normalized data provided statistical performance indicator values of (0.690; 170.518; 231.519), (0.679; 180.575; 235.648), and (0.647; 183.403; 247.203), respectively. These findings offer an autonomous strategy to monitor the effective operational variables of a large-scale digester, ensuring the stability of the complex fermentation process as well as the long-term sustainability and economic viability of the renewable energy approach in the sector of waste treatment.

DNN model development of biogas production from an anaerobic wastewater treatment plant using Bayesian hyperparameter optimization

Marra, Francesco Saverio
2023

Abstract

Deep neural networks have been regarded as accurate models to predict complex fermentation processes due to their capacity to learn from a high number of data sets via artificial intelligence. To enhance the performance of these models the main challenge is to select the appropriate hyperparameters, such as neural network architecture (number of hidden layers and hidden units), activation function, optimizer, learning rate, and others. This study aims to use a hybrid Bayesian optimization with tree-structured Parzen estimator (BO-TPE) to predict biogas production from real wastewater treatment plant data using deep neural network machine learning (DNN) with optimized hyperparameters. Due to the high number of missing sample measurement records, the data preprocessing process has been performed in three sequential steps: the first step is to remove columns with a high percentage of missing values. The second step concerns removing rows with a high number of missing values. The remaining missing values in the dataset were removed in the third step using the dropna function from Pandas' library. Then, on the remaining data, three normalization techniques (MinMaxScaler, RobustScaler, and StandardScaler) were used for 16 relevant features of the anaerobic digestion process (AD) and compared with Non-Normalized data. The RobustScaler technique demonstrated good prediction capabilities of biogas volume produced. The maximum predicted volume was 2236.105 Nm3/d, with the coefficient of determination (R2), the mean absolute error (MAE), and the root mean square error (RMSE) values of 0.712, 164.610, and 223.429, respectively. This value remains slightly higher than the actual biogas volume produced in the wastewater treatment plant, which was 2131 Nm3/d. Adopting StandardScaler, MinMaxScaler, and Non-Normalized data provided statistical performance indicator values of (0.690; 170.518; 231.519), (0.679; 180.575; 235.648), and (0.647; 183.403; 247.203), respectively. These findings offer an autonomous strategy to monitor the effective operational variables of a large-scale digester, ensuring the stability of the complex fermentation process as well as the long-term sustainability and economic viability of the renewable energy approach in the sector of waste treatment.
2023
Istituto di Scienze e Tecnologie per l'Energia e la Mobilità Sostenibili - STEMS
Anaerobic digestion
Biogas
Deep neural network
Hybrid BO-TPE
Hyperparameters
File in questo prodotto:
File Dimensione Formato  
Chemical Engineering Journal_471144671_2023.pdf

solo utenti autorizzati

Descrizione: Articolo pdf
Tipologia: Versione Editoriale (PDF)
Licenza: NON PUBBLICO - Accesso privato/ristretto
Dimensione 7.75 MB
Formato Adobe PDF
7.75 MB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/533270
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 26
  • ???jsp.display-item.citation.isi??? 23
social impact