Modern wastewater treatment plants base their biological processes on advanced control systems which ensure compliance with discharge limits and minimize energy consumption by responding to information from on-line probes. The correct probe readings are particularly crucial for intermittent aeration controllers, which rely on real-time measurements of ammonia and oxygen in biological tanks. These data are an important resource for developing artificial intelligence algorithms that can identify process or sensor anomalies. However, using anomaly detection and classification algorithms in real-time wastewater treatment is challenging due to the multiclass and imbalanced nature of the problem, the difficulty in obtaining labeled data from real plants, and the complex and interdependent mechanisms that govern biological processes. This thesis introduces a solution that uses machine learning to detect anomalies within wastewater treatment plants, focusing on activated sludge compartments and systems that utilize intermittent aeration based on ammonia and oxygen measurements. The study analyzes the main anomalies that may arise in such systems (including both sensor inaccuracies and process-related issues), explores the features that can enable their detection using only common available measurements, and develops a multiclass classification model and suitable post-processing and automation strategies. Among the tested models, the best-performing were tree-based algorithms, particularly gradient boosting methods such as LightGBM. This model was implemented in real plants as a Decision Support System that can alert plant operators, and subsequently integrated into a new aeration controller that automatically reacts to events without the need of operator intervention, improving operational efficiency and reaching a recall of 82% and a precision of 75%. To address the scarcity of labeled data, an active learning methodology was employed, specifically uncertainty sampling, to iteratively select the most informative samples for annotation. This approach enabled efficient model adaptation to new plants with minimal labeling effort. Tests on operational plants showed significant improvements in anomaly detection, reducing labeling time and achieving optimal performance with only 6% of labeled data.

Machine Learning Methods for Wastewater Treatment Plants / Bellamoli, Francesca. - (2025 Apr 11), pp. 1-146.

Machine Learning Methods for Wastewater Treatment Plants

Bellamoli, Francesca
2025-04-11

Abstract

Modern wastewater treatment plants base their biological processes on advanced control systems which ensure compliance with discharge limits and minimize energy consumption by responding to information from on-line probes. The correct probe readings are particularly crucial for intermittent aeration controllers, which rely on real-time measurements of ammonia and oxygen in biological tanks. These data are an important resource for developing artificial intelligence algorithms that can identify process or sensor anomalies. However, using anomaly detection and classification algorithms in real-time wastewater treatment is challenging due to the multiclass and imbalanced nature of the problem, the difficulty in obtaining labeled data from real plants, and the complex and interdependent mechanisms that govern biological processes. This thesis introduces a solution that uses machine learning to detect anomalies within wastewater treatment plants, focusing on activated sludge compartments and systems that utilize intermittent aeration based on ammonia and oxygen measurements. The study analyzes the main anomalies that may arise in such systems (including both sensor inaccuracies and process-related issues), explores the features that can enable their detection using only common available measurements, and develops a multiclass classification model and suitable post-processing and automation strategies. Among the tested models, the best-performing were tree-based algorithms, particularly gradient boosting methods such as LightGBM. This model was implemented in real plants as a Decision Support System that can alert plant operators, and subsequently integrated into a new aeration controller that automatically reacts to events without the need of operator intervention, improving operational efficiency and reaching a recall of 82% and a precision of 75%. To address the scarcity of labeled data, an active learning methodology was employed, specifically uncertainty sampling, to iteratively select the most informative samples for annotation. This approach enabled efficient model adaptation to new plants with minimal labeling effort. Tests on operational plants showed significant improvements in anomaly detection, reducing labeling time and achieving optimal performance with only 6% of labeled data.
11-apr-2025
XXXVII
2023-2024
Ingegneria e scienza dell'Informaz (29/10/12-)
Industrial Innovation
Melgani, Farid
no
Inglese
File in questo prodotto:
File Dimensione Formato  
PhDThesis_FrancescaBellamoli.pdf

accesso aperto

Descrizione: Machine Learning Methods for Wastewater Treatment Plants
Tipologia: Tesi di dottorato (Doctoral Thesis)
Licenza: Tutti i diritti riservati (All rights reserved)
Dimensione 96.08 MB
Formato Adobe PDF
96.08 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11572/450038
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
  • OpenAlex ND
social impact