The increasing use of intermittent aeration controllers in wastewater treatment plants (WWTPs) aims to reduce aeration costs via continuous ammonia and oxygen measurements but faces challenges in detecting sensor and process anomalies. Applying machine learning to this unbalanced, multivariate, multiclass classification challenge requires much data, difficult to obtain from a new plant. This study develops a machine learning algorithm to identify anomalies in intermittent aeration WWTPs, adaptable to new plants with limited data. Utilizing active learning, the method iteratively selects samples from the target domain to fine-tune a gradient-boosting model initially trained on data from 17 plants. Three sampling strategies were tested, with low probability and high entropy sampling proving effective in early adaptation, achieving an F2-score close to the optimal with minimal sample use. The objective is to deploy these models as decision support systems for WWTP management, providing a strategy for efficient model adaptation to new plants, and optimizing labeling efforts.

Domain adaptation through active learning strategies for anomaly classification in wastewater treatment plants / Bellamoli, Francesca; Vian, Marco; Di Iorio, Mattia; Melgani, Farid. - In: WATER SCIENCE AND TECHNOLOGY. - ISSN 0273-1223. - 90:11(2024), pp. 3123-3138. [10.2166/wst.2024.387]

Domain adaptation through active learning strategies for anomaly classification in wastewater treatment plants

Francesca Bellamoli;Marco Vian;Farid Melgani
2024-01-01

Abstract

The increasing use of intermittent aeration controllers in wastewater treatment plants (WWTPs) aims to reduce aeration costs via continuous ammonia and oxygen measurements but faces challenges in detecting sensor and process anomalies. Applying machine learning to this unbalanced, multivariate, multiclass classification challenge requires much data, difficult to obtain from a new plant. This study develops a machine learning algorithm to identify anomalies in intermittent aeration WWTPs, adaptable to new plants with limited data. Utilizing active learning, the method iteratively selects samples from the target domain to fine-tune a gradient-boosting model initially trained on data from 17 plants. Three sampling strategies were tested, with low probability and high entropy sampling proving effective in early adaptation, achieving an F2-score close to the optimal with minimal sample use. The objective is to deploy these models as decision support systems for WWTP management, providing a strategy for efficient model adaptation to new plants, and optimizing labeling efforts.
2024
11
Settore ING-INF/03 - Telecomunicazioni
Bellamoli, Francesca; Vian, Marco; Di Iorio, Mattia; Melgani, Farid
Domain adaptation through active learning strategies for anomaly classification in wastewater treatment plants / Bellamoli, Francesca; Vian, Marco; Di Iorio, Mattia; Melgani, Farid. - In: WATER SCIENCE AND TECHNOLOGY. - ISSN 0273-1223. - 90:11(2024), pp. 3123-3138. [10.2166/wst.2024.387]
File in questo prodotto:
File Dimensione Formato  
2024_WST-Active Learning-Francesca.pdf

accesso aperto

Tipologia: Versione editoriale (Publisher’s layout)
Licenza: Creative commons
Dimensione 703.1 kB
Formato Adobe PDF
703.1 kB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11572/444690
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? 0
  • OpenAlex ND
social impact