General Convolutional Neural Networks (CNN) process the data in the spatial domain and lack of efficacy in modeling the spectral context in multispectral image processing task. To address this limitation, in this letter we propose a Wavelet Inspired Attention-based Convolution Neural Network (WIANet) architecture that combines the modeling of the spatio-spectral resolution of multispectral remote sensing images with a wavelet convolution and attention unit into a single deep learning architecture for land cover classification. Our aim is to add the characteristics of both Wavelet Transform and attention mechanism into an UNet based architecture to better exploit the spectral and texture information for distinguishing classes with high similarity in the spectral signatures. We evaluate the performance of the proposed approach on a multi-label multispectral Sentinel-2 dataset. The experiment shows that the proposed approach performs better than the reference methods in different conditions on the number of training samples.
WIANet: A Wavelet-Inspired Attention-based Convolution Neural Network for Land Cover Classification / Singh, Abhishek; Bruzzone, Lorenzo. - In: IEEE GEOSCIENCE AND REMOTE SENSING LETTERS. - ISSN 1558-0571. - ELETTRONICO. - 2022:(2022). [10.1109/LGRS.2022.3232224]
WIANet: A Wavelet-Inspired Attention-based Convolution Neural Network for Land Cover Classification
Abhishek SinghPrimo
;Lorenzo Bruzzone
Secondo
2022-01-01
Abstract
General Convolutional Neural Networks (CNN) process the data in the spatial domain and lack of efficacy in modeling the spectral context in multispectral image processing task. To address this limitation, in this letter we propose a Wavelet Inspired Attention-based Convolution Neural Network (WIANet) architecture that combines the modeling of the spatio-spectral resolution of multispectral remote sensing images with a wavelet convolution and attention unit into a single deep learning architecture for land cover classification. Our aim is to add the characteristics of both Wavelet Transform and attention mechanism into an UNet based architecture to better exploit the spectral and texture information for distinguishing classes with high similarity in the spectral signatures. We evaluate the performance of the proposed approach on a multi-label multispectral Sentinel-2 dataset. The experiment shows that the proposed approach performs better than the reference methods in different conditions on the number of training samples.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione