Building detection in VHR remote sensing images using a novel dual attention residual-based U-Net (DAttResU-Net)

An application to generating building change maps

verfasst von
Ehsan Khankeshizadeh, Ali Mohammadzadeh, Amin Mohsenifar, Armin Moghimi, Saied Pirasteh, Sheng Feng, Keli Hu, Jonathan Li
Abstract

In today's era, increasing access to very high-resolution remote sensing images (VHR-RSIs) has enhanced building detection and change assessment capabilities. These applications provide accurate urban mapping, facilitate effective land management, and support disaster assessment by delivering detailed insights into building structures and their temporal changes. This study uses a two-stage process to present a pioneering approach for generating precise building maps (BMs) and subsequent building change maps (BCMs) from VHR-RSIs. The primary question addressed by the research is how to enhance the U-Net architecture to improve its sensitivity to both high-level semantic features (HLSF) and low-level spatial features (LLSF) in the building detection task. For this purpose, in the initial stage of the method, a novel deep learning model called dual attention residual-based U-Net (DAttResU-Net) is introduced. This model incorporates two significant modifications to the conventional U-Net, enhancing its capacity to yield bi-temporal BMs. Firstly, each standard convolutional block (CB) is replaced with an optimized CB incorporating a channel-spatial attention module attuned to the building objects' crucial HLSF. Secondly, an additional attention module is integrated into the encoder-decoder path of the model, heightening the sensitivity of U-Net to vital LLSF of buildings while disregarding extraneous background spatial information during the fusion of HLSF and LLSF. In the subsequent stage, the bi-temporal BMs generated by the DAttResU-Net are subjected to a box-based class-object change detection methodology to produce accurate BCMs. The effectiveness of the proposed architecture is rigorously evaluated against state-of-the-art models in both BM and BCM generation contexts, utilizing the well-established WHU dataset for experimentation. The experimental results indicated that the DAttResU-Net model, boasting an average of PFN/ PFP value of 2.33/1.34 (%) surpasses the performance of the state-of-the-art models in generating bi-temporal BMs. Furthermore, the building change detection outcomes demonstrated the proficient role of the bi-temporal BMs predicted by the proposed model in leading to the most optimal BCMs, exhibiting average PFN/ PFP value of 2.63/8.93 (%), outperforming comparative networks. Finally, we concluded that the proposed DAttResU-Net architecture is a highly promising and applicable model for producing reliable BMs and BCMs.

Organisationseinheit(en)
Ludwig-Franzius-Institut für Wasserbau, Ästuar- und Küsteningenieurwesen
Externe Organisation(en)
K.N. Toosi University of Technology
Shaoxing University
University of Waterloo
Typ
Artikel
Journal
Remote Sensing Applications: Society and Environment
Band
36
Publikationsdatum
11.2024
Publikationsstatus
Veröffentlicht
Peer-reviewed
Ja
ASJC Scopus Sachgebiete
Geografie, Planung und Entwicklung, Computer in den Geowissenschaften
Ziele für nachhaltige Entwicklung
SDG 11 – Nachhaltige Städte und Gemeinschaften
Elektronische Version(en)
https://doi.org/10.1016/j.rsase.2024.101336 (Zugang: Geschlossen)