IEEE Access | |
Super-Resolution Integrated Building Semantic Segmentation for Multi-Source Remote Sensing Imagery | |
Xiaowei Shao1  Xiaoya Song1  Xiaodan Shi1  Yongwei Xu1  Wei Yuan1  Ryosuke Shibasaki1  Zhiling Guo1  Haoran Zhang1  Guangming Wu1  Mingzhou Xu1  Qi Chen2  | |
[1] Center for Spatial Information Science, The University of Tokyo, Kashiwa, Japan;School of Geography and Information Engineering, China University of Geosciences, Wuhan, China; | |
关键词: Building segmentation; deep learning; remote sensing; super-resolution; | |
DOI : 10.1109/ACCESS.2019.2928646 | |
来源: DOAJ |
【 摘 要 】
Multi-source remote sensing imagery has become widely accessible owing to the development of data acquisition systems. In this paper, we address the challenging task of the semantic segmentation of buildings via multi-source remote sensing imagery with different spatial resolutions. Unlike previous works that mainly focused on optimizing the segmentation model, which did not enable the severe problems caused by the unaligned resolution between the training and testing data to be fundamentally solved, we propose to integrate SR techniques with the existing framework to enhance the segmentation performance. The feasibility of the proposed method was evaluated by utilizing representative multi-source study materials: high-resolution (HR) aerial and low-resolution (LR) panchromatic satellite imagery as the training and testing data, respectively. Instead of directly conducting building segmentation from the LR imagery by using the model trained using the HR imagery, the deep learning-based super-resolution (SR) model was first adopted to super-resolved LR imagery into SR space, which could mitigate the influence of the difference in resolution between the training and testing data. The experimental results obtained from the test area in Tokyo, Japan, demonstrate that the proposed SR-integrated method significantly outperforms that without SR, improving the Jaccard index and kappa by approximately 19.01% and 19.10%, respectively. The results confirmed that the proposed method is a viable tool for building semantic segmentation, especially when the resolution is unaligned.
【 授权许可】
Unknown