Electronics | |
FASSD: A Feature Fusion and Spatial Attention-Based Single Shot Detector for Small Object Detection | |
Deng Jiang1  Zhen Zuo1  Peng Wu1  Xiaopeng Tan1  Bei Sun1  Shaojing Su1  | |
[1] College of Intelligence Science and Technology, National University of Defense Technology, Changsha 410073, China; | |
关键词: small object detection; feature fusion; spatial attention; deep learning; | |
DOI : 10.3390/electronics9091536 | |
来源: DOAJ |
【 摘 要 】
Deep learning methods have significantly improved object detection performance, but small object detection remains an extremely difficult and challenging task in computer vision. We propose a feature fusion and spatial attention-based single shot detector (FASSD) for small object detection. We fuse high-level semantic information into shallow layers to generate discriminative feature representations for small objects. To adaptively enhance the expression of small object areas and suppress the feature response of background regions, the spatial attention block learns a self-attention mask to enhance the original feature maps. We also establish a small object dataset (LAKE-BOAT) of a scene with a boat on a lake and tested our algorithm to evaluate its performance. The results show that our FASSD achieves 79.3% mAP (mean average precision) on the PASCAL VOC2007 test with input 300 × 300, which outperforms the original single shot multibox detector (SSD) by 1.6 points, as well as most improved algorithms based on SSD. The corresponding detection speed was 45.3 FPS (frame per second) on the VOC2007 test using a single NVIDIA TITAN RTX GPU. The test results of a simplified FASSD on the LAKE-BOAT dataset indicate that our model achieved an improvement of 3.5% mAP on the baseline network while maintaining a real-time detection speed (64.4 FPS).
【 授权许可】
Unknown