NEUROCOMPUTING | 卷:401 |
Single-shot bidirectional pyramid networks for high-quality object detection | |
Article | |
Wu, Xiongwei1  Sahoo, Doyen3  Zhang, Daoxin1,2  Zhu, Jianke1  Hoi, Steven C. H.1,3  | |
[1] Singapore Management Univ, Sch Informat Syst, Singapore, Singapore | |
[2] Zhejiang Univ, Coll Comp Sci & Technol, Hangzhou, Zhejiang, Peoples R China | |
[3] Salesforce Res Asia, Singapore, Singapore | |
关键词: Object detection; Deep learning; Computer vision; Anchor refinement; | |
DOI : 10.1016/j.neucom.2020.02.116 | |
来源: Elsevier | |
【 摘 要 】
Recent years have witnessed significant advances in deep learning based object detection. Despite being extensively explored, most existing detectors are designed to detect objects with relatively low-quality prediction of locations, i.e., they are often trained with the threshold of Intersection over Union (IoU) set as 0.5. This can yield low-quality or even noisy detections. Designing high quality object detectors which have a more precise localization (e.g. IoU > 0.5) remains an open challenge. In this paper, we propose a novel single-shot detection framework called Bidirectional Pyramid Networks (BPN) for high-quality object detection. It comprises two novel components: (i) Bidirectional Feature Pyramid structure and Anchor Refinement (AR). The bidirectional feature pyramid structure aims to use semantic-rich deep layer features to enhance the quality of the shallow layer features, and simultaneously use the spatially-rich shallow layer features to enhance the quality of deep layer features, leading to a stronger representation of both small and large objects for high quality detection. Our anchor refinement scheme gradually refines the quality of pre-designed anchors by learning multi-level regressors, giving more precise localization predictions. We performed extensive experiments on both PASCAL VOC and MSCOCO datasets, and achieved the best performance among all single-shot detectors. The performance was especially superior in the regime of high-quality detection. (C) 2020 Elsevier B.V. All rights reserved.
【 授权许可】
Free
【 预 览 】
Files | Size | Format | View |
---|---|---|---|
10_1016_j_neucom_2020_02_116.pdf | 1199KB | download |