期刊论文详细信息
EURASIP Journal on Advances in Signal Processing
PSENet-based efficient scene text detection
Guanglong Liao1  Zhongjie Zhu1  Tingna Liu1  Yongqiang Bai1  Zhibo Xie1 
[1] Ningbo Key Lab of Digital Signal Processing, Zhejiang Wanli University, 315100, Ningbo, China;
关键词: Scene text detection;    PSENet;    Mixed Pooling Module;   
DOI  :  10.1186/s13634-021-00808-5
来源: Springer
PDF
【 摘 要 】

Text detection is a key technique and plays an important role in computer vision applications, but efficient and precise text detection is still challenging. In this paper, an efficient scene text detection scheme is proposed based on the Progressive Scale Expansion Network (PSENet). A Mixed Pooling Module (MPM) is designed to effectively capture the dependence of text information at different distances, where different pooling operations are employed to better extract information of text shape. The backbone network is optimized by combining two extensions of the Residual Network (ResNet), i.e., ResNeXt and Res2Net, to enhance feature extraction effectiveness. Experimental results show that the precision of our scheme is improved more than by 5% compared with the original PSENet.

【 授权许可】

CC BY   

【 预 览 】
附件列表
Files Size Format View
RO202110286463752ZK.pdf 1396KB PDF download
  文献评价指标  
  下载次数:2次 浏览次数:3次