Journal article details
International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences
COMPARISON OF TWO METHODS FOR 2D POSE ESTIMATION OF INDUSTRIAL WORKPIECES IN IMAGES – CNN VS. CLASSICAL IMAGE PROCESSING SYSTEM
Siegfarth, C. [1]
[1] Institute of Photogrammetry and Remote Sensing (IPF), Karlsruhe Institute of Technology (KIT), Germany
Keywords: Automatic image analysis; CNN; Shape model; Industrial Application
DOI: 10.5194/isprs-archives-XLII-1-401-2018
Subject classification: Earth sciences (general)
Source: Copernicus Publications
【 Abstract 】

Today, automatic image analysis is one of the basic approaches in industrial applications. A frequent task is the pose estimation of objects, which can be solved by different image analysis methods. Two of them were selected and compared in this project: Convolutional Neural Networks (CNNs) and a classical image analysis method based on contour extraction. The main point of interest was to investigate the potential and limits of CNNs in fulfilling the requirements of this special task regarding accuracy, reliability and time performance. The classical approach served as a state-of-the-art reference for comparison. The workpiece for these investigations was a commonly used transistor element. As a database, an image archive of 9000 images with different illumination and perspective conditions was generated. One part was used for training the CNN and for creating a so-called shape model, respectively; the rest was used to investigate the extraction quality. With the CNN technique, two different approaches were realised. Although CNNs are predestined for classification, the classification approach delivered insufficient results. In a more sophisticated approach, the system learns the parameters of an affine transformation, which include the sought-after translation and rotation parameters. Our experiments confirm that CNNs obtain at best only a medium accuracy for rotation angles (about ±2°), in contrast to the classical approach (about ±0.5°). For translations, both methods deliver comparable results: about ±0.5 pixel for the CNN and about ±0.4 pixel for the classical approach.
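
To illustrate how a 2D pose (translation and rotation) relates to the parameters of an affine transformation, the following is a minimal sketch, not the authors' implementation. It assumes a 2x3 matrix in the common form [[cos θ, −sin θ, tx], [sin θ, cos θ, ty]]; the abstract does not state how the network's affine parameters are laid out, and all function names here are hypothetical.

```python
import math

def pose_to_affine(tx, ty, theta_deg):
    """Build a 2x3 affine matrix for a rotation by theta_deg followed by a translation.

    Assumed layout (not taken from the paper): [[c, -s, tx], [s, c, ty]].
    """
    t = math.radians(theta_deg)
    c, s = math.cos(t), math.sin(t)
    return [[c, -s, tx],
            [s,  c, ty]]

def affine_to_pose(a):
    """Recover (tx, ty, theta_deg) from a 2x3 matrix that is approximately rigid.

    atan2 on the first column tolerates a small uniform scale error
    in the estimated matrix.
    """
    theta_deg = math.degrees(math.atan2(a[1][0], a[0][0]))
    return a[0][2], a[1][2], theta_deg

def angle_error_deg(estimated, reference):
    """Signed angular difference wrapped to [-180, 180) degrees."""
    return (estimated - reference + 180.0) % 360.0 - 180.0

# Round trip for an illustrative pose (values are made up for the example).
matrix = pose_to_affine(12.5, -3.0, 30.0)
tx, ty, theta = affine_to_pose(matrix)
```

Wrapping the angular difference matters when rotation errors such as the reported ±2° or ±0.5° are evaluated near the 0°/360° boundary, where a naive subtraction would report an error close to 360°.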

【 License 】

CC BY   
