期刊论文详细信息
PATTERN RECOGNITION 卷:45
Scalable image quality assessment with 2D mel-cepstrum and machine learning approach
Article
Narwaria, Manish1  Lin, Weisi1  Cetin, A. Enis2 
[1] Nanyang Technol Univ, Sch Comp Engn, Singapore 639798, Singapore
[2] Bilkent Univ, Dept Elect & Elect Engn, TR-06800 Ankara, Turkey
关键词: Image quality assessment;    Machine learning;    Feature extraction;    20 mel-cepstral features;   
DOI  :  10.1016/j.patcog.2011.06.023
来源: Elsevier
PDF
【 摘 要 】

Measurement of image quality is of fundamental importance to numerous image and video processing applications. Objective image quality assessment (IQA) is a two-stage process comprising of the following: (a) extraction of important information and discarding the redundant one, (b) pooling the detected features using appropriate weights. These two stages are not easy to tackle due to the complex nature of the human visual system (HVS). In this paper, we first investigate image features based on two-dimensional (20) mel-cepstrum for the purpose of IQA. It is shown that these features are effective since they can represent the structural information, which is crucial for IQA. Moreover, they are also beneficial in a reduced-reference scenario where only partial reference image information is used for quality assessment. We address the second issue by exploiting machine learning. In our opinion, the well established methodology of machine learning/pattern recognition has not been adequately used for IQA so far; we believe that it will be an effective tool for feature pooling since the required weights/parameters can be determined in a more convincing way via training with the ground truth obtained according to subjective scores. This helps to overcome the limitations of the existing pooling methods, which tend to be over simplistic and lack theoretical justification. Therefore, we propose a new metric by formulating IQA as a pattern recognition problem. Extensive experiments conducted using six publicly available image databases (totally 3211 images with diverse distortions) and one video database (with 78 video sequences) demonstrate the effectiveness and efficiency of the proposed metric, in comparison with seven relevant existing metrics. (C) 2011 Elsevier Ltd. All rights reserved.

【 授权许可】

Free   

【 预 览 】
附件列表
Files Size Format View
10_1016_j_patcog_2011_06_023.pdf 898KB PDF download
  文献评价指标  
  下载次数:5次 浏览次数:0次