期刊论文详细信息
Journal of vision
An image-computable psychophysical spatial vision model
Felix A. Wichmann1  Heiko H. Schütt2 
[1] Department of Experimental and Biological Psychology, University of Potsdam, Germany;Neural Information Processing Group, University of Tübingen, Tübingen, Germany
关键词: perceptual masking;    psychophysics;    noise;    spatial vision;    spatial frequency;    natural scenes;    filters;    pixel;    luminance;   
DOI  :  10.1167/17.12.12
学科分类:眼科学
来源: Association for Research in Vision and Ophthalmology
PDF
【 摘 要 】

A large part of classical visual psychophysics was concerned with the fundamental question of how pattern information is initially encoded in the human visual system. From these studies a relatively standard model of early spatial vision emerged, based on spatial frequency and orientation-specific channels followed by an accelerating nonlinearity and divisive normalization: contrast gain-control. Here we implement such a model in an image-computable way, allowing it to take arbitrary luminance images as input. Testing our implementation on classical psychophysical data, we find that it explains contrast detection data including the ModelFest data, contrast discrimination data, and oblique masking data, using a single set of parameters. Leveraging the advantage of an image-computable model, we test our model against a recent dataset using natural images as masks. We find that the model explains these data reasonably well, too. To explain data obtained at different presentation durations, our model requires different parameters to achieve an acceptable fit. In addition, we show that contrast gain-control with the fitted parameters results in a very sparse encoding of luminance information, in line with notions from efficient coding. Translating the standard early spatial vision model to be image-computable resulted in two further insights: First, the nonlinear processing requires a denser sampling of spatial frequency and orientation than optimal coding suggests. Second, the normalization needs to be fairly local in space to fit the data obtained with natural image masks. Finally, our image-computable model can serve as tool in future quantitative analyses: It allows optimized stimuli to be used to test the model and variants of it, with potential applications as an image-quality metric. In addition, it may serve as a building block for models of higher level processing.

【 授权许可】

CC BY   

【 预 览 】
附件列表
Files Size Format View
RO201902198927197ZK.pdf 2795KB PDF download
  文献评价指标  
  下载次数:8次 浏览次数:23次