期刊论文详细信息
Sensors
Explaining Neural Networks Using Attentive Knowledge Distillation
Hyeonseok Lee1  Sungchan Kim1 
[1] Division of Computer Science and Engineering, Jeonbuk National University, Jeollabuk-do 54896, Korea;
关键词: deep neural networks;    visual explanation;    attention;    knowledge distillation;    fine-grained classification;   
DOI  :  10.3390/s21041280
来源: DOAJ
【 摘 要 】

Explaining the prediction of deep neural networks makes the networks more understandable and trusted, leading to their use in various mission critical tasks. Recent progress in the learning capability of networks has primarily been due to the enormous number of model parameters, so that it is usually hard to interpret their operations, as opposed to classical white-box models. For this purpose, generating saliency maps is a popular approach to identify the important input features used for the model prediction. Existing explanation methods typically only use the output of the last convolution layer of the model to generate a saliency map, lacking the information included in intermediate layers. Thus, the corresponding explanations are coarse and result in limited accuracy. Although the accuracy can be improved by iteratively developing a saliency map, this is too time-consuming and is thus impractical. To address these problems, we proposed a novel approach to explain the model prediction by developing an attentive surrogate network using the knowledge distillation. The surrogate network aims to generate a fine-grained saliency map corresponding to the model prediction using meaningful regional information presented over all network layers. Experiments demonstrated that the saliency maps are the result of spatially attentive features learned from the distillation. Thus, they are useful for fine-grained classification tasks. Moreover, the proposed method runs at the rate of 24.3 frames per second, which is much faster than the existing methods by orders of magnitude.

【 授权许可】

Unknown   

  文献评价指标  
  下载次数:0次 浏览次数:0次