Journal article

Journal article details

IEICE Electronics Express
Sample-wise dynamic precision quantization for neural network acceleration
article
Bowen Li¹ Dongliang Xiong¹ Kai Huang¹ Xiaowen Jiang¹ Hao Yao² Junjian Chen² Luc Claesen³
[1] School of Micro-Nano Electronics, Zhejiang University
[2] Digital Grid Research Institute
[3] Engineering Technology-Electronics-ICT Department, University of Hasselt
Keywords: convolutional neural networks; dynamic quantization; hardware accelerators
DOI : 10.1587/elex.19.20220229
Subject classification: Electronic, Optical and Magnetic Materials
Source: Denshi Jouhou Tsuushin Gakkai

【Abstract】

Quantization is a well-known method for compressing and accelerating deep neural networks (DNNs). In this work, we propose the Sample-Wise Dynamic Precision (SWDP) quantization scheme, which switches the bit-width of weights and activations in the model at runtime according to the task difficulty of each input sample. Using low-precision networks for easy input images improves computational and energy efficiency. We also propose an adaptive hardware design for the efficient implementation of SWDP networks. Experimental results on various networks and datasets demonstrate that SWDP achieves an average 3.3× speedup and 3.0× energy saving over BitFusion, a bit-level dynamically composable architecture.
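
The idea of choosing a bit-width per input sample can be illustrated with a small sketch. The abstract does not say how SWDP estimates task difficulty, so the code below assumes a simple stand-in: run a low-precision pass first and fall back to higher precision when the softmax confidence is below a threshold. The function names (`quantize`, `linear`, `swdp_predict`), the 4-bit and 8-bit settings, and the 0.8 threshold are all hypothetical choices for illustration, not the authors' method.

```python
import math


def quantize(values, bits):
    """Symmetric uniform fake-quantization of a list of floats (bits >= 2)."""
    max_abs = max(abs(v) for v in values)
    if max_abs == 0.0:
        return list(values)
    levels = 2 ** (bits - 1) - 1  # e.g. 7 for 4 bits, 127 for 8 bits
    scale = max_abs / levels
    return [round(v / scale) * scale for v in values]


def linear(weights, x, bits):
    """One linear layer with weights and activations quantized to `bits`."""
    wq = [quantize(row, bits) for row in weights]
    xq = quantize(x, bits)
    return [sum(w * a for w, a in zip(row, xq)) for row in wq]


def softmax(logits):
    m = max(logits)
    exps = [math.exp(v - m) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]


def swdp_predict(weights, x, low_bits=4, high_bits=8, threshold=0.8):
    """Return (predicted class, bit-width used).

    Assumption: confidence of the low-precision pass is used as a proxy
    for sample difficulty; the paper's actual criterion may differ.
    """
    probs = softmax(linear(weights, x, low_bits))
    if max(probs) >= threshold:
        return probs.index(max(probs)), low_bits  # easy sample: stay low
    probs = softmax(linear(weights, x, high_bits))
    return probs.index(max(probs)), high_bits     # hard sample: go high


weights = [[4.0, 0.0], [0.0, 4.0]]
easy = swdp_predict(weights, [3.0, 0.0])    # clearly separated logits
hard = swdp_predict(weights, [0.5, 0.45])   # nearly tied logits
```

In this toy setup the clearly separated input is served by the 4-bit path and the nearly tied input is rerun at 8 bits. That is the kind of per-sample trade-off the abstract describes, although a real design would avoid the wasted low-precision pass where possible.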

【License】

CC BY

