期刊论文详细信息
IEEE Access 卷:9
User Driven FPGA-Based Design Automated Framework of Deep Neural Networks for Low-Power Low-Cost Edge Computing
Marcelo A. C. Fernandes1  Maria Gracielly F. Coutinho2  Carlos Valderrama Sakuyama3  Chokri Souani3  Tarek Belabed4 
[1] Polytechnique, SEMi, Mons, Belgium;
[2] de Mons, Facult&x00E9;
[3] Department of Computer and Automation Engineering, Federal University of Rio Grande do Norte, Natal, Brazil;
[4] Universit&x00E9;
关键词: Deep learning;    electronic design automation;    edge computing;    FPGA;    low power systems;   
DOI  :  10.1109/ACCESS.2021.3090196
来源: DOAJ
【 摘 要 】

Deep Learning techniques have been successfully applied to solve many Artificial Intelligence (AI) applications problems. However, owing to topologies with many hidden layers, Deep Neural Networks (DNNs) have high computational complexity, which makes their deployment difficult in contexts highly constrained by requirements such as performance, real-time processing, or energy efficiency. Numerous hardware/software optimization techniques using GPUs, ASICs, and reconfigurable computing (i.e, FPGAs), have been proposed in the literature. With FPGAs, very specialized architectures have been developed to provide an optimal balance between high-speed and low power. However, when targeting edge computing, user requirements and hardware constraints must be efficiently met. Therefore, in this work, we only focus on reconfigurable embedded systems based on the Xilinx ZYNQ SoC and popular DNNs that can be implemented on Embedded Edge improving performance per watt while maintaining accuracy. In this context, we propose an automated framework for the implementation of hardware-accelerated DNN architectures. This framework provides an end-to-end solution that facilitates the efficient deployment of topologies on FPGAs by combining custom hardware scalability with optimization strategies. Cutting-edge comparisons and experimental results demonstrate that the architectures developed by our framework offer the best compromise between performance, energy consumption, and system costs. For instance, the low power (0.266W) DNN topologies generated for the MNIST database achieved a high throughput of 3,626 FPS.

【 授权许可】

Unknown   

  文献评价指标  
  下载次数:0次 浏览次数:4次