期刊论文详细信息
IEEE Journal on Exploratory Solid-State Computational Devices and Circuits 卷:6
A Fully Integrated Reprogrammable CMOS-RRAM Compute-in-Memory Coprocessor for Neuromorphic Applications
Vishishtha Bothra1  Fuxi Cai2  Justin M. Correll3  Seungjong Lee3  Zhengya Zhang3  Wei D. Lu3  Michael P. Flynn3  Seung Hwan Lee4  Yong Lim5 
[1] Apple, Cupertino, CA, USA;
[2] Applied Materials, Santa Clara, CA;
[3] Department of Electrical Engineering and Computer Science, University of Michigan, Ann Arbor, USA;
[4] Intel, Albuquerque, NM, USA;
[5] Samsung Electronics, Yongin, South Korea;
关键词: ADC;    analog;    compute-in-memory;    DAC;    resistive random access memory (RRAM);    vector-matrix multiplication (VMM);   
DOI  :  10.1109/JXCDC.2020.2992228
来源: DOAJ
【 摘 要 】

Analog compute-in-memory with resistive random access memory (RRAM) devices promises to overcome the data movement bottleneck in data-intensive artificial intelligence (AI) and machine learning. RRAM crossbar arrays improve the efficiency of vector-matrix multiplications (VMMs), which is a vital operation in these applications. The prototype IC is the first complete, fully integrated analog-RRAM CMOS coprocessor. This article focuses on the digital and analog circuitry that supports efficient and flexible RRAM-based computation. A passive $54\times108$ RRAM crossbar array performs VMM in the analog domain. Specialized mixed-signal circuits stimulate and read the outputs of the RRAM crossbar. The single-chip CMOS prototype includes a reduced instruction set computer (RISC) processor interfaced to a memory-mapped mixed-signal core. In the mixed-signal core, ADCs and DACs interface with the passive RRAM crossbar. The RISC processor controls the mixed-signal circuits and the algorithm data path. The system is fully programmable and supports forward and backward propagation. As proof of concept, a fully integrated 0.18- $\mu \text{m}$ CMOS prototype with a postprocessed RRAM array demonstrates several key functions of machine learning, including online learning. The mixed-signal core consumes 64 mW at an operating frequency of 148 MHz. The total system power consumption considering the mixed-signal circuitry, the digital processor, and the passive RRAM array is 307 mW. The maximum theoretical throughput is 2.6 GOPS at an efficiency of 8.5 GOPS/W.

【 授权许可】

Unknown   

  文献评价指标  
  下载次数:0次 浏览次数:0次