期刊论文详细信息
Electronics
MemBox: Shared Memory Device for Memory-Centric Computing Applicable to Deep Learning Problems
Yongseok Choi1  Eunji Lim1  Jaekwon Shin2  Cheol-Hoon Lee3 
[1] Artificial Intelligence Research Laboratory, ETRI, Daejeon 34129, Korea;Aviation Drone Laboratory, LIG Nex1, Yongin 16961, Korea;Department of Computer Engineering, Chungnam National University, Daejeon 34134, Korea;
关键词: distributed system;    shared memory;    deep learning;    big data;    FPGA;    ASIC;   
DOI  :  10.3390/electronics10212720
来源: DOAJ
【 摘 要 】

Large-scale computational problems that need to be addressed in modern computers, such as deep learning or big data analysis, cannot be solved in a single computer, but can be solved with distributed computer systems. Since most distributed computing systems, consisting of a large number of networked computers, should propagate their computational results to each other, they can suffer the problem of an increasing overhead, resulting in lower computational efficiencies. To solve these problems, we proposed an architecture of a distributed system that used a shared memory that is simultaneously accessible by multiple computers. Our architecture aimed to be implemented in FPGA or ASIC. Using an FPGA board that implemented our architecture, we configured the actual distributed system and showed the feasibility of our system. We compared the results of the deep learning application test using our architecture with that using Google Tensorflow’s parameter server mechanism. We showed improvements in our architecture beyond Google Tensorflow’s parameter server mechanism and we determined the future direction of research by deriving the expected problems.

【 授权许可】

Unknown   

  文献评价指标  
  下载次数:0次 浏览次数:0次