期刊论文详细信息
Electronics 卷:10
Similarity-Aware Architecture/Compiler Co-Designed Context-Reduction Framework for Modulo-Scheduled CGRA
Qin Wang1  Zhongyuan Zhao1  Weiguang Sheng1  Zhigang Mao1  Jinchao Li2  Pengfei Ye3 
[1] Department of Micro/Nano Electronics, Shanghai Jiaotong University, Shanghai 200240, China;
[2] Huawei Technologies Shanghai, Shanghai 200299, China;
[3] Intel Aisa Pacific Development, Shanghai 200241, China;
关键词: CGRA;    similarity-aware;    context reduction;    modulo scheduling;    simulated annealing;   
DOI  :  10.3390/electronics10182210
来源: DOAJ
【 摘 要 】

Modulo-scheduled coarse-grained reconfigurable array (CGRA) processors have shown their potential for exploiting loop-level parallelism at high energy efficiency. However, these CGRAs need frequent reconfiguration during their execution, which makes them suffer from large area and power overhead for context memory and context-fetching. To tackle this challenge, this paper uses an architecture/compiler co-designed method for context reduction. From an architecture perspective, we carefully partition the context into several subsections and only fetch the subsections that are different to the former context word whenever fetching the new context. We package each different subsection with an opcode and index value to formulate a context-fetching primitive (CFP) and explore the hardware design space by providing the centralized and distributed CFP-fetching CGRA to support this CFP-based context-fetching scheme. From the software side, we develop a similarity-aware tuning algorithm and integrate it into state-of-the-art modulo scheduling and memory access conflict optimization algorithms. The whole compilation flow can efficiently improve the similarities between contexts in each PE for the purpose of reducing both context-fetching latency and context footprint. Experimental results show that our HW/SW co-designed framework can improve the area efficiency and energy efficiency to at most 34% and 21% higher with only 2% performance overhead.

【 授权许可】

Unknown   

  文献评价指标  
  下载次数:0次 浏览次数:0次