学位论文详细信息
Software Thread Integration for Converting TLP to ILP on VLIW/EPIC Architectures
Smart RTOS;Software Thread Integration;STI;VLIW;EPIC;TLP;ILP
So, Won ; Eric Rotenberg, Committee Member,Tom Conte, Committee Member,Alexander G. Dean, Committee Chair,So, Won ; Eric Rotenberg ; Committee Member ; Tom Conte ; Committee Member ; Alexander G. Dean ; Committee Chair
University:North Carolina State University
关键词: Smart RTOS;    Software Thread Integration;    STI;    VLIW;    EPIC;    TLP;    ILP;   
Others  :  https://repository.lib.ncsu.edu/bitstream/handle/1840.16/73/etd.pdf?sequence=1&isAllowed=y
美国|英语
来源: null
PDF
【 摘 要 】
Multimedia applications are pervasive in modern systems. They generally require a significantly higher level of performance than previous workloads of embedded systems. They have driven digital signal processor makers to adopt high-performance architectures like VLIW (Very-Long Instruction Word) or EPIC (Explicitly Parallel Instruction Computing). Despite many efforts to exploit instruction level parallelism (ILP) in the application, typical utilization levels for compiler-generated VLIW/EPIC code range from one-eighth to one-half because a single instruction stream has limited ILP. Software Thread Integration (STI) is a software technique which interleaves multiple threads at the machine instruction level. Integration of threads increases the number of independent instructions, allowing the compiler to generate a more efficient instruction schedule and hence faster runtime performance. We have developed techniques to use STI for converting thread level parallelism (TLP) to ILP on VLIW/EPIC architectures. By focusing on the abundant parallelism at the procedure level in the multimedia applications, we integrate parallel procedure calls, which can be seen as threads, by gathering work in the application. We rely on the programmer to identify parallel procedures, rather than rely on compiler identification. Our methods extend whole-program optimization by expanding the scope of the compiler through software thread integration and procedure cloning. It is effectively a superset of loop jamming as it allows a larger variety of threads to be jammed together. This thesis proposes a methodology to integrate multiple threads in multimedia applications and introduce the concept of a 'Smart RTOS' as an execution model for utilizing integrated threads efficiently in embedded systems. We demonstrate our technique by integrating three procedures from a JPEG application at C source code level, compiling with four compilers for the Itanium EPIC architecture and measuring the performance with the on-chip performance measurement units. Experimental results show procedure speedup of up to 18% and program speedup up to 11%. Detailed performance analysis demonstrates the primary bottleneck to be the Itanium's 16K instruction cache, which has limited room for the code expansion by STI.
【 预 览 】
附件列表
Files Size Format View
Software Thread Integration for Converting TLP to ILP on VLIW/EPIC Architectures 516KB PDF download
  文献评价指标  
  下载次数:12次 浏览次数:28次