Thesis Details
A faster FFT in the Mid-West
Yee, Alexander; Snir, Marc
Keywords: High Performance Computing (HPC); Fast Fourier Transform (FFT); Libraries
Others  :  https://www.ideals.illinois.edu/bitstream/handle/2142/44116/Alexander_Yee.pdf?sequence=1&isAllowed=y
United States | English
Source: The Illinois Digital Environment for Access to Learning and Scholarship
【 Abstract 】

FFT implementations today generally fall into two categories: library generators (such as FFTW and Spiral) and specialized FFTs (such as prime95). Specialized FFTs have the obvious limitation of being specialized; however, they are hand-tuned and generally offer superior performance. Library generators are generic and easier to port, but their performance is generally suboptimal. We describe in this paper an FFT library that was built while paying special attention to locality. The library achieves significantly better performance than FFTW for long vectors. Unlike FFTW or Spiral, the recursive decomposition of the FFT is not created by a library generator; it is created by macro expansion that has a few selectable parameters. This provides an interface that can be more easily modified by users.
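
As a rough illustration of what a recursive FFT decomposition with a selectable parameter can look like, the sketch below implements a textbook radix-2 decimation-in-time FFT in Python. It is not the thesis's macro-based implementation, and it makes no attempt at the locality optimizations the abstract mentions. The function names and the `base_size` parameter are hypothetical, chosen only for this example.

```python
import cmath


def dft_direct(x):
    """O(n^2) DFT, used here as the base case for short inputs."""
    n = len(x)
    return [
        sum(x[j] * cmath.exp(-2j * cmath.pi * j * k / n) for j in range(n))
        for k in range(n)
    ]


def fft_recursive(x, base_size=4):
    """Radix-2 decimation-in-time FFT; len(x) must be a power of two.

    base_size is a hypothetical tuning knob (an assumption of this
    sketch): inputs of this length or shorter are handled by the
    direct DFT instead of recursing further.
    """
    n = len(x)
    if n == 0 or n & (n - 1) != 0:
        raise ValueError("length must be a power of two")
    if n <= base_size or n == 1:
        return dft_direct(x)
    # Split into even- and odd-indexed halves and transform each.
    even = fft_recursive(x[0::2], base_size)
    odd = fft_recursive(x[1::2], base_size)
    half = n // 2
    out = [0j] * n
    for k in range(half):
        # Combine the two half-size transforms with a twiddle factor.
        t = cmath.exp(-2j * cmath.pi * k / n) * odd[k]
        out[k] = even[k] + t
        out[k + half] = even[k] - t
    return out
```

Changing `base_size` alters where the recursion stops without changing the result, which is the kind of knob a parameterized decomposition exposes. In a compiled library the equivalent choice would typically be fixed at build time rather than passed at run time.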

【 Preview 】
Attachments
Files | Size | Format | View
A faster FFT in the Mid-West | 916KB | PDF | download
  Document metrics
  Downloads: 7  Views: 15