Introduction

GPUFFTW is a fast FFT library designed to exploit the computational performance and memory bandwidth on GPUs. Our library exploits the data parallelism available on current GPUs and pipelines the computation to the different stages of the graphics processor. Moreover, our library uses an efficient tiling strategy to further improve the memory performance of our algorithm. GPUFFTW can efficiently handle large real and complex 1-D arrays at 32-bit floating point precision on commodity GPUs. Using a NVIDIA 8800 GPU and the FFTW metric for measuring performance, our algorithm is able to achieve over 29 GFLOPS of performance on large 1-D FFTs. Furthermore, our FFT algorithm achieves comparable precision to the IEEE 32-bit FFT algorithms on CPUs even on large 1-D arrays. The library supports both Windows and Linux platforms.

Please refer to the documentation for details regarding the API and the contents of the distribution. Also, please read through the system requirements below before using the library.

Note: GPUFFTW does run correctly on Windows XP and 8800 GTX using the latest NVIDIA drivers 158.19. It also runs on Windows Vista and earlier NVIDIA GPUs and drivers