SCOPUS Information Search Platform

Volume , Issue , 2009, Pages

Auto-tuning 3-D FFT library for CUDA GPUs

Author keywords

[No Author keywords available]

Indexed keywords

AUTOTUNING; DENSE KERNELS; NUMBER OF THREADS; PROBLEM SIZE; SHARED MEMORIES; SINGLE PROCESSORS; TRANSFORM SIZE;

FAST FOURIER TRANSFORMS; PROGRAM PROCESSORS; TUNING;

THREE DIMENSIONAL;

EID: 74049114159 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1145/1654059.1654090 Document Type: Conference Paper

Times cited : (105)

References (19)

1
- 84968470212
- An Algorithm for the Machine Calculation of Complex Fourier Series
- J. W. Cooley and J. W. Tukey. An Algorithm for the Machine Calculation of Complex Fourier Series. Math. Comput., Vol. 19:297-301, 1965.
- (1965) Math. Comput , vol.19 , pp. 297-301
- Cooley, J.W.¹ Tukey, J.W.²

3
- 20744449792
- M. Frigo and S. G. Johnson. The Design and Implementation of FFTW3. Proceedings of the IEEE, 93(2):216-231, 2005. special issue on "Program Generation, Optimization, and Platform Adaptation".

4
- 74049116773
- General-Purpose Computation Using Graphics Hardware. http://www.gpgpu.org/.

6
- 70350754502
- High Performance Discrete Fourier Transforms on Graphics Processors
- N. K. Govindaraju, B. Lloyd, Y. Dotsenko, B. Smith, and J. Manferdelli. High Performance Discrete Fourier Transforms on Graphics Processors. In the 2008 ACM/IEEE conference on supercomputing, 2008.
- (2008) the 2008 ACM/IEEE conference on supercomputing
- Govindaraju, N.K.¹ Lloyd, B.² Dotsenko, Y.³ Smith, B.⁴ Manferdelli, J.⁵

7
- 74349092397
- Khronos Group
- Khronos Group. OpenCL - The open standard for parallel programming of heterogeneous systems. http://www.khronos.org/opencl/.
- OpenCL - The open standard for parallel programming of heterogeneous systems

10
- 35048828869
- The FFT on a GPU
- K. Moreland and E. Angel. The FFT on a GPU. In Proceedings of SIGGRAPH/Eurographics Workshop on Graphics Hardware 2003, pages 112-119, 2003.
- (2003) Proceedings of SIGGRAPH/Eurographics Workshop on Graphics Hardware 2003 , pp. 112-119
- Moreland, K.¹ Angel, E.²

13
- 19344368072
- M. Püschel, J. M. F. Moura, J. Johnson, D. Padua, M. Veloso, B. Singer, J. Xiong, F. Franchetti, A. Gacic, Y. Voronenko, K. Chen, R. W. Johnson, and N. Rizzolo. SPIRAL: Code Generation for DSP Transforms. Proceedings of the IEEE, special issue on "Program Generation, Optimization, and Adaptation", 93(2):232-275, 2005.

14
- 54049117366
- Implementing a GPU-efficient FFT
- J. Spitzer. Implementing a GPU-efficient FFT. In SIGGRAPH Course on Interactive Geometric and Scientific Computations with Graphics Hardware, 2003.
- (2003) SIGGRAPH Course on Interactive Geometric and Scientific Computations with Graphics Hardware
- Spitzer, J.¹

16
- 0003417587
- SIAM Press, Philadelphia, PA
- C. Van Loan. Computational Frameworks for the Fast Fourier Transform. SIAM Press, Philadelphia, PA, 1992.
- (1992) Computational Frameworks for the Fast Fourier Transform
- Van Loan, C.¹

18
- 68849103234
- V. Volkov and B. Kazian. Fitting FFT onto the G80 architecture, 2008. http://www.cs.berkeley.edu/~kubitron/courses/cs258-S08/projects/reports/project6-report.pdf.
- (2008) Fitting FFT onto the G80 architecture
- Volkov, V.¹ Kazian, B.²

19
- 0343462141
- Automated empirical optimizations of software and the ATLAS project
- R. C. Whaley, A. Petitet, and J. J. Dongarra. Automated empirical optimizations of software and the ATLAS project. Parallel Computing, 27(1-2):3-35, 2001.
- (2001) Parallel Computing , vol.27 , Issue.1-2 , pp. 3-35
- Whaley, R.C.¹ Petitet, A.² Dongarra, J.J.³

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.