SCOPUS 정보 검색 플랫폼

Proceedings - IEEE 27th International Parallel and Distributed Processing Symposium, IPDPS 2013

Volumn , Issue , 2013, Pages 115-125

High performance FFT based poisson solver on a CPU-GPU heterogeneous platform

(2) Wu, Jing a JaJa, Joseph a

a UNIVERSITY OF MARYLAND (United States)

Author keywords

CUDA; Fast Fourier Transforms; GPU; Parallel and Vector Implementations; Poisson Equations

Indexed keywords

BETTER PERFORMANCE; CUDA; GPU; HETEROGENEOUS PLATFORMS; MEMORY-BOUND COMPUTATIONS; NEUMANN BOUNDARY CONDITION; PARALLEL AND VECTOR IMPLEMENTATIONS; PERIODIC BOUNDARY CONDITIONS;

BOUNDARY CONDITIONS; DATA TRANSFER; DISTRIBUTED PARAMETER NETWORKS; POISSON EQUATION; THREE DIMENSIONAL;

FAST FOURIER TRANSFORMS;

EID: 84884825242 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/IPDPS.2013.18 Document Type: Conference Paper

Times cited : (16)

References (18)

1
- 77954741573
- Large-scale FFT on GPU clusters
- New York, NY, USA, ACM
- Y. Chen, X. Cui, and H. Mei. Large-scale FFT on GPU clusters. In Proceedings of the 24th ACM International Conference on Supercomputing, ICS '10, pages 315-324, New York, NY, USA, 2010. ACM.
- (2010) Proceedings of the 24th ACM International Conference on Supercomputing, ICS '10 , pp. 315-324
- Chen, Y.¹ Cui, X.² Mei, H.³

2
- 84968470212
- An algorithm for the machine calculation of complex Fourier series
- J. Cooley and J. Tukey. An algorithm for the machine calculation of complex Fourier series. Mathematics of Computation, 19(90):297-301, 1965.
- (1965) Mathematics of Computation , vol.19 , Issue.90 , pp. 297-301
- Cooley, J.¹ Tukey, J.²

3
- 79952782168
- Auto-tuning of fast Fourier transform on graphics processors
- New York, NY, USA, ACM
- Y. Dotsenko, S. Baghsorkhi, B. Lloyd, and N. Govindaraju. Auto-tuning of fast Fourier transform on graphics processors. In Proceedings of the 16th ACM symposium on Principles and practice of parallel programming, PPoPP '11, pages 257-266, New York, NY, USA, 2011. ACM.
- (2011) Proceedings of the 16th ACM Symposium on Principles and Practice of Parallel Programming, PPoPP '11 , pp. 257-266
- Dotsenko, Y.¹ Baghsorkhi, S.² Lloyd, B.³ Govindaraju, N.⁴

4
- 0348209599
- A fast Fourier transform compiler
- M. Frigo. A fast Fourier transform compiler. SIGPLAN Not., 34(5):169-180, May 1999. (Pubitemid 129686086)
- (1999) SIGPLAN Notices (ACM Special Interest Group on Programming Languages) , vol.34 , Issue.5 , pp. 169-180
- Frigo, M.¹

5
- 84884849756
- website
- M. Frigo and G. Johnson. The FFTW website, 2012. http: //www.fftw.org.
- (2012)
- Frigo, M.¹ Johnson, G.²

6
- 20744449792
- The design and implementation of FFTW3
- M. Frigo, Steven, and G. Johnson. The design and implementation of FFTW3. In Proceedings of the IEEE, pages 216-231, 2005.
- (2005) Proceedings of the IEEE , pp. 216-231
- Frigo, M.¹ Steven² Johnson, G.³

7
- 78649807974
- Cyclic reduction tridiagonal solvers on GPUs applied to mixed-precision multigrid
- Jan.
- D. Goddeke and R. Strzodka. Cyclic reduction tridiagonal solvers on GPUs applied to mixed-precision multigrid. IEEE Trans. Parallel Distrib. Syst., 22(1):22-32, Jan. 2011.
- (2011) IEEE Trans. Parallel Distrib. Syst. , vol.22 , Issue.1 , pp. 22-32
- Goddeke, D.¹ Strzodka, R.²

8
- 34548292052
- A memory model for scientific algorithms on graphics processors
- ACM
- N. K. Govindaraju, S. Larsen, J. Gray, and D. Manocha. A memory model for scientific algorithms on graphics processors. In Proceedings of the 2006 ACM/IEEE conference on Supercomputing, SC '06, New York, NY, USA, 2006. ACM.
- Proceedings of the 2006 ACM/IEEE Conference on Supercomputing, SC '06, New York, NY, USA, 2006
- Govindaraju, N.K.¹ Larsen, S.² Gray, J.³ Manocha, D.⁴

9
- 70350754502
- High performance discrete Fourier transforms on graphics processors
- Piscataway, NJ, USA, IEEE Press
- N. K. Govindaraju, B. Lloyd, Y. Dotsenko, B. Smith, and J. Manferdelli. High performance discrete Fourier transforms on graphics processors. In Proceedings of the 2008 ACM/IEEE conference on Supercomputing, SC '08, pages 2:1-2:12, Piscataway, NJ, USA, 2008. IEEE Press.
- (2008) Proceedings of the 2008 ACM/IEEE Conference on Supercomputing, SC '08
- Govindaraju, N.K.¹ Lloyd, B.² Dotsenko, Y.³ Smith, B.⁴ Manferdelli, J.⁵

10
- 77954713684
- An empirically tuned 2D and 3D FFT library on CUDA GPU
- New York, NY, USA, ACM
- L. Gu, X. Li, and J. Siegel. An empirically tuned 2D and 3D FFT library on CUDA GPU. In Proceedings of the 24th ACM International Conference on Supercomputing, ICS '10, pages 305-314, New York, NY, USA, 2010. ACM.
- (2010) Proceedings of the 24th ACM International Conference on Supercomputing, ICS '10 , pp. 305-314
- Gu, L.¹ Li, X.² Siegel, J.³

11
- 79959598034
- Using GPUs to compute large out-of-card FFTs
- New York, NY, USA, ACM
- L. Gu, J. Siegel, and X. Li. Using GPUs to compute large out-of-card FFTs. In Proceedings of the international conference on Supercomputing, ICS '11, pages 255-264, New York, NY, USA, 2011. ACM.
- (2011) Proceedings of the International Conference on Supercomputing, ICS '11 , pp. 255-264
- Gu, L.¹ Siegel, J.² Li, X.³

12
- 14544293952
- Immersed boundary methods
- R. Mittal and G. Iaccarino. Immersed boundary methods. In Ann. Rev. Fluid Mech. 37, pages 239-261, 2005.
- (2005) Ann. Rev. Fluid Mech. , vol.37 , pp. 239-261
- Mittal, R.¹ Iaccarino, G.²

13
- 74049114159
- Auto-tuning 3-D FFT library for CUDA GPUs
- New York, NY, USA, ACM
- A. Nukada and S. Matsuoka. Auto-tuning 3-D FFT library for CUDA GPUs. In Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis, SC '09, pages 30:1-30:10, New York, NY, USA, 2009. ACM.
- (2009) Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis, SC '09
- Nukada, A.¹ Matsuoka, S.²

14
- 70350759823
- Bandwidth intensive 3-D FFT kernel for GPUs using CUDA
- Piscataway, NJ, USA, IEEE Press
- A. Nukada, Y. Ogata, T. Endo, and S. Matsuoka. Bandwidth intensive 3-D FFT kernel for GPUs using CUDA. In Proceedings of the 2008 ACM/IEEE conference on Supercomputing, SC '08, pages 5:1-5:11, Piscataway, NJ, USA, 2008. IEEE Press.
- (2008) Proceedings of the 2008 ACM/IEEE Conference on Supercomputing, SC '08
- Nukada, A.¹ Ogata, Y.² Endo, T.³ Matsuoka, S.⁴

15
- 79551704836
- NVIDIA Corporation
- NVIDIA Corporation. NVIDIA CUDA C programming guide, 2011.
- (2011) NVIDIA CUDA C Programming Guide

16
- 84886395162
- B. C. Sidney. Fast Fourier Transforms. Appendix 1: FFT flowgraphs, 2012. http://cnx.org/content/m16352/latest/?collection=col10550/1.20.
- (2012) Fast Fourier Transforms. Appendix 1: FFT Flowgraphs
- Sidney, B.C.¹

17
- 84884877516
- An optimized FFT-based direct Poisson solver on CUDA GPUs
- To appear
- J. Wu and J. JaJa. An optimized FFT-based direct Poisson solver on CUDA GPUs. IEEE Trans. Parallel Distrib. Syst. To appear.
- IEEE Trans. Parallel Distrib. Syst.
- Wu, J.¹ JaJa, J.²

18
- 84870704125
- Optimized strategies for mapping three-dimensional FFTs onto CUDA GPUs
- IEEE Press
- J. Wu and J. JaJa. Optimized strategies for mapping three-dimensional FFTs onto CUDA GPUs. In Innovative Parallel Computing (INPAR). IEEE Press, 2012.
- (2012) Innovative Parallel Computing (INPAR)
- Wu, J.¹ JaJa, J.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.