메뉴 건너뛰기




Volumn 6067 LNCS, Issue PART 1, 2010, Pages 606-614

An implementation of parallel 3-D FFT with 2-D decomposition on a massively parallel cluster of multi-core processors

Author keywords

[No Author keywords available]

Indexed keywords

COMMUNICATION TIME; D-DECOMPOSITION; FFT ALGORITHM; MPI PROCESS; MULTI-CORE PROCESSOR; PARALLEL CLUSTERS; PEAK PERFORMANCE;

EID: 77955106795     PISSN: 03029743     EISSN: 16113349     Source Type: Book Series    
DOI: 10.1007/978-3-642-14390-8_63     Document Type: Conference Paper
Times cited : (36)

References (10)
  • 1
    • 84968470212 scopus 로고
    • An algorithm for the machine calculation of complex Fourier series
    • Cooley, J.W., Tukey, J.W.: An algorithm for the machine calculation of complex Fourier series. Math. Comput. 19, 297-301 (1965)
    • (1965) Math. Comput. , vol.19 , pp. 297-301
    • Cooley, J.W.1    Tukey, J.W.2
  • 2
    • 0022721382 scopus 로고
    • Two and three dimensional FFTs on highly parallel computers
    • Brass, A., Pawley, G.S.: Two and three dimensional FFTs on highly parallel computers. Parallel Computing 3, 167-184 (1986)
    • (1986) Parallel Computing , vol.3 , pp. 167-184
    • Brass, A.1    Pawley, G.S.2
  • 4
    • 0037402659 scopus 로고    scopus 로고
    • Efficient implementation of parallel three-dimensional FFT on clusters of PCs
    • Takahashi, D.: Efficient implementation of parallel three-dimensional FFT on clusters of PCs. Computer Physics Communications 152, 144-150 (2003)
    • (2003) Computer Physics Communications , vol.152 , pp. 144-150
    • Takahashi, D.1
  • 5
    • 19344378421 scopus 로고    scopus 로고
    • Scalable framework for 3D FFTs on the Blue Gene/L supercomputer: Implementation and early performance measurements
    • Eleftheriou, M., Fitch, B.G., Rayshubskiy, A., Ward, T.J.C., Germain, R.S.: Scalable framework for 3D FFTs on the Blue Gene/L supercomputer: Implementation and early performance measurements. IBM J. Res. Dev. 49, 457-464 (2005)
    • (2005) IBM J. Res. Dev. , vol.49 , pp. 457-464
    • Eleftheriou, M.1    Fitch, B.G.2    Rayshubskiy, A.3    Ward, T.J.C.4    Germain, R.S.5
  • 6
    • 20744449792 scopus 로고    scopus 로고
    • The design and implementation of FFTW3
    • Frigo, M., Johnson, S.G.: The design and implementation of FFTW3. Proc. IEEE 93, 216-231 (2005)
    • (2005) Proc. IEEE , vol.93 , pp. 216-231
    • Frigo, M.1    Johnson, S.G.2
  • 7
    • 33745786084 scopus 로고    scopus 로고
    • A hybrid MPI/OpenMP implementation of a parallel 3-D FFT on SMP clusters
    • Wyrzykowski, R., Dongarra, J., Meyer, N., Wásniewski, J. (eds.) PPAM 2005. Springer, Heidelberg
    • Takahashi, D.: A hybrid MPI/OpenMP implementation of a parallel 3-D FFT on SMP clusters. In: Wyrzykowski, R., Dongarra, J., Meyer, N., Wásniewski, J. (eds.) PPAM 2005. LNCS, vol.3911, pp. 970-977. Springer, Heidelberg (2006)
    • (2006) LNCS , vol.3911 , pp. 970-977
    • Takahashi, D.1
  • 8
    • 33947229391 scopus 로고    scopus 로고
    • Performance of the 3D FFT on the 6D network torus QCDOC parallel supercomputer
    • Fang, B., Deng, Y., Martyna, G.: Performance of the 3D FFT on the 6D network torus QCDOC parallel supercomputer. Computer Physics Communications 176, 531-538 (2007)
    • (2007) Computer Physics Communications , vol.176 , pp. 531-538
    • Fang, B.1    Deng, Y.2    Martyna, G.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.