메뉴 건너뛰기




Volumn , Issue , 2012, Pages 344-350

An implementation of parallel 1-D FFT on the K computer

Author keywords

all to all communication; distributed memory parallel computer; Fast Fourier transform

Indexed keywords

ALL-TO-ALL COMMUNICATION; CACHE MISS; DISTRIBUTED-MEMORY PARALLEL COMPUTERS; FFT ALGORITHM; PEAK PERFORMANCE;

EID: 84870401849     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/HPCC.2012.53     Document Type: Conference Paper
Times cited : (11)

References (19)
  • 1
    • 84968470212 scopus 로고
    • An algorithm for the machine calculation of complex Fourier series
    • J. W. Cooley and J. W. Tukey, "An algorithm for the machine calculation of complex Fourier series," Math. Comput., vol. 19, pp. 297-301, 1965.
    • (1965) Math. Comput. , vol.19 , pp. 297-301
    • Cooley, J.W.1    Tukey, J.W.2
  • 3
    • 21844493537 scopus 로고
    • A self-sorting in-place fast Fourier transform algorithm suitable for vector and parallel processing
    • M. Hegland, "A self-sorting in-place fast Fourier transform algorithm suitable for vector and parallel processing," Numerische Mathematik, vol. 68, pp. 507-547, 1994.
    • (1994) Numerische Mathematik , vol.68 , pp. 507-547
    • Hegland, M.1
  • 4
    • 84949653778 scopus 로고    scopus 로고
    • Automatic performance tuning in the UHFFT library
    • Proc. 2001 International Conference on Computational Science (ICCS 2001), ser. Springer-Verlag
    • D. Mirković and L. Johnsson, S, "Automatic performance tuning in the UHFFT library," in Proc. 2001 International Conference on Computational Science (ICCS 2001), ser. Lecture Notes in Computer Science, vol. 2073. Springer-Verlag, 2001, pp. 71-80.
    • (2001) Lecture Notes in Computer Science , vol.2073 , pp. 71-80
    • Mirković, D.1    Johnsson, S.L.2
  • 5
    • 20744449792 scopus 로고    scopus 로고
    • The design and implementation of FFTW3
    • M. Frigo and S. G. Johnson, "The design and implementation of FFTW3," Proc. IEEE, vol. 93, pp. 216-231, 2005.
    • (2005) Proc. IEEE , vol.93 , pp. 216-231
    • Frigo, M.1    Johnson, S.G.2
  • 7
    • 84947753208 scopus 로고    scopus 로고
    • Automatic performance optimization of the discrete Fourier transform on distributed memory computers
    • Proc. 4th International Symposium on Parallel and Distributed Processing and Applications (ISPA 2006), ser. Springer-Verlag
    • A. Bonelli, F. Franchetti, J. Lorenz, M. Püschel, and C. W. Ueberhuber, "Automatic performance optimization of the discrete Fourier transform on distributed memory computers," in Proc. 4th International Symposium on Parallel and Distributed Processing and Applications (ISPA 2006), ser. Lecture Notes in Computer Science, vol. 4330. Springer-Verlag, 2006, pp. 818-832.
    • (2006) Lecture Notes in Computer Science , vol.4330 , pp. 818-832
    • Bonelli, A.1    Franchetti, F.2    Lorenz, J.3    Püschel, M.4    Ueberhuber, C.W.5
  • 9
    • 0025403252 scopus 로고
    • FFTs in external or hierarchical memory
    • D. H. Bailey, "FFTs in external or hierarchical memory," The Journal of Supercomputing, vol. 4, pp. 23-35, 1990.
    • (1990) The Journal of Supercomputing , vol.4 , pp. 23-35
    • Bailey, D.H.1
  • 12
    • 84856642103 scopus 로고    scopus 로고
    • SPARC64 (TM) VIIIfx Extensions, Fujitsu Limited, http://www.fujitsu.com/ downloads/TC/ sparc64viiifx-extensions.pdf.
    • SPARC64 (TM) VIIIfx Extensions
  • 13
    • 70450200710 scopus 로고    scopus 로고
    • Tofu: A 6D mesh/torus interconnect for exascale computers
    • Y. Ajima, S. Sumimoto, and T. Shimizu, "Tofu: A 6D mesh/torus interconnect for exascale computers," IEEE Computer, vol. 42, pp. 36-40, 2009.
    • (2009) IEEE Computer , vol.42 , pp. 36-40
    • Ajima, Y.1    Sumimoto, S.2    Shimizu, T.3
  • 15
    • 84957016016 scopus 로고    scopus 로고
    • A blocking algorithm for FFT on cache-based processors
    • Proc. 9th International Conference on High Performance Computing and Networking Europe (HPCN Europe 2001), ser.
    • D. Takahashi, "A blocking algorithm for FFT on cache-based processors," in Proc. 9th International Conference on High Performance Computing and Networking Europe (HPCN Europe 2001), ser. Lecture Notes in Computer Science, vol. 2110. Springer-Verlag, 2001, pp. 551-554.
    • (2001) Lecture Notes in Computer Science , vol.2110 , pp. 551-554
    • Takahashi, D.1
  • 16
    • 0021470572 scopus 로고
    • FFT algorithms for vector computers
    • P. N. Swarztrauber, "FFT algorithms for vector computers," Parallel Computing, vol. 1, pp. 45-63, 1984.
    • (1984) Parallel Computing , vol.1 , pp. 45-63
    • Swarztrauber, P.N.1
  • 17
    • 0042208264 scopus 로고
    • Self-sorting mixed-radix fast Fourier transforms
    • C. Temperton, "Self-sorting mixed-radix fast Fourier transforms," J. Comput. Phys., vol. 52, pp. 1-23, 1983.
    • (1983) J. Comput. Phys. , vol.52 , pp. 1-23
    • Temperton, C.1
  • 18
    • 0001249667 scopus 로고
    • Discrete Fourier transforms when the number of data samples is prime
    • C. M. Rader, "Discrete Fourier transforms when the number of data samples is prime," in Proc. IEEE, vol. 56, 1968, pp. 1107-1108.
    • (1968) Proc. IEEE , vol.56 , pp. 1107-1108
    • Rader, C.M.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.