메뉴 건너뛰기




Volumn 93, Issue 2, 2005, Pages 216-231

The design and implementation of FFTW3

Author keywords

Adaptive software; Cosine transform; Fast Fourier transform (FFT); Fourier transform; Hartley transform; I O tensor

Indexed keywords

ALGORITHMS; CODES (STANDARDS); COMPUTER ARCHITECTURE; COMPUTER SOFTWARE; COSINE TRANSFORMS; DATA STRUCTURES; DISCRETE FOURIER TRANSFORMS; OPTIMIZATION; PROBLEM SOLVING;

EID: 20744449792     PISSN: 00189219     EISSN: None     Source Type: Journal    
DOI: 10.1109/JPROC.2004.840301     Document Type: Conference Paper
Times cited : (4247)

References (60)
  • 6
    • 0030661485 scopus 로고    scopus 로고
    • Optimizing matrix multiply using PHiPAC: A portable, high-performance, ANSI C coding methodology
    • Vienna, Austria
    • J. Bilmes, K. Asanović, C.-W. Chin, and J. Demmel, "Optimizing matrix multiply using PHiPAC: A portable, high-performance, ANSI C coding methodology," presented at the Int. Conf. Supercomputing, Vienna, Austria, 1997.
    • (1997) Int. Conf. Supercomputing
    • Bilmes, J.1    Asanović, K.2    Chin, C.-W.3    Demmel, J.4
  • 7
    • 0003418094 scopus 로고    scopus 로고
    • Automatically Tuned Linear Algebra Software
    • Comput. Sci. Dept., Univ. Tennessee, Knoxville
    • R. Whaley and J. Dongarra, "Automatically Tuned Linear Algebra Software," Comput. Sci. Dept., Univ. Tennessee, Knoxville, Tech. Rep. CS-97-366, 1997.
    • (1997) Tech. Rep. , vol.CS-97-366
    • Whaley, R.1    Dongarra, J.2
  • 8
    • 0030143875 scopus 로고    scopus 로고
    • A framework for generating distributed-memory parallel programs for block recursive algorithms
    • May
    • S. K. S. Gupta, C. Huang, P. Sadayappan, and R. W. Johnson, "A framework for generating distributed-memory parallel programs for block recursive algorithms," J. Parallel Distrib. Comput., vol. 34, no. 2, pp. 137-153, May 1996.
    • (1996) J. Parallel Distrib. Comput. , vol.34 , Issue.2 , pp. 137-153
    • Gupta, S.K.S.1    Huang, C.2    Sadayappan, P.3    Johnson, R.W.4
  • 13
    • 0141496142 scopus 로고    scopus 로고
    • Learning to construct fast signal processing implementations
    • B. Singer and M. Veloso, "Learning to construct fast signal processing implementations," J. Mach. Learn. Res., vol. 3, pp. 887-919, 2002.
    • (2002) J. Mach. Learn. Res. , vol.3 , pp. 887-919
    • Singer, B.1    Veloso, M.2
  • 14
    • 84968470212 scopus 로고
    • An algorithm for the machine computation of the complex Fourier series
    • Apr.
    • J. W. Cooley and J. W. Tukey, "An algorithm for the machine computation of the complex Fourier series," Math. Comput., vol. 19, pp. 297-301, Apr. 1965.
    • (1965) Math. Comput. , vol.19 , pp. 297-301
    • Cooley, J.W.1    Tukey, J.W.2
  • 15
    • 0021510651 scopus 로고
    • Gauss and the history of the fast Fourier transform
    • Oct.
    • M. T. Heideman, D. H. Johnson, and C. S. Burrus, "Gauss and the history of the fast Fourier transform," IEEE ASSP Mag., vol. 1, no. 4, pp. 14-21, Oct. 1984.
    • (1984) IEEE ASSP Mag. , vol.1 , Issue.4 , pp. 14-21
    • Heideman, M.T.1    Johnson, D.H.2    Burrus, C.S.3
  • 16
    • 0025418849 scopus 로고
    • Fast Fourier transforms: A tutorial review and a state of the art
    • Apr.
    • P. Duhamel and M. Vetterli, "Fast Fourier transforms: A tutorial review and a state of the art," Signal Process., vol. 19, pp. 259-299, Apr. 1990.
    • (1990) Signal Process. , vol.19 , pp. 259-299
    • Duhamel, P.1    Vetterli, M.2
  • 18
    • 0025403252 scopus 로고
    • FFT's in external or hierarchical memory
    • May
    • D. H. Bailey, "FFT's in external or hierarchical memory," J. Supercomput., vol. 4, no. 1, pp. 23-35, May 1990.
    • (1990) J. Supercomput. , vol.4 , Issue.1 , pp. 23-35
    • Bailey, D.H.1
  • 19
    • 20744446407 scopus 로고
    • On computing the fast Fourier transform
    • R. C. Singleton, "On computing the fast Fourier transform," Commun. ACM, vol. 10, pp. 647-654, 1967.
    • (1967) Commun. ACM , vol.10 , pp. 647-654
    • Singleton, R.C.1
  • 20
    • 0030105461 scopus 로고    scopus 로고
    • Bit reversal on uniprocessors
    • A. H. Karp, "Bit reversal on uniprocessors," SIAM Rev., vol. 38, no. 1, pp. 1-26, 1996.
    • (1996) SIAM Rev. , vol.38 , Issue.1 , pp. 1-26
    • Karp, A.H.1
  • 22
    • 0002004984 scopus 로고
    • Vectorizing the FFTs
    • G. Rodrigue, Ed. New York: Academic
    • P. N. Swarztrauber, "Vectorizing the FFTs," in Parallel Computations, G. Rodrigue, Ed. New York: Academic, 1982, pp. 51-83.
    • (1982) Parallel Computations , pp. 51-83
    • Swarztrauber, P.N.1
  • 24
    • 0008486956 scopus 로고
    • Self-sorting in-place fast Fourier transforms
    • C. Temperton, "Self-sorting in-place fast Fourier transforms," SIAM J. Sci. Stat. Comput., vol. 12, no. 4, pp. 808-823, 1991.
    • (1991) SIAM J. Sci. Stat. Comput. , vol.12 , Issue.4 , pp. 808-823
    • Temperton, C.1
  • 25
    • 0028517483 scopus 로고
    • Self-sorting in-place FFT algorithm with minimum working space
    • Oct.
    • Z. Qian, C. Lu, M. An, and R. Tolimieri, "Self-sorting in-place FFT algorithm with minimum working space," IEEE Trans. Acoust., Speech, Signal Process., vol. 42, no. 10, pp. 2835-2836, Oct. 1994.
    • (1994) IEEE Trans. Acoust., Speech, Signal Process. , vol.42 , Issue.10 , pp. 2835-2836
    • Qian, Z.1    Lu, C.2    An, M.3    Tolimieri, R.4
  • 26
    • 21844493537 scopus 로고
    • A self-sorting in-place fast Fourier transform algorithm suitable for vector and parallel processing
    • M. Hegland, "A self-sorting in-place fast Fourier transform algorithm suitable for vector and parallel processing," Numerische Mathematik, vol. 68, no. 4, pp. 507-547, 1994.
    • (1994) Numerische Mathematik , vol.68 , Issue.4 , pp. 507-547
    • Hegland, M.1
  • 28
    • 0001249667 scopus 로고
    • Discrete Fourier transforms when the number of data samples is prime
    • Jun.
    • C. M. Rader, "Discrete Fourier transforms when the number of data samples is prime," Proc. IEEE, vol. 56, no. 6, pp. 1107-1108, Jun. 1968.
    • (1968) Proc. IEEE , vol.56 , Issue.6 , pp. 1107-1108
    • Rader, C.M.1
  • 29
    • 20744444659 scopus 로고
    • A linear filtering approach to the computation of the discrete Fourier transform
    • L. I. Bluestein, "A linear filtering approach to the computation of the discrete Fourier transform," in Northeast Electronics Research and Engineering Meeting Rec., vol. 10, 1968, pp. 218-219.
    • (1968) Northeast Electronics Research and Engineering Meeting Rec. , vol.10 , pp. 218-219
    • Bluestein, L.I.1
  • 30
    • 84966217500 scopus 로고
    • On computing the discrete Fourier transform
    • Jan.
    • S. Winograd, "On computing the discrete Fourier transform," Math. Comput., vol. 32, no. 1, pp. 175-199, Jan. 1978.
    • (1978) Math. Comput. , vol.32 , Issue.1 , pp. 175-199
    • Winograd, S.1
  • 32
    • 0014520310 scopus 로고
    • An algorithm for computing the mixed radix fast Fourier transform
    • Jun.
    • R. C. Singleton, "An algorithm for computing the mixed radix fast Fourier transform," IEEE Trans. Audio Electroacoust., vol. AU-17, no. 2, pp. 93-103, Jun. 1969.
    • (1969) IEEE Trans. Audio Electroacoust. , vol.AU-17 , Issue.2 , pp. 93-103
    • Singleton, R.C.1
  • 36
    • 0029492798 scopus 로고
    • Transposing a matrix on a vector computer
    • M. Dow, "Transposing a matrix on a vector computer," Parallel Comput., vol. 21, no. 12, pp. 1997-2005, 1995.
    • (1995) Parallel Comput. , vol.21 , Issue.12 , pp. 1997-2005
    • Dow, M.1
  • 37
    • 0006359598 scopus 로고
    • Algorithm 513: Analysis of in-situ transposition
    • E. G. Cate and D. W. Twigg, "Algorithm 513: Analysis of in-situ transposition," ACM Trans. Math. Softw. (TOMS), vol. 3, no. 1, pp. 104-110, 1977.
    • (1977) ACM Trans. Math. Softw. (TOMS) , vol.3 , Issue.1 , pp. 104-110
    • Cate, E.G.1    Twigg, D.W.2
  • 39
    • 0023965029 scopus 로고
    • An improved fast Fourier transform algorithm using mixed frequency and time decimations
    • Feb.
    • K. Nakayama, "An improved fast Fourier transform algorithm using mixed frequency and time decimations," IEEE Trans. Acoust., Speech, Signal Process., vol. 36, no. 2, pp. 290-292, Feb. 1988.
    • (1988) IEEE Trans. Acoust., Speech, Signal Process. , vol.36 , Issue.2 , pp. 290-292
    • Nakayama, K.1
  • 41
    • 0003195066 scopus 로고
    • The MD5 message-digest algorithm
    • Network Working Group, Apr.
    • R. Rivest, "The MD5 message-digest algorithm," Network Working Group, Request for Comments (RFC) 1321, Apr. 1992.
    • (1992) Request for Comments (RFC) , vol.1321
    • Rivest, R.1
  • 42
    • 84901917964 scopus 로고    scopus 로고
    • A comprehensive DFT API for scientific computing
    • ser. IFIP Conference Proceedings, R. F. Boisvert and P. T. P. Tang, Eds. Ottawa, ON, Canada: Kluwer
    • P. T. P. Tang, "A comprehensive DFT API for scientific computing," in The Architecture of Scientific Software, ser. IFIP Conference Proceedings, R. F. Boisvert and P. T. P. Tang, Eds. Ottawa, ON, Canada: Kluwer, 2001, vol. 188, pp. 235-256.
    • (2001) The Architecture of Scientific Software , vol.188 , pp. 235-256
    • Tang, P.T.P.1
  • 44
    • 19344363982 scopus 로고    scopus 로고
    • Efficient utilization of SIMD extensions
    • Feb.
    • F. Franchetti, S. Kral, J. Lorenz, and C. Ueberhuber, "Efficient utilization of SIMD extensions," Proc. IEEE, vol. 93, no. 2, pp. 409-425, Feb. 2005.
    • (2005) Proc. IEEE , vol.93 , Issue.2 , pp. 409-425
    • Franchetti, F.1    Kral, S.2    Lorenz, J.3    Ueberhuber, C.4
  • 46
    • 0000327169 scopus 로고
    • Fast mixed-radix real Fourier transforms
    • C. Temperton, "Fast mixed-radix real Fourier transforms," J. Comput. Phys., vol. 52, pp. 340-350, 1983.
    • (1983) J. Comput. Phys. , vol.52 , pp. 340-350
    • Temperton, C.1
  • 47
    • 0000990936 scopus 로고
    • A fast Fourier transform algorithm for real-valued series
    • G. D. Bergland, "A fast Fourier transform algorithm for real-valued series," Commun. ACM, vol. 11, no. 10, pp. 703-710, 1968.
    • (1968) Commun. ACM , vol.11 , Issue.10 , pp. 703-710
    • Bergland, G.D.1
  • 50
    • 0034298503 scopus 로고    scopus 로고
    • An efficient design for one-dimensional discrete Hartley transform using parallel additions
    • Oct.
    • J.-I. Guo, "An efficient design for one-dimensional discrete Hartley transform using parallel additions," IEEE Trans. Signal Process., vol. 48, no. 10, pp. 2806-2813, Oct. 2000.
    • (2000) IEEE Trans. Signal Process. , vol.48 , Issue.10 , pp. 2806-2813
    • Guo, J.-I.1
  • 51
    • 0021473509 scopus 로고
    • Fast algorithms for the discrete W transform and for the discrete Fourier transform
    • Aug.
    • Z. Wang, "Fast algorithms for the discrete W transform and for the discrete Fourier transform," IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-32, no. 4, pp. 803-816, Aug. 1984.
    • (1984) IEEE Trans. Acoust., Speech, Signal Process. , vol.ASSP-32 , Issue.4 , pp. 803-816
    • Wang, Z.1
  • 52
    • 0028429724 scopus 로고
    • Symmetric convolution and the discrete sine and cosine transforms
    • May
    • S. A. Martucci, "Symmetric convolution and the discrete sine and cosine transforms," IEEE Trans. Signal Process., vol. 42, no. 5, pp. 1038-1051, May 1994.
    • (1994) IEEE Trans. Signal Process. , vol.42 , Issue.5 , pp. 1038-1051
    • Martucci, S.A.1
  • 55
    • 0018983824 scopus 로고
    • A fast cosine transform in one and two dimensions
    • Feb.
    • J. Makhoul, "A fast cosine transform in one and two dimensions," IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-28, no. 1, pp. 27-34, Feb. 1980.
    • (1980) IEEE Trans. Acoust., Speech, Signal Process. , vol.ASSP-28 , Issue.1 , pp. 27-34
    • Makhoul, J.1
  • 56
    • 0025530239 scopus 로고
    • Direct methods for computing discrete sinusoidal transforms
    • S. C. Chan and K. L. Ho, "Direct methods for computing discrete sinusoidal transforms," IEE Proc. F, vol. 137, no. 6, pp. 433-442, 1990.
    • (1990) IEE Proc. F , vol.137 , Issue.6 , pp. 433-442
    • Chan, S.C.1    Ho, K.L.2
  • 57
    • 0022137352 scopus 로고
    • On computing the discrete Fourier and cosine transforms
    • Oct.
    • Z. Wang, "On computing the discrete Fourier and cosine transforms," IEEE Trans. Acoust, Speech, Signal Process., vol. ASSP-33, no. 4, pp. 1341-1344, Oct. 1985.
    • (1985) IEEE Trans. Acoust, Speech, Signal Process. , vol.ASSP-33 , Issue.4 , pp. 1341-1344
    • Wang, Z.1
  • 59
    • 0012032535 scopus 로고    scopus 로고
    • Accuracy of the discrete Fourier transform and the fast Fourier transform
    • J. C. Schatzman, "Accuracy of the discrete Fourier transform and the fast Fourier transform," SIAM J. Sci. Comput.,vol. 17, no. 5, pp. 1150-1166, 1996.
    • (1996) SIAM J. Sci. Comput. , vol.17 , Issue.5 , pp. 1150-1166
    • Schatzman, J.C.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.