-
3
-
-
0031636309
-
FFTW: An adaptive software architecture for the FFT
-
M. Frigo and S. G. Johnson, "FFTW: An adaptive software architecture for the FFT," in Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Processing, vol. 3, 1998, pp. 1381-1384.
-
(1998)
Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Processing
, vol.3
, pp. 1381-1384
-
-
Frigo, M.1
Johnson, S.G.2
-
4
-
-
0023246968
-
Searching for the best Cooley-Tukey FFT algorithms
-
G. Jayasumana, "Searching for the best Cooley-Tukey FFT algorithms," in Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Processing, vol. 4, 1987, pp. 2408-2411.
-
(1987)
Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Processing
, vol.4
, pp. 2408-2411
-
-
Jayasumana, G.1
-
6
-
-
0030661485
-
Optimizing matrix multiply using PHiPAC: A portable, high-performance, ANSI C coding methodology
-
Vienna, Austria
-
J. Bilmes, K. Asanović, C.-W. Chin, and J. Demmel, "Optimizing matrix multiply using PHiPAC: A portable, high-performance, ANSI C coding methodology," presented at the Int. Conf. Supercomputing, Vienna, Austria, 1997.
-
(1997)
Int. Conf. Supercomputing
-
-
Bilmes, J.1
Asanović, K.2
Chin, C.-W.3
Demmel, J.4
-
7
-
-
0003418094
-
Automatically Tuned Linear Algebra Software
-
Comput. Sci. Dept., Univ. Tennessee, Knoxville
-
R. Whaley and J. Dongarra, "Automatically Tuned Linear Algebra Software," Comput. Sci. Dept., Univ. Tennessee, Knoxville, Tech. Rep. CS-97-366, 1997.
-
(1997)
Tech. Rep.
, vol.CS-97-366
-
-
Whaley, R.1
Dongarra, J.2
-
8
-
-
0030143875
-
A framework for generating distributed-memory parallel programs for block recursive algorithms
-
May
-
S. K. S. Gupta, C. Huang, P. Sadayappan, and R. W. Johnson, "A framework for generating distributed-memory parallel programs for block recursive algorithms," J. Parallel Distrib. Comput., vol. 34, no. 2, pp. 137-153, May 1996.
-
(1996)
J. Parallel Distrib. Comput.
, vol.34
, Issue.2
, pp. 137-153
-
-
Gupta, S.K.S.1
Huang, C.2
Sadayappan, P.3
Johnson, R.W.4
-
9
-
-
1542396679
-
SPIRAL: A generator for platform-adapted libraries of signal processing algorithms
-
M. Püschel, B. Singer, J. Xiong, J. M. F. Moura, J. Johnson, D. Padua, M. M. Veloso, and R. W. Johnson, "SPIRAL: A generator for platform-adapted libraries of signal processing algorithms," J. High Perform. Comput. Applicat., vol. 18, no. 1, pp. 21-45, 2004.
-
(2004)
J. High Perform. Comput. Applicat.
, vol.18
, Issue.1
, pp. 21-45
-
-
Püschel, M.1
Singer, B.2
Xiong, J.3
Moura, J.M.F.4
Johnson, J.5
Padua, D.6
Veloso, M.M.7
Johnson, R.W.8
-
10
-
-
19344368072
-
SPIRAL: Code generation for DSP transforms
-
Feb.
-
M. Püschel, J. M. F. Moura, J. Johnson, D. Padua, M. Veloso, B. W. Singer, J. Xiong, F. Franchetti, A. Gačić, Y. Voronenko, K. Chen, R. W. Johnson, and N. Rizzolo, "SPIRAL: Code generation for DSP transforms," Proc. IEEE, vol. 93, no. 2, pp. 232-275, Feb. 2005.
-
(2005)
Proc. IEEE
, vol.93
, Issue.2
, pp. 232-275
-
-
Püschel, M.1
Moura, J.M.F.2
Johnson, J.3
Padua, D.4
Veloso, M.5
Singer, B.W.6
Xiong, J.7
Franchetti, F.8
Gačić, A.9
Voronenko, Y.10
Chen, K.11
Johnson, R.W.12
Rizzolo, N.13
-
13
-
-
0141496142
-
Learning to construct fast signal processing implementations
-
B. Singer and M. Veloso, "Learning to construct fast signal processing implementations," J. Mach. Learn. Res., vol. 3, pp. 887-919, 2002.
-
(2002)
J. Mach. Learn. Res.
, vol.3
, pp. 887-919
-
-
Singer, B.1
Veloso, M.2
-
14
-
-
84968470212
-
An algorithm for the machine computation of the complex Fourier series
-
Apr.
-
J. W. Cooley and J. W. Tukey, "An algorithm for the machine computation of the complex Fourier series," Math. Comput., vol. 19, pp. 297-301, Apr. 1965.
-
(1965)
Math. Comput.
, vol.19
, pp. 297-301
-
-
Cooley, J.W.1
Tukey, J.W.2
-
15
-
-
0021510651
-
Gauss and the history of the fast Fourier transform
-
Oct.
-
M. T. Heideman, D. H. Johnson, and C. S. Burrus, "Gauss and the history of the fast Fourier transform," IEEE ASSP Mag., vol. 1, no. 4, pp. 14-21, Oct. 1984.
-
(1984)
IEEE ASSP Mag.
, vol.1
, Issue.4
, pp. 14-21
-
-
Heideman, M.T.1
Johnson, D.H.2
Burrus, C.S.3
-
16
-
-
0025418849
-
Fast Fourier transforms: A tutorial review and a state of the art
-
Apr.
-
P. Duhamel and M. Vetterli, "Fast Fourier transforms: A tutorial review and a state of the art," Signal Process., vol. 19, pp. 259-299, Apr. 1990.
-
(1990)
Signal Process.
, vol.19
, pp. 259-299
-
-
Duhamel, P.1
Vetterli, M.2
-
18
-
-
0025403252
-
FFT's in external or hierarchical memory
-
May
-
D. H. Bailey, "FFT's in external or hierarchical memory," J. Supercomput., vol. 4, no. 1, pp. 23-35, May 1990.
-
(1990)
J. Supercomput.
, vol.4
, Issue.1
, pp. 23-35
-
-
Bailey, D.H.1
-
19
-
-
20744446407
-
On computing the fast Fourier transform
-
R. C. Singleton, "On computing the fast Fourier transform," Commun. ACM, vol. 10, pp. 647-654, 1967.
-
(1967)
Commun. ACM
, vol.10
, pp. 647-654
-
-
Singleton, R.C.1
-
20
-
-
0030105461
-
Bit reversal on uniprocessors
-
A. H. Karp, "Bit reversal on uniprocessors," SIAM Rev., vol. 38, no. 1, pp. 1-26, 1996.
-
(1996)
SIAM Rev.
, vol.38
, Issue.1
, pp. 1-26
-
-
Karp, A.H.1
-
22
-
-
0002004984
-
Vectorizing the FFTs
-
G. Rodrigue, Ed. New York: Academic
-
P. N. Swarztrauber, "Vectorizing the FFTs," in Parallel Computations, G. Rodrigue, Ed. New York: Academic, 1982, pp. 51-83.
-
(1982)
Parallel Computations
, pp. 51-83
-
-
Swarztrauber, P.N.1
-
23
-
-
0021158671
-
An in-place in-order radix-2 FFT
-
H. W. Johnson and C. S. Burrus, "An in-place in-order radix-2 FFT," in Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Processing, 1984, pp. 28A.2.1-28A.2.4.
-
(1984)
Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Processing
-
-
Johnson, H.W.1
Burrus, C.S.2
-
24
-
-
0008486956
-
Self-sorting in-place fast Fourier transforms
-
C. Temperton, "Self-sorting in-place fast Fourier transforms," SIAM J. Sci. Stat. Comput., vol. 12, no. 4, pp. 808-823, 1991.
-
(1991)
SIAM J. Sci. Stat. Comput.
, vol.12
, Issue.4
, pp. 808-823
-
-
Temperton, C.1
-
25
-
-
0028517483
-
Self-sorting in-place FFT algorithm with minimum working space
-
Oct.
-
Z. Qian, C. Lu, M. An, and R. Tolimieri, "Self-sorting in-place FFT algorithm with minimum working space," IEEE Trans. Acoust., Speech, Signal Process., vol. 42, no. 10, pp. 2835-2836, Oct. 1994.
-
(1994)
IEEE Trans. Acoust., Speech, Signal Process.
, vol.42
, Issue.10
, pp. 2835-2836
-
-
Qian, Z.1
Lu, C.2
An, M.3
Tolimieri, R.4
-
26
-
-
21844493537
-
A self-sorting in-place fast Fourier transform algorithm suitable for vector and parallel processing
-
M. Hegland, "A self-sorting in-place fast Fourier transform algorithm suitable for vector and parallel processing," Numerische Mathematik, vol. 68, no. 4, pp. 507-547, 1994.
-
(1994)
Numerische Mathematik
, vol.68
, Issue.4
, pp. 507-547
-
-
Hegland, M.1
-
27
-
-
0003513556
-
-
Upper Saddle River, NJ: Prentice-Hall
-
A. V. Oppenheim, R. W. Schafer, and J. R. Buck, Discrete-Time Signal Processing, 2nd ed. Upper Saddle River, NJ: Prentice-Hall, 1999.
-
(1999)
Discrete-Time Signal Processing, 2nd Ed.
-
-
Oppenheim, A.V.1
Schafer, R.W.2
Buck, J.R.3
-
28
-
-
0001249667
-
Discrete Fourier transforms when the number of data samples is prime
-
Jun.
-
C. M. Rader, "Discrete Fourier transforms when the number of data samples is prime," Proc. IEEE, vol. 56, no. 6, pp. 1107-1108, Jun. 1968.
-
(1968)
Proc. IEEE
, vol.56
, Issue.6
, pp. 1107-1108
-
-
Rader, C.M.1
-
29
-
-
20744444659
-
A linear filtering approach to the computation of the discrete Fourier transform
-
L. I. Bluestein, "A linear filtering approach to the computation of the discrete Fourier transform," in Northeast Electronics Research and Engineering Meeting Rec., vol. 10, 1968, pp. 218-219.
-
(1968)
Northeast Electronics Research and Engineering Meeting Rec.
, vol.10
, pp. 218-219
-
-
Bluestein, L.I.1
-
30
-
-
84966217500
-
On computing the discrete Fourier transform
-
Jan.
-
S. Winograd, "On computing the discrete Fourier transform," Math. Comput., vol. 32, no. 1, pp. 175-199, Jan. 1978.
-
(1978)
Math. Comput.
, vol.32
, Issue.1
, pp. 175-199
-
-
Winograd, S.1
-
31
-
-
0004161838
-
-
New York: Cambridge Univ. Press
-
W. H. Press, B. P. Flannery, S. A. Teukolsky, and W. T. Vetterling, Numerical Recipes in C: The Art of Scientific Computing, 2nd ed. New York: Cambridge Univ. Press, 1992.
-
(1992)
Numerical Recipes in C: The Art of Scientific Computing, 2nd Ed.
-
-
Press, W.H.1
Flannery, B.P.2
Teukolsky, S.A.3
Vetterling, W.T.4
-
32
-
-
0014520310
-
An algorithm for computing the mixed radix fast Fourier transform
-
Jun.
-
R. C. Singleton, "An algorithm for computing the mixed radix fast Fourier transform," IEEE Trans. Audio Electroacoust., vol. AU-17, no. 2, pp. 93-103, Jun. 1969.
-
(1969)
IEEE Trans. Audio Electroacoust.
, vol.AU-17
, Issue.2
, pp. 93-103
-
-
Singleton, R.C.1
-
33
-
-
0022665487
-
On computing the split-radix FFT
-
Feb.
-
H. V. Sorensen, M. T. Heideman, and C. S. Burrus, "On computing the split-radix FFT," IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-34, no. 1, pp. 152-156, Feb. 1986.
-
(1986)
IEEE Trans. Acoust., Speech, Signal Process.
, vol.ASSP-34
, Issue.1
, pp. 152-156
-
-
Sorensen, H.V.1
Heideman, M.T.2
Burrus, C.S.3
-
34
-
-
84957016016
-
A blocking algorithm for FFT on cache-based processors
-
Heidelberg, Germany: Springer-Verlag
-
D. Takahashi, "A blocking algorithm for FFT on cache-based processors," in Lecture Notes in Computer Science, High-Performance Computing and Networking. Heidelberg, Germany: Springer-Verlag, 2001, vol. 2110, pp. 551-554.
-
(2001)
Lecture Notes in Computer Science, High-performance Computing and Networking
, vol.2110
, pp. 551-554
-
-
Takahashi, D.1
-
35
-
-
0033350255
-
Cache-oblivious algorithms
-
M. Frigo, C. E. Leiserson, H. Prokop, and S. Ramachandran, "Cache-oblivious algorithms," in Proc. 40th Ann. Symp. Foundations of Computer Science (FOCS '99), 1999, pp. 285-297.
-
(1999)
Proc. 40th Ann. Symp. Foundations of Computer Science (FOCS '99)
, pp. 285-297
-
-
Frigo, M.1
Leiserson, C.E.2
Prokop, H.3
Ramachandran, S.4
-
36
-
-
0029492798
-
Transposing a matrix on a vector computer
-
M. Dow, "Transposing a matrix on a vector computer," Parallel Comput., vol. 21, no. 12, pp. 1997-2005, 1995.
-
(1995)
Parallel Comput.
, vol.21
, Issue.12
, pp. 1997-2005
-
-
Dow, M.1
-
37
-
-
0006359598
-
Algorithm 513: Analysis of in-situ transposition
-
E. G. Cate and D. W. Twigg, "Algorithm 513: Analysis of in-situ transposition," ACM Trans. Math. Softw. (TOMS), vol. 3, no. 1, pp. 104-110, 1977.
-
(1977)
ACM Trans. Math. Softw. (TOMS)
, vol.3
, Issue.1
, pp. 104-110
-
-
Cate, E.G.1
Twigg, D.W.2
-
39
-
-
0023965029
-
An improved fast Fourier transform algorithm using mixed frequency and time decimations
-
Feb.
-
K. Nakayama, "An improved fast Fourier transform algorithm using mixed frequency and time decimations," IEEE Trans. Acoust., Speech, Signal Process., vol. 36, no. 2, pp. 290-292, Feb. 1988.
-
(1988)
IEEE Trans. Acoust., Speech, Signal Process.
, vol.36
, Issue.2
, pp. 290-292
-
-
Nakayama, K.1
-
40
-
-
0142181749
-
Decimation-in-time-frequency FFT algorithm
-
A. Saidi, "Decimation-in-time-frequency FFT algorithm," in Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Processing, vol. 3, 1994, pp. 453-456.
-
(1994)
Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Processing
, vol.3
, pp. 453-456
-
-
Saidi, A.1
-
41
-
-
0003195066
-
The MD5 message-digest algorithm
-
Network Working Group, Apr.
-
R. Rivest, "The MD5 message-digest algorithm," Network Working Group, Request for Comments (RFC) 1321, Apr. 1992.
-
(1992)
Request for Comments (RFC)
, vol.1321
-
-
Rivest, R.1
-
42
-
-
84901917964
-
A comprehensive DFT API for scientific computing
-
ser. IFIP Conference Proceedings, R. F. Boisvert and P. T. P. Tang, Eds. Ottawa, ON, Canada: Kluwer
-
P. T. P. Tang, "A comprehensive DFT API for scientific computing," in The Architecture of Scientific Software, ser. IFIP Conference Proceedings, R. F. Boisvert and P. T. P. Tang, Eds. Ottawa, ON, Canada: Kluwer, 2001, vol. 188, pp. 235-256.
-
(2001)
The Architecture of Scientific Software
, vol.188
, pp. 235-256
-
-
Tang, P.T.P.1
-
43
-
-
18844422753
-
SPL: A language and compiler for DSP algorithms
-
J. Xiong, D. Padua, and J. Johnson, "SPL: A language and compiler for DSP algorithms," in Proc. ACM SIGPLAN'01 Conf. Programming Language Design and Implementation (PLDI), 2001, pp. 298-308.
-
(2001)
Proc. ACM SIGPLAN'01 Conf. Programming Language Design and Implementation (PLDI)
, pp. 298-308
-
-
Xiong, J.1
Padua, D.2
Johnson, J.3
-
44
-
-
19344363982
-
Efficient utilization of SIMD extensions
-
Feb.
-
F. Franchetti, S. Kral, J. Lorenz, and C. Ueberhuber, "Efficient utilization of SIMD extensions," Proc. IEEE, vol. 93, no. 2, pp. 409-425, Feb. 2005.
-
(2005)
Proc. IEEE
, vol.93
, Issue.2
, pp. 409-425
-
-
Franchetti, F.1
Kral, S.2
Lorenz, J.3
Ueberhuber, C.4
-
45
-
-
0023364252
-
Real-valued fast Fourier transform algorithms
-
Jun.
-
H. V. Sorensen, D. L. Jones, M. T. Heideman, and C. S. Burrus, "Real-valued fast Fourier transform algorithms," IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-35, no. 6, pp. 849-863, Jun. 1987.
-
(1987)
IEEE Trans. Acoust., Speech, Signal Process.
, vol.ASSP-35
, Issue.6
, pp. 849-863
-
-
Sorensen, H.V.1
Jones, D.L.2
Heideman, M.T.3
Burrus, C.S.4
-
46
-
-
0000327169
-
Fast mixed-radix real Fourier transforms
-
C. Temperton, "Fast mixed-radix real Fourier transforms," J. Comput. Phys., vol. 52, pp. 340-350, 1983.
-
(1983)
J. Comput. Phys.
, vol.52
, pp. 340-350
-
-
Temperton, C.1
-
47
-
-
0000990936
-
A fast Fourier transform algorithm for real-valued series
-
G. D. Bergland, "A fast Fourier transform algorithm for real-valued series," Commun. ACM, vol. 11, no. 10, pp. 703-710, 1968.
-
(1968)
Commun. ACM
, vol.11
, Issue.10
, pp. 703-710
-
-
Bergland, G.D.1
-
50
-
-
0034298503
-
An efficient design for one-dimensional discrete Hartley transform using parallel additions
-
Oct.
-
J.-I. Guo, "An efficient design for one-dimensional discrete Hartley transform using parallel additions," IEEE Trans. Signal Process., vol. 48, no. 10, pp. 2806-2813, Oct. 2000.
-
(2000)
IEEE Trans. Signal Process.
, vol.48
, Issue.10
, pp. 2806-2813
-
-
Guo, J.-I.1
-
51
-
-
0021473509
-
Fast algorithms for the discrete W transform and for the discrete Fourier transform
-
Aug.
-
Z. Wang, "Fast algorithms for the discrete W transform and for the discrete Fourier transform," IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-32, no. 4, pp. 803-816, Aug. 1984.
-
(1984)
IEEE Trans. Acoust., Speech, Signal Process.
, vol.ASSP-32
, Issue.4
, pp. 803-816
-
-
Wang, Z.1
-
52
-
-
0028429724
-
Symmetric convolution and the discrete sine and cosine transforms
-
May
-
S. A. Martucci, "Symmetric convolution and the discrete sine and cosine transforms," IEEE Trans. Signal Process., vol. 42, no. 5, pp. 1038-1051, May 1994.
-
(1994)
IEEE Trans. Signal Process.
, vol.42
, Issue.5
, pp. 1038-1051
-
-
Martucci, S.A.1
-
53
-
-
0003685404
-
-
Boston, MA: Academic
-
K. R. Rao and P. Yip, Discrete Cosine Transform: Algorithms, Advantages, Applications. Boston, MA: Academic, 1990.
-
(1990)
Discrete Cosine Transform: Algorithms, Advantages, Applications
-
-
Rao, K.R.1
Yip, P.2
-
54
-
-
12444302611
-
Code generators for automatic tuning of numerical kernels: Experiences with FFTW
-
Montreal, QC, Canada
-
R. Vuduc and J. Demmel, "Code generators for automatic tuning of numerical kernels: Experiences with FFTW," presented at the Semantics, Application, and Implementation of Code Generators Workshop, Montreal, QC, Canada, 2000.
-
(2000)
Semantics, Application, and Implementation of Code Generators Workshop
-
-
Vuduc, R.1
Demmel, J.2
-
55
-
-
0018983824
-
A fast cosine transform in one and two dimensions
-
Feb.
-
J. Makhoul, "A fast cosine transform in one and two dimensions," IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-28, no. 1, pp. 27-34, Feb. 1980.
-
(1980)
IEEE Trans. Acoust., Speech, Signal Process.
, vol.ASSP-28
, Issue.1
, pp. 27-34
-
-
Makhoul, J.1
-
56
-
-
0025530239
-
Direct methods for computing discrete sinusoidal transforms
-
S. C. Chan and K. L. Ho, "Direct methods for computing discrete sinusoidal transforms," IEE Proc. F, vol. 137, no. 6, pp. 433-442, 1990.
-
(1990)
IEE Proc. F
, vol.137
, Issue.6
, pp. 433-442
-
-
Chan, S.C.1
Ho, K.L.2
-
57
-
-
0022137352
-
On computing the discrete Fourier and cosine transforms
-
Oct.
-
Z. Wang, "On computing the discrete Fourier and cosine transforms," IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-33, no. 4, pp. 1341-1344, Oct. 1985.
-
(1985)
IEEE Trans. Acoust., Speech, Signal Process.
, vol.ASSP-33
, Issue.4
, pp. 1341-1344
-
-
Wang, Z.1
-
58
-
-
0026838109
-
Fast algorithms for computing the discrete cosine transform
-
Mar.
-
S. C. Chan and K. L. Ho, "Fast algorithms for computing the discrete cosine transform," IEEE Trans. Circuits Syst. II: Analog Digit. Signal Process., vol. 39, no. 3, pp. 185-190, Mar. 1992.
-
(1992)
IEEE Trans. Circuits Syst. II: Analog Digit. Signal Process.
, vol.39
, Issue.3
, pp. 185-190
-
-
Chan, S.C.1
Ho, K.L.2
-
59
-
-
0012032535
-
Accuracy of the discrete Fourier transform and the fast Fourier transform
-
J. C. Schatzman, "Accuracy of the discrete Fourier transform and the fast Fourier transform," SIAM J. Sci. Comput., vol. 17, no. 5, pp. 1150-1166, 1996.
-
(1996)
SIAM J. Sci. Comput.
, vol.17
, Issue.5
, pp. 1150-1166
-
-
Schatzman, J.C.1
-
60
-
-
0034848812
-
Architecture independent short vector FFTs
-
F. Franchetti, H. Karner, S. Kral, and C. W. Ueberhuber, "Architecture independent short vector FFTs," in Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Processing, vol. 2, 2001, pp. 1109-1112.
-
(2001)
Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Processing
, vol.2
, pp. 1109-1112
-
-
Franchetti, F.1
Karner, H.2
Kral, S.3
Ueberhuber, C.W.4