-
1
-
-
10844263523
-
-
Y. Nievergelt, Scalar fused multiply-add instructions produce floating-point matrix arithmetic provably accurate to the penultimate digit, ACM Trans. Math. Softw. (TOMS), 29, no. 1, pp. 27-48, 2003.
-
Y. Nievergelt, "Scalar fused multiply-add instructions produce floating-point matrix arithmetic provably accurate to the penultimate digit," ACM Trans. Math. Softw. (TOMS), vol. 29, no. 1, pp. 27-48, 2003.
-
-
-
-
2
-
-
0027274408
-
Implementation of efficient FFT algorithms on fused multiply-add architectures
-
Jan
-
E. Linzer and E. Feig, "Implementation of efficient FFT algorithms on fused multiply-add architectures," IEEE Trans. Signal Process., vol. 41, no. 1, p. 93, Jan. 1993.
-
(1993)
IEEE Trans. Signal Process
, vol.41
, Issue.1
, pp. 93
-
-
Linzer, E.1
Feig, E.2
-
3
-
-
0026396564
-
Implementation of multiply-add FFT algorithms for complex and real data sequences
-
C. Lu, "Implementation of multiply-add FFT algorithms for complex and real data sequences," in Proc. Int. Symp. Circuits Systems (ISCAS) 1991, vol. 1, pp. 480-483.
-
(1991)
Proc. Int. Symp. Circuits Systems (ISCAS)
, vol.1
, pp. 480-483
-
-
Lu, C.1
-
4
-
-
0031261334
-
-
S. Goedecker, Fast radix 2, 3, 4, and 5 kernels for fast Fourier transformations on computers with overlapping multiply-add instructions, SIAM J. Scientif. Comput., 18, no. 6, pp. 1605-1611, 1997.
-
S. Goedecker, "Fast radix 2, 3, 4, and 5 kernels for fast Fourier transformations on computers with overlapping multiply-add instructions," SIAM J. Scientif. Comput., vol. 18, no. 6, pp. 1605-1611, 1997.
-
-
-
-
5
-
-
0027540716
-
FFT algorithms for prime transform sizes and their implementations on VAX, IBM3090VF, and IBM RS/ 6000
-
Feb
-
C. Lu, J. W. Cooley, and R. Tolimieri, "FFT algorithms for prime transform sizes and their implementations on VAX, IBM3090VF, and IBM RS/ 6000," IEEE Trans. Signal Process., vol. 41, no. 2, pp. 638-648, Feb. 1993.
-
(1993)
IEEE Trans. Signal Process
, vol.41
, Issue.2
, pp. 638-648
-
-
Lu, C.1
Cooley, J.W.2
Tolimieri, R.3
-
6
-
-
0033693548
-
A new radix-6 FFT algorithm suitable for multiply-add instruction
-
Jun
-
D. Takahashi, "A new radix-6 FFT algorithm suitable for multiply-add instruction," in Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing (ICASSP), Jun. 2000, vol. 6, pp. 3343-3346.
-
(2000)
Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing (ICASSP)
, vol.6
, pp. 3343-3346
-
-
Takahashi, D.1
-
7
-
-
0141676637
-
A radix-16 FFT algorithm suitable for multiply-add instruction based on Goedecker method
-
Apr
-
D. Takahashi, "A radix-16 FFT algorithm suitable for multiply-add instruction based on Goedecker method," in Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing (ICASSP), Apr. 2003, vol. 2, pp. 665-668.
-
(2003)
Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing (ICASSP)
, vol.2
, pp. 665-668
-
-
Takahashi, D.1
-
8
-
-
0026299779
-
New scaled DCT algorithms for fused multiply/ add architectures
-
E. Linzer and E. Feig, "New scaled DCT algorithms for fused multiply/ add architectures," in Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing (ICASSP), 1991, vol. 3, pp. 2201-2204.
-
(1991)
Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing (ICASSP)
, vol.3
, pp. 2201-2204
-
-
Linzer, E.1
Feig, E.2
-
9
-
-
0028410101
-
Hadamard transforms on multiply/ add architectures
-
Apr
-
E. F. D. Coppersmith and E. Linzer, "Hadamard transforms on multiply/ add architectures," IEEE Trans. Signal Process., vol. 42, no. 4, pp. 969-970, Apr. 1994.
-
(1994)
IEEE Trans. Signal Process
, vol.42
, Issue.4
, pp. 969-970
-
-
Coppersmith, E.F.D.1
Linzer, E.2
-
10
-
-
4544287691
-
Automatic generation of implementations for DSP transforms on fused multiply-add architectures
-
Y. Voronenko and M. Püschel, "Automatic generation of implementations for DSP transforms on fused multiply-add architectures," in Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing (ICASSP), 2004, vol. 5, pp. 101-104.
-
(2004)
Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing (ICASSP)
, vol.5
, pp. 101-104
-
-
Voronenko, Y.1
Püschel, M.2
-
11
-
-
19344368072
-
SPIRAL: Code generation for DSP transforms
-
M. Püschel, J. M. F. Moura, J. Johnson, D. Padua, M. Veloso, B. W. Singer, J. Xiong, F. Franchetti, A. Gačić, Y. Voronenko, K. Chen, R. W. Johnson, and N. Rizzolo, "SPIRAL: Code generation for DSP transforms," Proc. IEEE (Special Issue on Program Generation, Optimization, and Adaptation), vol. 93, no. 2, pp. 232-275, 2005.
-
(2005)
Proc. IEEE (Special Issue on Program Generation, Optimization, and Adaptation)
, vol.93
, Issue.2
, pp. 232-275
-
-
Püschel, M.1
Moura, J.M.F.2
Johnson, J.3
Padua, D.4
Veloso, M.5
Singer, B.W.6
Xiong, J.7
Franchetti, F.8
Gačić, A.9
Voronenko, Y.10
Chen, K.11
Johnson, R.W.12
Rizzolo, N.13
-
12
-
-
34548342614
-
-
Spiral website [Online]. Available: Www.spiral.net , pp. -,
-
Spiral website [Online]. Available: Www.spiral.net , "," vol. , pp. -,
-
-
-
-
15
-
-
0000459334
-
Rewriting
-
A. Robinson and A. Voronkov, Eds. New York: Elsevier, ch. 9, pp
-
N. Dershowitz and D. A. Plaisted, "Rewriting," in Handbook of Automated Reasoning, A. Robinson and A. Voronkov, Eds. New York: Elsevier, 2001, vol. 1, ch. 9, pp. 535-610.
-
(2001)
Handbook of Automated Reasoning
, vol.1
, pp. 535-610
-
-
Dershowitz, N.1
Plaisted, D.A.2
-
16
-
-
31844432305
-
Loop merging for signal transforms
-
F. Franchetti, Y. Voronenko, and M. Püschel, "Loop merging for signal transforms," in Proc. Programming Language Design Implementation (PLDI), 2005, pp. 315-326.
-
(2005)
Proc. Programming Language Design Implementation (PLDI)
, pp. 315-326
-
-
Franchetti, F.1
Voronenko, Y.2
Püschel, M.3
-
19
-
-
34548370373
-
-
FFTW 3.1.2 2006 [Online]. Available: Www.fftw.org
-
FFTW 3.1.2 2006 [Online]. Available: Www.fftw.org
-
-
-
-
20
-
-
0031636309
-
FFTW: An adaptive software architecture for the FFT
-
Online, Available
-
M. Frigo and S. G. Johnson, "FFTW: An adaptive software architecture for the FFT," in Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing (ICASSP), 1998, vol. 3, pp. 1381-1384 [Online]. Available: www.fftw.org
-
(1998)
Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing (ICASSP)
, vol.3
, pp. 1381-1384
-
-
Frigo, M.1
Johnson, S.G.2
-
22
-
-
0022665487
-
-
H. V. Sorensen, H. M. T. , C. S. Burrus, and M. T. Heideman, On computing the split-radix FFT, IEEE Trans. Acoust., Speech, Signal Process., ASSP-34, no. 1, pp. 152-156, 1986.
-
H. V. Sorensen, H. M. T. , C. S. Burrus, and M. T. Heideman, "On computing the split-radix FFT," IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-34, no. 1, pp. 152-156, 1986.
-
-
-
-
24
-
-
0026678378
-
New networks for perfect inversion and perfect reconstruction
-
Jan
-
F. Bruekers and A. Enden, "New networks for perfect inversion and perfect reconstruction," IEEE J. Sel. Areas Commun., vol. 10, pp. 130-137, Jan. 1992.
-
(1992)
IEEE J. Sel. Areas Commun
, vol.10
, pp. 130-137
-
-
Bruekers, F.1
Enden, A.2
-
25
-
-
30244489068
-
The lifting scheme: A custom-design construction of biorthogonal wavelets
-
W. Sweldens, "The lifting scheme: A custom-design construction of biorthogonal wavelets," Appl. Comput. Harmon. Anal., vol. 3, no. 2, pp. 186-200, 1996.
-
(1996)
Appl. Comput. Harmon. Anal
, vol.3
, Issue.2
, pp. 186-200
-
-
Sweldens, W.1
-
26
-
-
34548340409
-
Optimal placement of fused multiply-add (FMA) instructions
-
presented at the, San Jose, CA, Mar
-
K. Serebryany, "Optimal placement of fused multiply-add (FMA) instructions," presented at the 6th Workshop on Explicitly Parallel Instructions, Computing Architectures and Compiler Technology (EPIC-6), San Jose, CA, Mar. 2007.
-
(2007)
6th Workshop on Explicitly Parallel Instructions, Computing Architectures and Compiler Technology (EPIC-6)
-
-
Serebryany, K.1
|