



Volume 93, Issue 2, 2005, Pages 232-273

SPIRAL: Code generation for DSP transforms

Author keywords

Adaptation; Automatic performance tuning; Code optimization; Discrete cosine transform (DCT); Discrete Fourier transform (DFT); Fast Fourier transform (FFT); Filter; Genetic and evolutionary algorithm; High performance computing; Learning; Library generation

Indexed keywords

ADAPTIVE ALGORITHMS; ALGORITHMS; CODES (STANDARDS); COMPUTER ARCHITECTURE; COSINE TRANSFORMS; DISCRETE FOURIER TRANSFORMS; EVOLUTIONARY ALGORITHMS; FAST FOURIER TRANSFORMS; GENETIC ALGORITHMS; MARKOV PROCESSES; SIGNAL THEORY; WAVE FILTERS;

EID: 19344368072     PISSN: 00189219     EISSN: None     Source Type: Journal    
DOI: 10.1109/JPROC.2004.840306     Document Type: Conference Paper
Times cited : (654)

References (92)
  • 3
    • 85072516160
    • Automatic program transformations for virtual memory computers
    • W. Abu-Sufah, D. J. Kuck, and D. H. Lawrie, "Automatic program transformations for virtual memory computers," in Proc. Nat. Computer Conf., 1979, pp. 969-974.
    • (1979) Proc. Nat. Computer Conf. , pp. 969-974
    • Abu-Sufah, W.1    Kuck, D.J.2    Lawrie, D.H.3
  • 5
    • 0001775038
    • A catalogue of optimizing transformations
    • R. Rustin, Ed. Englewood Cliffs, NJ: Prentice-Hall
    • F. Allen and J. Cocke, "A catalogue of optimizing transformations," in Design and Optimization of Compilers, R. Rustin, Ed. Englewood Cliffs, NJ: Prentice-Hall, 1972, pp. 1-30.
    • (1972) Design and Optimization of Compilers , pp. 1-30
    • Allen, F.1    Cocke, J.2
  • 10
    • 84956865893
    • On the equivalence of two systems of affine recurrence equations
    • Heidelberg, Germany: Springer-Verlag
    • D. Barthou, P. Feautrier, and X. Redon, "On the equivalence of two systems of affine recurrence equations," in Lecture Notes in Computer Science, Euro-Par 2002. Heidelberg, Germany: Springer-Verlag, 2002, vol. 2400, pp. 309-313.
    • (2002) Lecture Notes in Computer Science, Euro-Par 2002 , vol.2400 , pp. 309-313
    • Barthou, D.1    Feautrier, P.2    Redon, X.3
  • 11
    • 0343462141
    • Automated empirical optimization of software and the ATLAS project
    • R. C. Whaley, A. Petitet, and J. J. Dongarra, "Automated empirical optimization of software and the ATLAS project," Parallel Comput., vol. 27, no. 1-2, pp. 3-35, 2001.
    • (2001) Parallel Comput. , vol.27 , Issue.1-2 , pp. 3-35
    • Whaley, R.C.1    Petitet, A.2    Dongarra, J.J.3
  • 14
    • 20744459570
    • A comparison of empirical and model-driven optimization
    • Feb.
    • K. Yotov, X. Li, G. Ren, M. Garzaran, D. Padua, K. Pingali, and P. Stodghill, "A comparison of empirical and model-driven optimization," Proc. IEEE, vol. 93, no. 2, pp. 358-386, Feb. 2005.
    • (2005) Proc. IEEE , vol.93 , Issue.2 , pp. 358-386
    • Yotov, K.1    Li, X.2    Ren, G.3    Garzaran, M.4    Padua, D.5    Pingali, K.6    Stodghill, P.7
  • 15
    • 1542501019
    • Sparsity: Optimization framework for sparse matrix kernels
    • E.-J. Im, K. Yelick, and R. Vuduc, "Sparsity: Optimization framework for sparse matrix kernels," Int. J. High Perform. Comput. Appl., vol. 18, no. 1, pp. 135-158, 2004.
    • (2004) Int. J. High Perform. Comput. Appl. , vol.18 , Issue.1 , pp. 135-158
    • Im, E.-J.1    Yelick, K.2    Vuduc, R.3
  • 18
    • 20744449792
    • The design and implementation of FFTW3
    • Feb.
    • M. Frigo and S. G. Johnson, "The design and implementation of FFTW3," Proc. IEEE, vol. 93, no. 2, pp. 216-231, Feb. 2005.
    • (2005) Proc. IEEE , vol.93 , Issue.2 , pp. 216-231
    • Frigo, M.1    Johnson, S.G.2
  • 19
    • 0031636309
    • FFTW: An adaptive software architecture for the FFT
    • [Online]
    • _, "FFTW: An adaptive software architecture for the FFT," in Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Processing (ICASSP), vol. 3, 1998, pp. 1381-1384. [Online]. Available: http://www.fftw.org.
    • (1998) Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Processing (ICASSP) , vol.3 , pp. 1381-1384
  • 24
    • 20744444976
    • Architecture-cognizant divide and conquer algorithms
    • Portland, OR
    • _, "Architecture-cognizant divide and conquer algorithms," presented at the Conf. Supercomputing, Portland, OR, 1999.
    • (1999) Conf. Supercomputing
  • 25
    • 0029322264
    • Unfavorable strides in cache memory systems
    • D. H. Bailey, "Unfavorable strides in cache memory systems," Sci. Program., vol. 4, pp. 53-58, 1995.
    • (1995) Sci. Program. , vol.4 , pp. 53-58
    • Bailey, D.H.1
  • 28
    • 20744446138
    • Parallel VSIPL++: An open standard software library for high-performance parallel signal processing
    • Feb.
    • J. Lebak, J. Kepner, H. Hoffmann, and E. Rutledge, "Parallel VSIPL++: An open standard software library for high-performance parallel signal processing," Proc. IEEE, vol. 93, no. 2, pp. 313-330, Feb. 2005.
    • (2005) Proc. IEEE , vol.93 , Issue.2 , pp. 313-330
    • Lebak, J.1    Kepner, J.2    Hoffmann, H.3    Rutledge, E.4
  • 30
    • 0026866013
    • Profile-guided automatic inline expansion for C programs
    • P. P. Chang, S. A. Mahlke, W. Y. Chen, and W. M. W. Hwu, "Profile-guided automatic inline expansion for C programs," Softw. Pract. Exper., vol. 22, no. 5, pp. 349-369, 1992.
    • (1992) Softw. Pract. Exper. , vol.22 , Issue.5 , pp. 349-369
    • Chang, P.P.1    Mahlke, S.A.2    Chen, W.Y.3    Hwu, W.M.W.4
  • 32
    • 20744451215
    • Information technology - JPEG 2000 image coding system - Part 1: Core coding system
    • ISO/IEC 15444-1
    • "Information Technology - JPEG 2000 Image Coding System - Part 1: Core Coding System," Int. Org. Standardization/Int. Electrotech. Comm., ISO/IEC 15444-1:2000.
    • (2000) Int. Org. Standardization/Int. Electrotech. Comm.
  • 33
    • 0025600627
    • A methodology for designing, modifying, and implementing Fourier transform algorithms on various architectures
    • J. R. Johnson, R. W. Johnson, D. Rodriguez, and R. Tolimieri, "A methodology for designing, modifying, and implementing Fourier transform algorithms on various architectures," Circuits, Syst., Signal Process., vol. 9, no. 4, pp. 449-500, 1990.
    • (1990) Circuits, Syst., Signal Process. , vol.9 , Issue.4 , pp. 449-500
    • Johnson, J.R.1    Johnson, R.W.2    Rodriguez, D.3    Tolimieri, R.4
  • 40
    • 18344410543
    • Factoring wavelet transforms into lifting steps
    • I. Daubechies and W. Sweldens, "Factoring wavelet transforms into lifting steps," J. Fourier Anal. Appl., vol. 4, no. 3, pp. 247-269, 1998.
    • (1998) J. Fourier Anal. Appl. , vol.4 , Issue.3 , pp. 247-269
    • Daubechies, I.1    Sweldens, W.2
  • 42
    • 0000459334
    • Rewriting
    • A. Robinson and A. Voronkov, Eds. New York: Elsevier, ch. 9
    • N. Dershowitz and D. A. Plaisted, "Rewriting," in Handbook of Automated Reasoning, A. Robinson and A. Voronkov, Eds. New York: Elsevier, 2001, vol. 1, ch. 9, pp. 535-610.
    • (2001) Handbook of Automated Reasoning , vol.1 , pp. 535-610
    • Dershowitz, N.1    Plaisted, D.A.2
  • 45
    • 20744433926
    • GAP - Groups, algorithms, and programming. GAP Team, Univ. St. Andrews, St. Andrews, U.K. [Online]
    • (1997) GAP - Groups, algorithms, and programming. GAP Team, Univ. St. Andrews, St. Andrews, U.K. [Online]. Available: http://www-gap.dcs.st-and.ac.uk/~gap/
    • (1997)
  • 50
    • 0026916192
    • Engineering a simple, efficient code-generator generator
    • C. W. Fraser, D. R. Hanson, and T. A. Proebsting, "Engineering a simple, efficient code-generator generator," ACM Lett. Program. Lang. Syst., vol. 1, no. 3, pp. 213-226, 1992.
    • (1992) ACM Lett. Program. Lang. Syst. , vol.1 , Issue.3 , pp. 213-226
    • Fraser, C.W.1    Hanson, D.R.2    Proebsting, T.A.3
  • 51
    • 0027274408
    • Implementation of efficient FFT algorithms on fused multiply-add architectures
    • Jan.
    • E. Linzer and E. Feig, "Implementation of efficient FFT algorithms on fused multiply-add architectures," IEEE Trans. Signal Process., vol. 41, no. 1, p. 93, Jan. 1993.
    • (1993) IEEE Trans. Signal Process. , vol.41 , Issue.1 , pp. 93
    • Linzer, E.1    Feig, E.2
  • 52
    • 0026396564
    • Implementation of multiply-add FFT algorithms for complex and real data sequences
    • C. Lu, "Implementation of multiply-add FFT algorithms for complex and real data sequences," in Proc. Int. Symp. Circuits and Systems (ISCAS), vol. 1, 1991, pp. 480-483.
    • (1991) Proc. Int. Symp. Circuits and Systems (ISCAS) , vol.1 , pp. 480-483
    • Lu, C.1
  • 55
  • 56
    • 19344363982
    • Efficient utilization of SIMD extensions
    • Feb.
    • F. Franchetti, S. Kral, J. Lorenz, and C. Ueberhuber, "Efficient utilization of SIMD extensions," Proc. IEEE, vol. 93, no. 2, pp. 409-425, Feb. 2005.
    • (2005) Proc. IEEE , vol.93 , Issue.2 , pp. 409-425
    • Franchetti, F.1    Kral, S.2    Lorenz, J.3    Ueberhuber, C.4
  • 58
    • 0031699606
    • On the automatic parallelization of the perfect benchmarks
    • Jan.
    • R. Eigenmann, J. Hoeflinger, and D. Padua, "On the automatic parallelization of the perfect benchmarks," IEEE Trans. Parallel Distrib. Syst., vol. 9, no. 1, pp. 5-23, Jan. 1998.
    • (1998) IEEE Trans. Parallel Distrib. Syst. , vol.9 , Issue.1 , pp. 5-23
    • Eigenmann, R.1    Hoeflinger, J.2    Padua, D.3
  • 64
    • 0343394643
    • Testing multivariate linear functions: Overcoming the generator bottleneck
    • F. Ergün, "Testing multivariate linear functions: Overcoming the generator bottleneck," in Proc. ACM Symp. Theory of Computing (STOC), vol. 2, 1995, pp. 407-416.
    • (1995) Proc. ACM Symp. Theory of Computing (STOC) , vol.2 , pp. 407-416
    • Ergün, F.1
  • 66
    • 20744431703
    • ser. CBMS-NSF Regional Conf. Ser. Appl. Math. Philadelphia, PA: SIAM
    • S. Winograd, Arithmetic Complexity of Computations, ser. CBMS-NSF Regional Conf. Ser. Appl. Math. Philadelphia, PA: SIAM, 1980.
    • (1980) Arithmetic Complexity of Computations
    • Winograd, S.1
  • 67
    • 1942477516
    • Automatic derivation and implementation of fast convolution algorithms
    • J. R. Johnson and A. F. Breitzman, "Automatic derivation and implementation of fast convolution algorithms," J. Symbol. Comput., vol. 37, no. 2, pp. 261-293, 2004.
    • (2004) J. Symbol. Comput. , vol.37 , Issue.2 , pp. 261-293
    • Johnson, J.R.1    Breitzman, A.F.2
  • 70
    • 0021513104
    • Some complexity issues in digital signal processing
    • Oct.
    • P. R. Cappello and K. Steiglitz, "Some complexity issues in digital signal processing," IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-32, no. 5, pp. 1037-1041, Oct. 1984.
    • (1984) IEEE Trans. Acoust., Speech, Signal Process. , vol.ASSP-32 , Issue.5 , pp. 1037-1041
    • Cappello, P.R.1    Steiglitz, K.2
  • 74
    • 20744458364
    • Information technology-coding of moving pictures and associated audio for digital storage media at up to about 1.5 Mbits/s
    • ISO/IEC 11 172
    • "Information technology-coding of moving pictures and associated audio for digital storage media at up to about 1.5 Mbits/s," Int. Org. Standardization/Int. Electrotech. Comm., ISO/IEC 11 172, 1995.
    • (1995) Int. Org. Standardization/Int. Electrotech. Comm.
  • 77
    • 20744438614
    • Dataflow analysis of the FFT
    • Dept. Comput. Sci., Drexel Univ., Philadelphia, PA
    • A. Parekh and J. R. Johnson, "Dataflow analysis of the FFT," Dept. Comput. Sci., Drexel Univ., Philadelphia, PA, Tech. Rep. DU-CS-2004-01, 2004.
    • (2004) Tech. Rep. , vol.DU-CS-2004-01
    • Parekh, A.1    Johnson, J.R.2
  • 79
    • 20744449261
    • Distribution of a class of divide and conquer recurrences arising from the computation of the Walsh-Hadamard transform
    • submitted for publication
    • P. Hitczenko, H.-J. Huang, and J. R. Johnson, "Distribution of a class of divide and conquer recurrences arising from the computation of the Walsh-Hadamard transform," Theor. Comput. Sci., 2003, submitted for publication.
    • (2003) Theor. Comput. Sci.
    • Hitczenko, P.1    Huang, H.-J.2    Johnson, J.R.3
  • 82
    • 0141696394
    • Stochastic search for signal processing algorithm optimization
    • B. Singer and M. Veloso, "Stochastic search for signal processing algorithm optimization," Proc. Supercomputing, 2001.
    • (2001) Proc. Supercomputing
    • Singer, B.1    Veloso, M.2
  • 83
    • 0013103910
    • Ph.D. dissertation, Dept. Comput. Sci., Faculty Sci., Univ. Porto, Porto, Portugal
    • L. Torgo, "Inductive learning of tree-based regression models," Ph.D. dissertation, Dept. Comput. Sci., Faculty Sci., Univ. Porto, Porto, Portugal, 1999.
    • (1999) Inductive Learning of Tree-based Regression Models
    • Torgo, L.1
  • 84
    • 0141496142
    • Learning to construct fast signal processing implementations
    • B. Singer and M. Veloso, "Learning to construct fast signal processing implementations," J. Mach. Learn. Res., vol. 3, pp. 887-919, 2002.
    • (2002) J. Mach. Learn. Res. , vol.3 , pp. 887-919
    • Singer, B.1    Veloso, M.2
  • 85
    • 20744432440
    • Learning to generate fast signal processing implementations
    • _, "Learning to generate fast signal processing implementations," in Proc. Int. Conf. Machine Learning, 2001, pp. 529-536.
    • (2001) Proc. Int. Conf. Machine Learning , pp. 529-536
  • 86
    • 0036684741
    • Automating the modeling and optimization of the performance of signal transforms
    • Aug.
    • B. Singer and M. M. Veloso, "Automating the modeling and optimization of the performance of signal transforms," IEEE Trans. Signal Process., vol. 50, no. 8, pp. 2003-2014, Aug. 2002.
    • (2002) IEEE Trans. Signal Process. , vol.50 , Issue.8 , pp. 2003-2014
    • Singer, B.1    Veloso, M.M.2
  • 88
    • 20744435264
    • Ph.D. dissertation, Dept. Elect. Comput. Eng., Carnegie Mellon Univ., Pittsburgh, PA
    • Ph.D. dissertation, Dept. Elect. Comput. Eng., Carnegie Mellon Univ., Pittsburgh, PA.


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.