메뉴 건너뛰기




Volumn 5658 LNCS, Issue , 2009, Pages 385-409

Operator language: A program generation framework for fast kernels

Author keywords

Automatic performance tuning; High performance software; Library generation; Multicore CPU; Program generation

Indexed keywords

AUTOMATIC PERFORMANCE TUNING; HIGH PERFORMANCE SOFTWARE; LIBRARY GENERATION; MULTICORE CPU; PROGRAM GENERATION; CIRCULAR CONVOLUTIONS; MATRIX MATRIX MULTIPLICATIONS; MULTI-CORE CPUS; REWRITING SYSTEMS; STRUCTURAL ARCHITECTURE;

EID: 69049084626     PISSN: 03029743     EISSN: 16113349     Source Type: Book Series    
DOI: 10.1007/978-3-642-03034-5_18     Document Type: Conference Paper
Times cited : (39)

References (46)
  • 1
    • 69049096837 scopus 로고    scopus 로고
    • Intel: Integrated Performance Primitives 5.3, User Guide
    • Intel: Integrated Performance Primitives 5.3, User Guide
  • 2
    • 57049117343 scopus 로고    scopus 로고
    • How to write fast numerical code: A small introduction
    • Lämmel, R, Visser, J, Saraiva, J, eds, Generative and Transformational Techniques in Software Engineering II, Springer, Heidelberg
    • Chellappa, S., Franchetti, F., Püschel, M.: How to write fast numerical code: A small introduction. In: Lämmel, R., Visser, J., Saraiva, J. (eds.) Generative and Transformational Techniques in Software Engineering II. LNCS, vol. 5235, pp. 196-259. Springer, Heidelberg (2008)
    • (2008) LNCS , vol.5235 , pp. 196-259
    • Chellappa, S.1    Franchetti, F.2    Püschel, M.3
  • 6
    • 38049144052 scopus 로고    scopus 로고
    • A rewriting system for the vectorization of signal transforms
    • Daydé, M, Palma, J.M.L.M, Coutinho, Á .L.G.A, Pacitti, E, Lopes, J.C, eds, VECPAR 2006, Springer, Heidelberg
    • Franchetti, F., Voronenko, Y., Püschel, M.: A rewriting system for the vectorization of signal transforms. In: Daydé, M., Palma, J.M.L.M., Coutinho, Á .L.G.A., Pacitti, E., Lopes, J.C. (eds.) VECPAR 2006. LNCS, vol. 4395, pp. 363-377. Springer, Heidelberg (2007)
    • (2007) LNCS , vol.4395 , pp. 363-377
    • Franchetti, F.1    Voronenko, Y.2    Püschel, M.3
  • 7
    • 69049083172 scopus 로고    scopus 로고
    • GPCE: ACM conference on generative programming and component engineering
    • GPCE: ACM conference on generative programming and component engineering
  • 11
    • 69049089562 scopus 로고    scopus 로고
    • Smith, D.R.: Mechanizing the development of software. In: Broy, M. (ed.) Calculational System Design, Proc. of the International Summer School Marktoberdorf. NATO ASI Series. IOS Press, Amsterdam (1999); Kestrel Institute Technical Report KES.U.99.1
    • Smith, D.R.: Mechanizing the development of software. In: Broy, M. (ed.) Calculational System Design, Proc. of the International Summer School Marktoberdorf. NATO ASI Series. IOS Press, Amsterdam (1999); Kestrel Institute Technical Report KES.U.99.1
  • 12
    • 67649530725 scopus 로고
    • Little language processing, an alternative to courses on compiler construction
    • Gough, K.J.: Little language processing, an alternative to courses on compiler construction. SIGCSE Bulletin 13(3), 31-34 (1981)
    • (1981) SIGCSE Bulletin , vol.13 , Issue.3 , pp. 31-34
    • Gough, K.J.1
  • 13
    • 84976711318 scopus 로고
    • Programming pearls: Little languages
    • Bentley, J.: Programming pearls: little languages. Communications of the ACM 29(8), 711- 721 (1986)
    • (1986) Communications of the ACM , vol.29 , Issue.8 , pp. 711-721
    • Bentley, J.1
  • 15
    • 84947255563 scopus 로고    scopus 로고
    • DSL implementation in MetaOCaml, Template Haskell, and C++
    • Lengauer, C, Batory, D, Consel, C, Odersky, M, eds, Domain-Specific Program Generation, Springer, Heidelberg
    • Czarnecki, K., O'Donnell, J., Striegnitz, J., Taha, W.: DSL implementation in MetaOCaml, Template Haskell, and C++. In: Lengauer, C., Batory, D., Consel, C., Odersky, M. (eds.) Domain-Specific Program Generation. LNCS, vol. 3016, pp. 51-72. Springer, Heidelberg(2004)
    • (2004) LNCS , vol.3016 , pp. 51-72
    • Czarnecki, K.1    O'Donnell, J.2    Striegnitz, J.3    Taha, W.4
  • 17
    • 0002515795 scopus 로고    scopus 로고
    • Automatically Tuned Linear Algebra Software (ATLAS)
    • Whaley, R.C., Dongarra, J.: Automatically Tuned Linear Algebra Software (ATLAS). In: Proc. Supercomputing (1998), math-atlas.sourceforge.net
    • (1998) Proc. Supercomputing
    • Whaley, R.C.1    Dongarra, J.2
  • 23
    • 0000459334 scopus 로고    scopus 로고
    • Rewriting
    • Robinson, A, Voronkov, A, eds, Elsevier, Amsterdam
    • Dershowitz, N., Plaisted, D.A.: Rewriting. In: Robinson, A., Voronkov, A. (eds.) Handbook of Automated Reasoning, vol. 1, pp. 535-610. Elsevier, Amsterdam (2001)
    • (2001) Handbook of Automated Reasoning , vol.1 , pp. 535-610
    • Dershowitz, N.1    Plaisted, D.A.2
  • 24
    • 69049091660 scopus 로고    scopus 로고
    • Nilsson, U., Maluszynski, J.: Logic, Programming and Prolog, 2nd edn. John Wiley & Sons Inc., Chichester (1995)
    • Nilsson, U., Maluszynski, J.: Logic, Programming and Prolog, 2nd edn. John Wiley & Sons Inc., Chichester (1995)
  • 27
    • 0343462141 scopus 로고    scopus 로고
    • Automated empirical optimization of software and the ATLAS project
    • Whaley, R.C., Petitet, A., Dongarra, J.J.: Automated empirical optimization of software and the ATLAS project. Parallel Computing 27(1-2), 3-35 (2001)
    • (2001) Parallel Computing , vol.27 , Issue.1-2 , pp. 3-35
    • Whaley, R.C.1    Petitet, A.2    Dongarra, J.J.3
  • 30
    • 51049115051 scopus 로고    scopus 로고
    • PhD thesis, Electrical and Computer Engineering, Carnegie Mellon University
    • Voronenko, Y.: Library Generation for Linear Transforms. PhD thesis, Electrical and Computer Engineering, Carnegie Mellon University (2008)
    • (2008) Library Generation for Linear Transforms
    • Voronenko, Y.1
  • 32
    • 85154002090 scopus 로고
    • Sorting networks and their applications
    • Batcher, K.: Sorting networks and their applications. In: Proc. AFIPS Spring Joint Comput. Conf., vol. 32, pp. 307-314 (1968)
    • (1968) Proc. AFIPS Spring Joint Comput. Conf , vol.32 , pp. 307-314
    • Batcher, K.1
  • 33
    • 84935113569 scopus 로고
    • Error bounds for convolutional codes and an asymptotically optimum decoding algorithm
    • Viterbi, A.: Error bounds for convolutional codes and an asymptotically optimum decoding algorithm. IEEE Transactions on Information Theory 13(2), 260-269 (1967)
    • (1967) IEEE Transactions on Information Theory , vol.13 , Issue.2 , pp. 260-269
    • Viterbi, A.1
  • 41
    • 69049110689 scopus 로고    scopus 로고
    • Intel: Math Kernel Library 10.0, Reference Manual
    • Intel: Math Kernel Library 10.0, Reference Manual
  • 42
    • 69049096836 scopus 로고    scopus 로고
    • Goto, K.: GotoBLAS 1.26 (2008), http://www.tacc.utexas.edu/resources/ software/#blas
    • (2008) GotoBLAS 1.26
    • Goto, K.1
  • 44
    • 69049101756 scopus 로고    scopus 로고
    • Implementation of polar format SAR image formation on the IBM Cell Broadband Engine
    • Rudin, J.A.: Implementation of polar format SAR image formation on the IBM Cell Broadband Engine. In: Proc. High Performance Embedded Computing (HPEC) (2007)
    • (2007) Proc. High Performance Embedded Computing (HPEC)
    • Rudin, J.A.1
  • 45
    • 69049107251 scopus 로고    scopus 로고
    • Karn, P, FEC library version 3.0.1 August 2007
    • Karn, P.: FEC library version 3.0.1 (August 2007), http://www.ka9q.net/ code/fec/


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.