메뉴 건너뛰기




Volumn 18, Issue 1, 2004, Pages 65-94

Statistical models for empirical search-based performance tuning

Author keywords

Algorithm selection; Automatic performance tuning; Early stopping; Feedback directed optimization; Matrix multiplication; Performance distribution; Performance optimization; Software engineering; Support vector method

Indexed keywords

ALGORITHMS; CODES (SYMBOLS); COMPUTER ARCHITECTURE; COMPUTER HARDWARE; DIGITAL LIBRARIES; FAST FOURIER TRANSFORMS; LINEAR ALGEBRA; OPTIMIZATION; PROBLEM SOLVING; PROGRAM COMPILERS; STATISTICAL METHODS; VECTORS;

EID: 1542710758     PISSN: 10943420     EISSN: None     Source Type: Journal    
DOI: 10.1177/1094342004041293     Document Type: Article
Times cited : (83)

References (97)
  • 7
    • 85088338365 scopus 로고    scopus 로고
    • Run-time interprocedural data placement optimization for lazy parallel libraries
    • August; Springer-Verlag, Berlin
    • Beckmann, O. and Kelley, P. H. J. August 1997. Run-time interprocedural data placement optimization for lazy parallel libraries. In EuroPar, Lecture Notes in Computer Science, Springer-Verlag, Berlin.
    • (1997) EuroPar, Lecture Notes in Computer Science
    • Beckmann, O.1    Kelley, P.H.J.2
  • 12
    • 84937351638 scopus 로고
    • Numerical tabulation of the distribution of kolmogorov's statistic for finite sample size
    • Birnbaum, Z. W. 1952. Numerical tabulation of the distribution of Kolmogorov's statistic for finite sample size. Journal of the American Statistical Association 47:425-441.
    • (1952) Journal of the American Statistical Association , vol.47 , pp. 425-441
    • Birnbaum, Z.W.1
  • 13
    • 84891471315 scopus 로고    scopus 로고
    • Document for the basic linear algebra subprograms (BLAS) standard: BLAS technical forum
    • Blackford, S. et al. 2001. Document for the Basic Linear Algebra Subprograms (BLAS) standard: BLAS Technical Forum, http://www.netlib.org/blas/blast-forum.
    • Blackford, S.1
  • 17
    • 0026368758 scopus 로고
    • Using profile information to assist classic code optimizations
    • Chang, P. P., Mahlke, S. A., and Hwu, W. W. 1991. Using profile information to assist classic code optimizations. Software - Practice and Experience 21(12):1301-1321.
    • (1991) Software - Practice and Experience , vol.21 , Issue.12 , pp. 1301-1321
    • Chang, P.P.1    Mahlke, S.A.2    Hwu, W.W.3
  • 26
    • 28244496090 scopus 로고    scopus 로고
    • Benchmarking optimization software with performance profiles
    • Dolan, E. D. and Moré, J. J. 2002. Benchmarking optimization software with performance profiles. Mathematical Programming 91:201-213.
    • (2002) Mathematical Programming , vol.91 , pp. 201-213
    • Dolan, E.D.1    Moré, J.J.2
  • 36
  • 38
    • 84976736522 scopus 로고
    • Gprof: A call graph execution profiler
    • Graham, S. L., Kessler, P. B., and McKusick, M. K. 1982. gprof: a call graph execution profiler. SIGPLAN Notices 17(6):120-126.
    • (1982) SIGPLAN Notices , vol.17 , Issue.6 , pp. 120-126
    • Graham, S.L.1    Kessler, P.B.2    McKusick, M.K.3
  • 39
    • 0026965653 scopus 로고
    • Eliminating branches using a superoptimizer and the GNU C compiler
    • Granlund, T. and Krenner, R. 1992. Eliminating branches using a superoptimizer and the GNU C compiler. SIGPLAN Notices 27(7):341-352.
    • (1992) SIGPLAN Notices , vol.27 , Issue.7 , pp. 341-352
    • Granlund, T.1    Krenner, R.2
  • 47
    • 0032155271 scopus 로고    scopus 로고
    • GEMM-based level 3 BLAS: High-performance model implementations and performance evaluation benchmark
    • Kagstrom, B., Ling, P., and Loan, C. V. 1998. GEMM-based level 3 BLAS: high-performance model implementations and performance evaluation benchmark. ACM Transactions on Mathematical Software 24(3):268-302.
    • (1998) ACM Transactions on Mathematical Software , vol.24 , Issue.3 , pp. 268-302
    • Kagstrom, B.1    Ling, P.2    Loan, C.V.3
  • 50
    • 84983965442 scopus 로고
    • An empirical study of FORTRAN programs
    • Knuth, D. 1971. An empirical study of FORTRAN programs. Software - Practice and Experience 1(2):105-133.
    • (1971) Software - Practice and Experience , vol.1 , Issue.2 , pp. 105-133
    • Knuth, D.1
  • 55
    • 85039586972 scopus 로고    scopus 로고
    • Delayed evaluation, self-optimising software components as a programming model
    • August; Paderborn, Germany
    • Liniker, P., Beckmann, O., and Kelly, P. H. J. August 2002. Delayed evaluation, self-optimising software components as a programming model. In Euro-Par, Paderborn, Germany.
    • (2002) Euro-Par
    • Liniker, P.1    Beckmann, O.2    Kelly, P.H.J.3
  • 57
    • 0029204029 scopus 로고
    • Automatic benchmark generation for cache optimization of matrix algorithms
    • March; R. Geist and S. Junkins, editors, ACM, New York
    • McCalpin, J. D. and Smotherman, M. March 1995. Automatic benchmark generation for cache optimization of matrix algorithms. In Proceedings of the 33rd Annual Southeast Conference, Clemson, SC, USA, R. Geist and S. Junkins, editors, ACM, New York, pp. 195-204.
    • (1995) Proceedings of the 33rd Annual Southeast Conference, Clemson, SC, USA , pp. 195-204
    • McCalpin, J.D.1    Smotherman, M.2
  • 64
    • 0012987759 scopus 로고
    • Note on the Kolmogorov statistic in the discrete case
    • Noether, G. E. 1963. Note on the Kolmogorov statistic in the discrete case. Metrika 7:115-116.
    • (1963) Metrika , vol.7 , pp. 115-116
    • Noether, G.E.1
  • 65
    • 25944474315 scopus 로고    scopus 로고
    • Cache-oblivious algorithms in practice
    • Master's thesis, University of Copenhagen, Copenhagen, Denmark
    • Olsen, J. H. and Skov, S. C. 2002. Cache-oblivious algorithms in practice, Master's thesis, University of Copenhagen, Copenhagen, Denmark.
    • (2002)
    • Olsen, J.H.1    Skov, S.C.2
  • 66
    • 85117254435 scopus 로고    scopus 로고
    • On increasing architecture awareness in program optimizations to bridge the gap between peak and sustained processor performance-matrix multiply revisited
    • Parello, D., Temam, O., and Verdun, J.-M. November 2002. On increasing architecture awareness in program optimizations to bridge the gap between peak and sustained processor performance-matrix multiply revisited. In Proceedings of the IEEE/ACM Conference on Supercomputing, Baltimore, MD.
    • (2002) Proceedings of the IEEE/ACM Conference on Supercomputing, Baltimore, MD
    • Parello, D.1    Temam, O.2    Verdun, J.-M.3
  • 69
    • 0003120218 scopus 로고    scopus 로고
    • Fast training of support vector machines using sequential minimal optimization
    • January; B. Schölkopf, C. Burges, and A. Smola, editors, MIT Press, Cambridge, MA
    • Platt, J. January 1999. Fast training of support vector machines using sequential minimal optimization. In Advances in Kernel Methods - Support Vector Learning, B. Schölkopf, C. Burges, and A. Smola, editors, MIT Press, Cambridge, MA, pp. 185-208.
    • (1999) Advances in Kernel Methods - Support Vector Learning , pp. 185-208
    • Platt, J.1
  • 73
    • 0003056605 scopus 로고
    • The algorithm selection problem
    • Rice, J. R. 1976. The algorithm selection problem. Advances in Computers 15:65-118.
    • (1976) Advances in Computers , vol.15 , pp. 65-118
    • Rice, J.R.1
  • 75
    • 84957579840 scopus 로고
    • Extending the Hong-Kung model to memory hierarchies
    • D.-Z. Du and M. Li, editors, LNCS; Springer-Verlag, Berlin
    • Savage, J. E. 1995. Extending the Hong-Kung model to memory hierarchies. In Computing and Combinatorics, D.-Z. Du and M. Li, editors, LNCS Vol. 959, Springer-Verlag, Berlin, pp. 270-281.
    • (1995) Computing and Combinatorics , vol.959 , pp. 270-281
    • Savage, J.E.1
  • 77
    • 1542706831 scopus 로고    scopus 로고
    • A rational approach to portable high performance: The basic linear algebra instruction set (BLAIS) and the fixed algorithm size template (FAST) library
    • Siek, J. G. and Lumsdaine, A. 1998. A rational approach to portable high performance: the Basic Linear Algebra Instruction Set (BLAIS) and the Fixed Algorithm Size Template (FAST) library. In Proceedings of ECOOP, Brussels, Belgium.
    • (1998) Proceedings of ECOOP, Brussels, Belgium
    • Siek, J.G.1    Lumsdaine, A.2
  • 79
    • 0003401675 scopus 로고    scopus 로고
    • A tutorial on support vector regression
    • Technical Report NC2-TR-1998-030. European Community ESPRIT Working Group in Neural and Computational Learning Theory
    • Smola, A. J. and Schölkopf, B. 1998. A tutorial on support vector regression, Technical Report NC2-TR-1998-030. European Community ESPRIT Working Group in Neural and Computational Learning Theory. http://www.neurocolt.com.
    • (1998)
    • Smola, A.J.1    Schölkopf, B.2
  • 83
    • 0031496750 scopus 로고    scopus 로고
    • Locality of reference in LU decomposition with partial pivoting
    • Toledo, S. 1997. Locality of reference in LU decomposition with partial pivoting, SIAM Journal on Matrix Analysis and Applications 18(4):1065-1081.
    • (1997) SIAM Journal on Matrix Analysis and Applications , vol.18 , Issue.4 , pp. 1065-1081
    • Toledo, S.1
  • 89
    • 84947558148 scopus 로고    scopus 로고
    • Arrays in Blitz++
    • Springer-Verlag, Berlin
    • Veldhuizen, T. 1998. Arrays in Blitz++. In Proceedings of ISCOPE, LNCS Vol. 1505, Springer-Verlag, Berlin.
    • (1998) Proceedings of ISCOPE, LNCS , vol.1505
    • Veldhuizen, T.1
  • 93
    • 0343462141 scopus 로고    scopus 로고
    • Automated empirical optimizations of software and the ATLAS project
    • Whaley, R. C., Petitet, A., and Dongarra, J. 2001. Automated empirical optimizations of software and the ATLAS project. Parallel Computing 27(1):3-25.
    • (2001) Parallel Computing , vol.27 , Issue.1 , pp. 3-25
    • Whaley, R.C.1    Petitet, A.2    Dongarra, J.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.