메뉴 건너뛰기




Volumn , Issue , 2009, Pages

Annotation-based empirical performance tuning using orio

Author keywords

[No Author keywords available]

Indexed keywords

AUTOMATIC PARALLELIZATION; CODE CHANGES; CODE MODIFICATIONS; CODE OPTIMIZATION; COMPUTATIONAL KERNELS; DENSE ARRAYS; EMPIRICAL PERFORMANCE; HIGH-PERFORMANCE ARCHITECTURE; NON-INTRUSIVE; PARALLEL CODE; PERFORMANCE OPTIMIZATIONS; PERFORMANCE TUNING; PROCESSING INFRASTRUCTURES; SCIENTIFIC APPLICATIONS; SOFTWARE DEVELOPER; SOURCE CODES; SPARSE MATRICES; TUNING SYSTEM;

EID: 70449793159     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/IPDPS.2009.5161004     Document Type: Conference Paper
Times cited : (96)

References (36)
  • 2
    • 38049177237 scopus 로고    scopus 로고
    • Performance evaluation of scientific applications on modern parallel vector systems
    • VECPAR, J. Daydé, J. M. L. M. Palma, A. L. G. A. Coutinho, E. Pacitti, and J. C. Lopes, Eds, 4395. Springer
    • J. Carter, L. Oliker, and J. Shalf, "Performance evaluation of scientific applications on modern parallel vector systems," in VECPAR, ser. Lecture Notes in Computer Science, M. J. Daydé, J. M. L. M. Palma, A. L. G. A. Coutinho, E. Pacitti, and J. C. Lopes, Eds., vol. 4395. Springer, 2006, pp. 490-503.
    • (2006) ser. Lecture Notes in Computer Science , vol.1000 , pp. 490-503
    • Carter, J.1    Oliker, L.2    Shalf, J.3
  • 7
    • 0343462141 scopus 로고    scopus 로고
    • Automated empirical optimization of software and the ATLAS project
    • R. C. Whaley, A. Petitet, and J. J. Dongarra, "Automated empirical optimization of software and the ATLAS project," Parallel Computing, vol. 27, no. 1-2, pp. 3-35, 2001.
    • (2001) Parallel Computing , vol.27 , Issue.1-2 , pp. 3-35
    • Whaley, R.C.1    Petitet, A.2    Dongarra, J.J.3
  • 9
    • 24344485098 scopus 로고    scopus 로고
    • R. Vuduc, J. Demmel, and K. Yelick, OSKI: A library of automatically tuned sparse matrix kernels, in Proceedings of SciDAC 2005, ser. Journal of Physics: Conference Series, 16. Institute of Physics Publishing, June 2005, pp. 521-530.
    • R. Vuduc, J. Demmel, and K. Yelick, "OSKI: A library of automatically tuned sparse matrix kernels," in Proceedings of SciDAC 2005, ser. Journal of Physics: Conference Series, vol. 16. Institute of Physics Publishing, June 2005, pp. 521-530.
  • 10
    • 0030661485 scopus 로고    scopus 로고
    • Optimizing matrix multiply using PHiPAC: A portable, high-performance, ANSI C codingmethodology
    • J. Bilmes, K. Asanovic, C.-W. Chin, and J. Demmel, "Optimizing matrix multiply using PHiPAC: A portable, high-performance, ANSI C codingmethodology," in International Conference on Supercomputing, 1997, pp. 340-347.
    • (1997) International Conference on Supercomputing , pp. 340-347
    • Bilmes, J.1    Asanovic, K.2    Chin, C.-W.3    Demmel, J.4
  • 11
    • 0031636309 scopus 로고    scopus 로고
    • FFTW: An adaptive software architecture for the FFT
    • M. Frigo, "FFTW: An adaptive software architecture for the FFT," in Proceedings of the ICASSP Conference, vol. 3, 1998, p. 1381.
    • (1998) Proceedings of the ICASSP Conference , vol.3 , pp. 1381
    • Frigo, M.1
  • 13
    • 70449783253 scopus 로고    scopus 로고
    • K. Goto, "GotoBLAS," http://www.tacc.utexas.edu/resources/ software/,2007.
    • (2007) GotoBLAS
    • Goto, K.1
  • 15
    • 84947901718 scopus 로고    scopus 로고
    • A rational approach to portable high performance: The basic linear algebra instruction set (BLAIS) and the fixed algorithm size template (FAST) library
    • ECOOP Workshops, S. Demeyer and J. Bosch, Eds, Springer
    • J. G. Siek and A. Lumsdaine, "A rational approach to portable high performance: The basic linear algebra instruction set (BLAIS) and the fixed algorithm size template (FAST) library," in ECOOP Workshops, ser. Lecture Notes in Computer Science, S. Demeyer and J. Bosch, Eds., vol. 1543. Springer, 1998, pp. 468-469.
    • (1998) ser. Lecture Notes in Computer Science , vol.1543 , pp. 468-469
    • Siek, J.G.1    Lumsdaine, A.2
  • 17
    • 43949129775 scopus 로고    scopus 로고
    • Language for the compact representation of multiple program versions
    • Proceedings of Languages and Compilers for Parallel Computing LCPC05, Germany: Springer-Verlag
    • S. Donadio, J. Brodman, T. Roeder, K. Yotov, D. Barthou, A. Cohen,M. J. Garzarán, D. Padua, and K. Pingali, "Language for the compact representation of multiple program versions," in Proceedings of Languages and Compilers for Parallel Computing (LCPC05), ser. Lecture Notes in Computer Science. Germany: Springer-Verlag, 2006, no. 4339, pp. 136-151.
    • (2006) ser. Lecture Notes in Computer Science , Issue.4339 , pp. 136-151
    • Donadio, S.1    Brodman, J.2    Roeder, T.3    Yotov, K.4    Barthou, D.5    Cohen, A.6    Garzarán, M.J.7    Padua, D.8    Pingali, K.9
  • 18
    • 20744452343 scopus 로고    scopus 로고
    • Broadway: A compiler for exploiting the domain-specific semantics of software libraries
    • July
    • C. Lin and S. Z. Guyer, "Broadway: A compiler for exploiting the domain-specific semantics of software libraries," Proceedings of the IEEE, vol. 93, no. 2, pp. 342-357, July 2005.
    • (2005) Proceedings of the IEEE , vol.93 , Issue.2 , pp. 342-357
    • Lin, C.1    Guyer, S.Z.2
  • 19
    • 68849088002 scopus 로고    scopus 로고
    • Telescoping languages project description
    • http: //telescoping.rice.edu
    • K. Kennedy et al., "Telescoping languages project description," http: //telescoping.rice.edu/, 2006.
    • (2006)
    • Kennedy, K.1
  • 22
    • 70449756060 scopus 로고    scopus 로고
    • T. Veldhuizen, Expression templates, C++ Report, 7, no. 5, pp. 26-31, June 1995.
    • T. Veldhuizen, "Expression templates," C++ Report, vol. 7, no. 5, pp. 26-31, June 1995.
  • 26
    • 70449935802 scopus 로고    scopus 로고
    • Orio project, trac.mcs.anl.gov/projects/performance/orio
    • "Orio project," trac.mcs.anl.gov/projects/performance/orio, 2008.
    • (2008)
  • 27
    • 0032251894 scopus 로고    scopus 로고
    • Convergence properties of the Nelder-Mead simplex method in low dimensions
    • J. C. Lagarias, J. A. Reeds, M. H. Wright, and P. E. Wright, "Convergence properties of the Nelder-Mead simplex method in low dimensions," SIAM Journal of Optimization, vol. 9, pp. 112-147, 1998.
    • (1998) SIAM Journal of Optimization , vol.9 , pp. 112-147
    • Lagarias, J.C.1    Reeds, J.A.2    Wright, M.H.3    Wright, P.E.4
  • 29
    • 26444479778 scopus 로고
    • Optimization by simulated annealing
    • S. Kirkpatrick, C. D. Gelatt, and M. P. Vecchi, "Optimization by simulated annealing," Science, vol. 220, pp. 671-680, 1983.
    • (1983) Science , vol.220 , pp. 671-680
    • Kirkpatrick, S.1    Gelatt, C.D.2    Vecchi, M.P.3
  • 30
    • 0348126362 scopus 로고    scopus 로고
    • Optimized unrolling of nested loops
    • V. Sarkar, "Optimized unrolling of nested loops," Int. J. Parallel Program., vol. 29, no. 5, pp. 545-581, 2001.
    • (2001) Int. J. Parallel Program , vol.29 , Issue.5 , pp. 545-581
    • Sarkar, V.1
  • 32
    • 84976736522 scopus 로고
    • gprof: A call graph execution profiler
    • Jun
    • S. L. Graham, P. B. Kessler, and M. K. McKusick, "gprof: A call graph execution profiler," SIGPLAN Notices, vol. 17, no. 6, p. 120, Jun. 1982.
    • (1982) SIGPLAN Notices , vol.17 , Issue.6 , pp. 120
    • Graham, S.L.1    Kessler, P.B.2    McKusick, M.K.3
  • 33
    • 70449759963 scopus 로고    scopus 로고
    • S. Balay, K. Buschelman, V. Eijkhout, W. D. Gropp, D. Kaushik, M. G. Knepley, L. C. McInnes, B. F. Smith, and H. Zhang, PETSc Users Manual, Argonne National Laboratory, Tech. Rep. ANL-95/11 - Revision 2.1.5, 2004.
    • S. Balay, K. Buschelman, V. Eijkhout, W. D. Gropp, D. Kaushik, M. G. Knepley, L. C. McInnes, B. F. Smith, and H. Zhang, "PETSc Users Manual," Argonne National Laboratory, Tech. Rep. ANL-95/11 - Revision 2.1.5, 2004.
  • 34
    • 10044233808 scopus 로고    scopus 로고
    • Automatic performance tuning of sparse matrix kernels,
    • Ph.D. dissertation, University of California, Berkeley, December
    • R. W. Vuduc, "Automatic performance tuning of sparse matrix kernels," Ph.D. dissertation, University of California, Berkeley, December 2003.
    • (2003)
    • Vuduc, R.W.1
  • 35
    • 0347017866 scopus 로고    scopus 로고
    • Pseudo-transient continuation and differential-algebraic equations
    • T. S. Coffey, C. T. Kelley, and D. E. Keyes, "Pseudo-transient continuation and differential-algebraic equations," SIAM J. Sci. Comput., vol. 25, no. 2, pp. 553-569, 2003.
    • (2003) SIAM J. Sci. Comput , vol.25 , Issue.2 , pp. 553-569
    • Coffey, T.S.1    Kelley, C.T.2    Keyes, D.E.3
  • 36
    • 70449871737 scopus 로고    scopus 로고
    • The Pluto automatic parallelizer, sourceforge.net/projects/ pluto-compiler.
    • "The Pluto automatic parallelizer," sourceforge.net/projects/ pluto-compiler.


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.