Volume , Issue , 2010, Pages

An auto-tuning framework for parallel multicore stencil computations

Author keywords

[No Author keywords available]

Indexed keywords

AUTOTUNING; BARCELONA; DOMAIN SPECIFIC; FORTRAN; FORTRAN 95; MULTI CORE; MULTI-CORE SYSTEMS; PARALLEL IMPLEMENTATIONS; PERFORMANCE GAIN; PERFORMANCE PORTABILITY; PROGRAMMER PRODUCTIVITY; SINGLE KERNEL; STENCIL COMPUTATIONS

EID: 77954022347     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/IPDPS.2010.5470421     Document Type: Conference Paper
Times cited : (158)

References (26)
  • 1
    • 35648995516
    • The landscape of parallel computing research: A view from Berkeley
    • University of California, Berkeley
    • K. Asanovic, R. Bodik, B. Catanzaro, et al. The landscape of parallel computing research: A view from Berkeley. Technical Report UCB/EECS-2006-183, EECS, University of California, Berkeley, 2006.
    • (2006) Technical Report UCB/EECS-2006-183, EECS
    • Asanovic, K.1    Bodik, R.2    Catanzaro, B.3
  • 3
    • 67650673172
    • Stencil computation optimization and auto-tuning on state-of-the-art multicore architectures
    • SC '08, Austin, Texas
    • K. Datta, M. Murphy, V. Volkov, et al. Stencil Computation Optimization and Auto-Tuning on State-of-the-art Multicore Architectures. In Proceedings of SC '08, Austin, Texas, 2008.
    • (2008) Proceedings of SC '08
    • Datta, K.1    Murphy, M.2    Volkov, V.3
  • 5
    • 59749100826
    • Optimization and performance modeling of stencil computations on modern microprocessors
    • Kaushik Datta, Shoaib Kamil, Samuel Williams, Leonid Oliker, John Shalf, and Katherine Yelick. Optimization and performance modeling of stencil computations on modern microprocessors. SIAM Review, 51(1):129-159, 2009.
    • (2009) SIAM Review , vol.51 , Issue.1 , pp. 129-159
    • Datta, K.1    Kamil, S.2    Williams, S.3    Oliker, L.4    Shalf, J.5    Yelick, K.6
  • 7
    • 57349139452
    • A practical automatic polyhedral parallelizer and locality optimizer
    • Bondhugula et al. A practical automatic polyhedral parallelizer and locality optimizer. SIGPLAN Not., 43(6):101-113, 2008.
    • (2008) SIGPLAN Not. , vol.43 , Issue.6 , pp. 101-113
    • Bondhugula, U.1
  • 9
    • 0348209599
    • A fast Fourier transform compiler
    • Matteo Frigo. A fast Fourier transform compiler. SIGPLAN Not., 34(5):169-180, 1999.
    • (1999) SIGPLAN Not. , vol.34 , Issue.5 , pp. 169-180
    • Frigo, M.1
  • 11
    • 77953968816
    • GreenFlash. http://www.lbl.gov/CS/html/greenflash.html.
    • GreenFlash
  • 12
    • 0000631097
    • Numerical integration of the shallow-water equations on a twisted icosahedral grid. Part I: Basic design and results of tests
    • R. Heikes and D.A. Randall. Numerical integration of the shallow-water equations on a twisted icosahedral grid. Part I: Basic design and results of tests. Mon. Wea. Rev., 123:1862-1880, 1995.
    • (1995) Mon. Wea. Rev. , vol.123 , pp. 1862-1880
    • Heikes, R.1    Randall, D.A.2
  • 13
    • 0024903997
    • Evaluating associativity in CPU caches
    • M. D. Hill and A. J. Smith. Evaluating Associativity in CPU Caches. IEEE Trans. Comput., 38(12):1612-1630, 1989.
    • (1989) IEEE Trans. Comput. , vol.38 , Issue.12 , pp. 1612-1630
    • Hill, M.D.1    Smith, A.J.2
  • 14
    • 78249252490
    • A generalized framework for auto-tuning stencil computations
    • S. Kamil, C. Chan, S. Williams, et al. A generalized framework for auto-tuning stencil computations. In Cray User Group, 2009.
    • (2009) Cray User Group
    • Kamil, S.1    Chan, C.2    Williams, S.3
  • 20
    • 19344368072
    • SPIRAL: Code generation for DSP transforms
    • M. Puschel, J. Moura, J. Johnson, et al. SPIRAL: Code generation for DSP transforms. Proceedings of the IEEE, special issue on "Program Generation, Optimization, and Adaptation", 93(2):232-275, 2005.
    • (2005) Proceedings of the IEEE, special issue on "Program Generation, Optimization, and Adaptation" , vol.93 , Issue.2 , pp. 232-275
    • Puschel, M.1    Moura, J.2    Johnson, J.3
  • 21
  • 25
    • 0343462141
    • Automated Empirical Optimization of Software and the ATLAS project
    • R. C. Whaley, A. Petitet, and J. Dongarra. Automated Empirical Optimization of Software and the ATLAS project. Parallel Computing, 27(1-2):3-35, 2001.
    • (2001) Parallel Computing , vol.27 , Issue.1-2 , pp. 3-35
    • Whaley, R.C.1    Petitet, A.2    Dongarra, J.3
  • 26
    • 67650797544
    • Roofline: An insightful visual performance model for floating-point programs and multicore architectures
    • April 2009
    • S. Williams, A. Waterman, and D. Patterson. Roofline: An insightful visual performance model for floating-point programs and multicore architectures. Communications of the ACM, April 2009.
    • (2009) Communications of the ACM
    • Williams, S.1    Waterman, A.2    Patterson, D.3


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.