메뉴 건너뛰기




Volumn 20, Issue 4, 2010, Pages 359-376

Leveraging shared caches for parallel temporal blocking of stencil codes on multicore processors and clusters

Author keywords

multi halo exchange; multicore; shared caches; stencil algorithm; temporal blocking

Indexed keywords

MULTI CORE; MULTI-HALO EXCHANGE; SHARED CACHE; STENCIL ALGORITHM; TEMPORAL BLOCKING;

EID: 78650871519     PISSN: 01296264     EISSN: None     Source Type: Journal    
DOI: 10.1142/S0129626410000296     Document Type: Conference Paper
Times cited : (15)

References (14)
  • 1
    • 56749108087 scopus 로고    scopus 로고
    • 10 Unknowns the Largest Finite El- ement System that Can Be Solved Today?
    • ACM/IEEE (Ed.)(Supercomputing Conference '05, Seattle, Nov 12-18, 2005)
    • B. Bergen, F. Hülsemann, U. Rüde: Is 1.7×1010 Unknowns the Largest Finite El- ement System that Can Be Solved Today? In: ACM/IEEE (Ed.): Proceedings of the ACM/IEEE SC 2005 Conference (Supercomputing Conference '05, Seattle, Nov 12-18, 2005).
    • (2005) Proceedings of the ACM/IEEE SC Conference
    • Bergen, B.1    Hülsemann, F.2    Rüde, U.3
  • 3
    • 59749100826 scopus 로고    scopus 로고
    • Optimization and performance modeling of stencil computa- tions on modern microprocessors
    • DOI: 10.1137/070693199
    • K. Datta et al.: Optimization and performance modeling of stencil computa- tions on modern microprocessors. SIAM Review 51(1), (2009) 129-159. DOI: 10.1137/070693199
    • (2009) SIAM Review , vol.51 , Issue.1 , pp. 129-159
    • Datta, K.1
  • 4
    • 70449657442 scopus 로고    scopus 로고
    • Efficient temporal blocking for stencil computations by multicore-aware wavefront parallelization
    • DOI: 10.1109/COMPSAC.2009.82
    • G.Wellein, G. Hager, T. Zeiser, M.Wittmann, H. Fehske: Efficient temporal blocking for stencil computations by multicore-aware wavefront parallelization. Proc. COMP- SAC 2009. DOI: 10.1109/COMPSAC.2009.82
    • (2009) Proc. COMPSAC
    • Wellein, G.1    Hager, G.2    Zeiser, T.3    Wittmann, M.4    Fehske, H.5
  • 7
    • 84858693885 scopus 로고    scopus 로고
    • Increasing temporal locality with skewing and recursive blocking
    • DOI: 10.1145/582034.582077
    • G. Jin, J. Mellor-Crummey, R. Fowler: Increasing temporal locality with skewing and recursive blocking. Proc. SC2001. DOI: 10.1145/582034.582077
    • (2001) Proc. SC
    • Jin, G.1    Mellor-Crummey, J.2    Fowler, R.3
  • 11
    • 56349170328 scopus 로고    scopus 로고
    • Introducing a parallel cache oblivious blocking approach for the lattice Boltzmann method
    • T. Zeiser, G. Wellein, A. Nitsure, K. Iglberger, U. Rüde, G. Hager: Introducing a parallel cache oblivious blocking approach for the lattice Boltzmann method. Progress in CFD, vol. 8, No. 1-4, pp. 179-188, 2008.
    • (2008) Progress in CFD , vol.8 , Issue.1-4 , pp. 179-188
    • Zeiser, T.1    Wellein, G.2    Nitsure, A.3    Iglberger, K.4    Rüde, U.5    Hager, G.6
  • 13
    • 77954049274 scopus 로고    scopus 로고
    • A ghost cell expansion method for reducing communications in solving PDE problems
    • DOI: 10.1145/582034.582084
    • C. Ding, Y. He: A ghost cell expansion method for reducing communications in solving PDE problems. Proc. SC2001. DOI: 10.1145/582034.582084
    • Proc. SC2001
    • Ding, C.1    He, Y.2
  • 14
    • 77954074485 scopus 로고    scopus 로고
    • Potentials of temporal blocking for stencil-based computations on multi-core systems
    • March
    • M. Wittmann: Potentials of temporal blocking for stencil-based computations on multi-core systems. Master's Thesis, University of Applied Sciences Nuremberg, March 2009. http://www.hpc.rrze.uni-erlangen.de/Projekte/ stencil.shtml
    • (2009) Master's Thesis,University of Applied Sciences Nuremberg
    • Wittmann, M.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.