메뉴 건너뛰기




Volumn 2, Issue 2, 2011, Pages 130-137

Efficient multicore-aware parallelization strategies for iterative stencil computations

Author keywords

Multicore; Simultaneous multi threading; Spatial blocking; Stencil computations; Temporal blocking; Wavefront parallelization

Indexed keywords

MULTI CORE; SIMULTANEOUS MULTI-THREADING; SPATIAL BLOCKING; STENCIL COMPUTATIONS; TEMPORAL BLOCKING; WAVEFRONT PARALLELIZATION;

EID: 79958773431     PISSN: 18777503     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.jocs.2011.01.010     Document Type: Article
Times cited : (33)

References (14)
  • 4
    • 59749100826 scopus 로고    scopus 로고
    • Optimization and performance modeling of stencil computations on modern microprocessors
    • Datta K., Kamil S., Williams S., Oliker L., Shalf J., Yelick: K. Optimization and performance modeling of stencil computations on modern microprocessors. SIAM Rev. 2009, 51(1):129-159.
    • (2009) SIAM Rev. , vol.51 , Issue.1 , pp. 129-159
    • Datta, K.1    Kamil, S.2    Williams, S.3    Oliker, L.4    Shalf, J.5    Yelick, K.6
  • 6
    • 79958773085 scopus 로고    scopus 로고
    • Efficiency Improvements of Iterative Numerical Algorithms on Modern Architectures. Ph.D. Thesis, July, URN: urn:nbn:de:bvb:29-opus-14036.
    • J. Treibig, Efficiency Improvements of Iterative Numerical Algorithms on Modern Architectures. Ph.D. Thesis, July 2009, URN: urn:nbn:de:bvb:29-opus-14036.
    • (2009)
    • Treibig, J.1
  • 8
    • 56349170328 scopus 로고    scopus 로고
    • Introducing a parallel cache oblivious blocking approach for the lattice Boltzmann method
    • Zeiser T., Wellein G., Nitsure A., Iglberger K., Rüde U., Hager: G. Introducing a parallel cache oblivious blocking approach for the lattice Boltzmann method. Prog. CFD 2008, 8(1-4):179-188.
    • (2008) Prog. CFD , vol.8 , Issue.1-4 , pp. 179-188
    • Zeiser, T.1    Wellein, G.2    Nitsure, A.3    Iglberger, K.4    Rüde, U.5    Hager, G.6
  • 9
    • 70449657442 scopus 로고    scopus 로고
    • Efficient temporal blocking for stencil computations by multicore-aware wavefront parallelization
    • Wellein G., Hager G., Zeiser T., Wittmann M., Fehske H. Efficient temporal blocking for stencil computations by multicore-aware wavefront parallelization. Proc. COMPSAC 2009 2009, 10.1109/COMPSAC.1.2009.82.
    • (2009) Proc. COMPSAC 2009
    • Wellein, G.1    Hager, G.2    Zeiser, T.3    Wittmann, M.4    Fehske, H.5
  • 10
  • 11
    • 79958765147 scopus 로고    scopus 로고
    • STREAM: Sustainable Memory Bandwidth in High Performance Computers.
    • J.D. McCalpin, STREAM: Sustainable Memory Bandwidth in High Performance Computers. http://www.cs.virginia.edu/stream.
    • McCalpin, J.D.1
  • 13
    • 78649844813 scopus 로고    scopus 로고
    • LIKWID: a lightweight performance-oriented tool suite for x86 multicore environments, PSTI2010, the First International Workshop on Parallel Software Tools and Tool Infrastructures, San Diego CA, September 13, arXiv:1004.4431, in press. doi:10.1109/ICPPW.2010.38
    • J. Treibig, G. Hager, G. Wellein, LIKWID: a lightweight performance-oriented tool suite for x86 multicore environments, PSTI2010, the First International Workshop on Parallel Software Tools and Tool Infrastructures, San Diego CA, September 13, 2010. arXiv:1004.4431, in press. doi:10.1109/ICPPW.2010.38.
    • Treibig, J.1    Hager, G.2    Wellein, G.3
  • 14
    • 78650871519 scopus 로고    scopus 로고
    • Leveraging shared caches for parallel temporal blocking of stencil codes on multicore processors and clusters
    • Wittmann M., Hager G., Treibig J., Wellein G. Leveraging shared caches for parallel temporal blocking of stencil codes on multicore processors and clusters. Parallel Processing Letters 2010, 20(4):359-376.
    • (2010) Parallel Processing Letters , vol.20 , Issue.4 , pp. 359-376
    • Wittmann, M.1    Hager, G.2    Treibig, J.3    Wellein, G.4


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.