메뉴 건너뛰기




Volumn , Issue , 2007, Pages 235-244

Effective automatic parallelization of stencil computations

Author keywords

Automatic parallelization; Load; Stencil computations; Tiling

Indexed keywords

AUTOMATIC PARALLELIZATION; STENCIL COMPUTATIONS; TILING;

EID: 35448944792     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1145/1250734.1250761     Document Type: Conference Paper
Times cited : (133)

References (29)
  • 1
    • 0003302104 scopus 로고    scopus 로고
    • High performance fortran compilation techniques for parallelizing scientific codes
    • V. Adve, G. Jin, J. Mellor-Crummey, and Q. Yi. High performance fortran compilation techniques for parallelizing scientific codes. In Proceedings of Supercomputing '98, pages 1-23, 1998.
    • (1998) Proceedings of Supercomputing '98 , pp. 1-23
    • Adve, V.1    Jin, G.2    Mellor-Crummey, J.3    Yi, Q.4
  • 2
    • 0033700781 scopus 로고    scopus 로고
    • Synthesizing transformations for locality enhancement of imperfectly nested loops
    • N. Ahmed, N. Mateev, and K. Pingali. Synthesizing transformations for locality enhancement of imperfectly nested loops. In Proceedings of ACMICS 2000, pages 141-152, 2000.
    • (2000) Proceedings of ACMICS , pp. 141-152
    • Ahmed, N.1    Mateev, N.2    Pingali, K.3
  • 4
    • 10844259103 scopus 로고    scopus 로고
    • Synthesizing transformations for locality enhancement of imperfectly-nested loop nests
    • Oct
    • N. Ahmed, N. Mateev, and K. Pingali. Synthesizing transformations for locality enhancement of imperfectly-nested loop nests. International Journal of Parallel Programming, 29(5), Oct. 2001.
    • (2001) International Journal of Parallel Programming , vol.29 , Issue.5
    • Ahmed, N.1    Mateev, N.2    Pingali, K.3
  • 8
    • 84976745804 scopus 로고
    • Tile size selection using cache organization and data layout
    • S. Coleman and K. S. McKinley. Tile size selection using cache organization and data layout. In Proceedings of PLDI '95, pages 279-290, 1995.
    • (1995) Proceedings of PLDI '95 , pp. 279-290
    • Coleman, S.1    McKinley, K.S.2
  • 10
    • 35448960379 scopus 로고    scopus 로고
    • The memory behavior of cache oblivious stencil computations
    • M. Frigo and V. Strumpen. The memory behavior of cache oblivious stencil computations. J. of Supercomputing, 2006.
    • (2006) J. of Supercomputing
    • Frigo, M.1    Strumpen, V.2
  • 11
    • 0034830737 scopus 로고    scopus 로고
    • On tiling space-time mapped loop nests
    • M. Griebl. On tiling space-time mapped loop nests. In Proceedings of SPAA '01, pages 322-323, 2001.
    • (2001) Proceedings of SPAA '01 , pp. 322-323
    • Griebl, M.1
  • 12
    • 35448957255 scopus 로고    scopus 로고
    • Automatic Parallelization of Loop Programs for Distributed Memory Architectures. University of Passau, Habilitation Thesis
    • M. Griebl. Automatic Parallelization of Loop Programs for Distributed Memory Architectures. University of Passau, 2004. Habilitation Thesis.
    • (2004)
    • Griebl, M.1
  • 14
    • 0036958653 scopus 로고    scopus 로고
    • On time optimal supernode shape
    • E. Hodzic and W. Shang. On time optimal supernode shape. IEEE Trans. Par. & Dist. Sys., 13(12): 1220-1233, 2002.
    • (2002) IEEE Trans. Par. & Dist. Sys , vol.13 , Issue.12 , pp. 1220-1233
    • Hodzic, E.1    Shang, W.2
  • 19
    • 84958661690 scopus 로고    scopus 로고
    • Impact of modern memory subsystems on cache optimizations for stencil computations
    • S. Kamil, P. Husbands, L. Oliker, J. Shalf, and K. Yelick. Impact of modern memory subsystems on cache optimizations for stencil computations. In Proceedings of MSP '05, pages 36-43, 2005.
    • (2005) Proceedings of MSP '05 , pp. 36-43
    • Kamil, S.1    Husbands, P.2    Oliker, L.3    Shalf, J.4    Yelick, K.5
  • 21
    • 0026274706 scopus 로고
    • Tiling multidimensional iteration spaces for nonshared memory machines
    • J. Ramanujam and P. Sadayappan. Tiling multidimensional iteration spaces for nonshared memory machines. In Proceedings of Supercomputing '91, pages 111-120, 1991.
    • (1991) Proceedings of Supercomputing '91 , pp. 111-120
    • Ramanujam, J.1    Sadayappan, P.2
  • 22
    • 84934300040 scopus 로고    scopus 로고
    • A geometric programming framework for optimal multi-level tiling
    • L. Renganarayana and S. Rajopadhye. A geometric programming framework for optimal multi-level tiling. In Proceedings of SC '04, page 18, 2004.
    • (2004) Proceedings of SC '04 , pp. 18
    • Renganarayana, L.1    Rajopadhye, S.2
  • 24
    • 84957885078 scopus 로고    scopus 로고
    • Program analysis of overlap area usage in self-similar parallel programs
    • A. Sawdey and M. T. O'Keefe. Program analysis of overlap area usage in self-similar parallel programs. In Proceedings of LCPC '97, pages 79-93, 1998.
    • (1998) Proceedings of LCPC '97 , pp. 79-93
    • Sawdey, A.1    O'Keefe, M.T.2
  • 25
    • 0003929457 scopus 로고
    • Automatic blocking of nested loops
    • Technical report, University of Tennessee, Knoxville, TN, Aug
    • R. Schreiber and J. Dongarra. Automatic blocking of nested loops. Technical report, University of Tennessee, Knoxville, TN, Aug. 1990.
    • (1990)
    • Schreiber, R.1    Dongarra, J.2
  • 26
    • 0032635362 scopus 로고    scopus 로고
    • New tiling techniques to improve cache temporal locality
    • Y. Song and Z. Li. New tiling techniques to improve cache temporal locality. In Proceedings of PLDI "99, pages 215-228, 1999.
    • (1999) Proceedings of PLDI 99 , pp. 215-228
    • Song, Y.1    Li, Z.2
  • 28
    • 84976827033 scopus 로고
    • A data locality optimizing algorithm
    • M. E. Wolf and M. S. Lam. A data locality optimizing algorithm. In Proceedings of PLDI '91, pages 30-44, 1991.
    • (1991) Proceedings of PLDI '91 , pp. 30-44
    • Wolf, M.E.1    Lam, M.S.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.