메뉴 건너뛰기




Volumn 21, Issue 1, 2009, Pages 25-39

Writing productive stencil codes with overlapped tiling

Author keywords

Overlapped tiling; Productivity; Shadow regions; Stencil computations; Tiles

Indexed keywords

PRODUCTIVITY; TILE;

EID: 58149239639     PISSN: 15320626     EISSN: 15320634     Source Type: Journal    
DOI: 10.1002/cpe.1340     Document Type: Article
Times cited : (18)

References (23)
  • 1
    • 84945709131 scopus 로고
    • Organizing matrices and matrix operations for paged memory systems
    • McKellar AC, Coffman JEG. Organizing matrices and matrix operations for paged memory systems. Communications of the ACM 1969; 12(3):153-165.
    • (1969) Communications of the ACM , vol.12 , Issue.3 , pp. 153-165
    • McKellar, A.C.1    Coffman, J.E.G.2
  • 3
    • 38149030197 scopus 로고    scopus 로고
    • Design and use of htalib-A library for hierarchically tiled arrays
    • Proceedings of the International Workshop on Languages and Compilers for Parallel Computing, New Orleans, LA, U.S.A, November
    • Bikshandi G, Guo J, von. Praun C, Tañase G, Fragüela BB, Garzarán MJ, Padua D, Rauchwerger L. Design and use of htalib-A library for hierarchically tiled arrays. Proceedings of the International Workshop on Languages and Compilers for Parallel Computing, Lecture Notes in Computer Science (LNCS), vol. 4382, New Orleans, LA, U.S.A., November 2006; 17-32.
    • (2006) Lecture Notes in Computer Science (LNCS , vol.4382 , pp. 17-32
    • Bikshandi, G.1    Guo, J.2    von3    Praun, C.4    Tañase, G.5    Fragüela, B.B.6    Garzarán, M.J.7    Padua, D.8    Rauchwerger, L.9
  • 5
    • 58149248749 scopus 로고    scopus 로고
    • NAS Parallel Benchmarks. Website, 22 September 2007
    • NAS Parallel Benchmarks. Website. http://www.nas.nasa.gov/Software/NPB/ [22 September 2007].
    • Software/NPB
  • 8
    • 0026306973 scopus 로고
    • Compiler optimizations for Fortran D on MIMD distributed-memory machines
    • ACM Press: New York
    • Hiranandani S, Kennedy K, Tseng C-W. Compiler optimizations for Fortran D on MIMD distributed-memory machines. Proceedings of Supercomputing '91. ACM Press: New York, 1991; 86-100.
    • (1991) Proceedings of Supercomputing '91 , pp. 86-100
    • Hiranandani, S.1    Kennedy, K.2    Tseng, C.-W.3
  • 9
    • 58149271299 scopus 로고    scopus 로고
    • Parallel programming with hierarchically tiled arrays. PhD Thesis
    • Bikshandi G. Parallel programming with hierarchically tiled arrays. PhD Thesis, 2007.
    • (2007)
    • Bikshandi, G.1
  • 10
    • 0002081678 scopus 로고    scopus 로고
    • Co-array Fortran for parallel programming
    • Numrich RW, Reid J. Co-array Fortran for parallel programming. SIGPLAN Fortran Forum 1998; 17(2):1-31.
    • (1998) SIGPLAN Fortran Forum , vol.17 , Issue.2 , pp. 1-31
    • Numrich, R.W.1    Reid, J.2
  • 13
    • 84976827033 scopus 로고
    • A data locality optimizing algorithm
    • Toronto, Ontario, Canada
    • Wolf ME, Lam MS. A data locality optimizing algorithm. Proceedings of PLDI'91, Toronto, Ontario, Canada, 1991; 30-44.
    • (1991) Proceedings of PLDI'91 , pp. 30-44
    • Wolf, M.E.1    Lam, M.S.2
  • 15
    • 33845574641 scopus 로고    scopus 로고
    • Tiling optimizations for 3d scientific computations
    • Dallas, TX, U.S.A
    • Rivera G, Tseng C-W. Tiling optimizations for 3d scientific computations. Proceedings of Supercomputing '00, Dallas, TX, U.S.A., 2000; 32.
    • (2000) Proceedings of Supercomputing '00 , pp. 32
    • Rivera, G.1    Tseng, C.-W.2
  • 17
    • 26444503508 scopus 로고
    • An overview of high performance Fortran
    • Koelbel C, Mehrotra P. An overview of high performance Fortran. SIGPLAN Fortran Forum 1992; 11(4):9-16.
    • (1992) SIGPLAN Fortran Forum , vol.11 , Issue.4 , pp. 9-16
    • Koelbel, C.1    Mehrotra, P.2
  • 19
    • 58149221207 scopus 로고    scopus 로고
    • Introduction to UPC and language specification. Technical Report CCS-TR-99-157, IDA Center for Computing Sciences
    • Carlson W, Draper J, Culler D, Yelick. K, Brooks E, Warren K. Introduction to UPC and language specification. Technical Report CCS-TR-99-157, IDA Center for Computing Sciences, 1999.
    • (1999)
    • Carlson, W.1    Draper, J.2    Culler, D.3    Yelick, K.4    Brooks, E.5    Warren, K.6
  • 20
    • 35448977691 scopus 로고    scopus 로고
    • Program analysis of overlap area usage in self-similar parallel programs
    • Minneapolis, MN, U.S.A
    • Sawdey A, O'Keefe M. Program analysis of overlap area usage in self-similar parallel programs. Proceedings of LCPC, Minneapolis, MN, U.S.A., 1997; 79-93.
    • (1997) Proceedings of LCPC , pp. 79-93
    • Sawdey, A.1    O'Keefe, M.2
  • 21
    • 0003302104 scopus 로고    scopus 로고
    • High performance Fortran compilation techniques for parallelizing scientific codes
    • IEEE Computer Society: Silver Spring, MD
    • Adve V, Jin G, Mellor-Crummey J, Yi Q. High performance Fortran compilation techniques for parallelizing scientific codes. Proceedings of Supercomputing '98. IEEE Computer Society: Silver Spring, MD, 1998; 1-23.
    • (1998) Proceedings of Supercomputing '98 , pp. 1-23
    • Adve, V.1    Jin, G.2    Mellor-Crummey, J.3    Yi, Q.4
  • 23
    • 0141982363 scopus 로고    scopus 로고
    • The Global Arrays User's Manual
    • Technical Report Number PNNL-13130. Pacific Northwest National Laboratory
    • Nieplocha J, Krishnan M, Palmer B, Tipparaju V, Ju J. The Global Arrays User's Manual. Technical Report Number PNNL-13130. Pacific Northwest National Laboratory, 2006.
    • (2006)
    • Nieplocha, J.1    Krishnan, M.2    Palmer, B.3    Tipparaju, V.4    Ju, J.5


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.