메뉴 건너뛰기




Volumn , Issue , 2007, Pages

Towards optimal multi-level tiling for stencil computations

Author keywords

[No Author keywords available]

Indexed keywords

OPTIMIZATION; PROGRAM PROCESSORS;

EID: 34548752231     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/IPDPS.2007.370291     Document Type: Conference Paper
Times cited : (36)

References (32)
  • 4
    • 29244443735 scopus 로고    scopus 로고
    • M. Bromley, S. Heller, T. McNerney, and J. Guy L. Steele. Fortran at ten Gigaflops: the connection machine convolution compiler. In PLDI '91: Proceedings of the ACM SIGPLAN 1991 conference on Programming language design and implementation, 1991.
    • M. Bromley, S. Heller, T. McNerney, and J. Guy L. Steele. Fortran at ten Gigaflops: the connection machine convolution compiler. In PLDI '91: Proceedings of the ACM SIGPLAN 1991 conference on Programming language design and implementation, 1991.
  • 5
    • 0000209582 scopus 로고
    • Regular partitioning for synthesizing fixed-size systolic arrays
    • A. Darte. Regular partitioning for synthesizing fixed-size systolic arrays. Integration, The VLSI J., 12(3):293-304, 1991.
    • (1991) Integration, The VLSI J , vol.12 , Issue.3 , pp. 293-304
    • Darte, A.1
  • 6
    • 4243166952 scopus 로고    scopus 로고
    • Tight bounds on cache use for stencil operations on rectangular grids
    • M. A. Frumkin and R. F. V. der Wijngaart. Tight bounds on cache use for stencil operations on rectangular grids. J. ACM, 49(3):434-453, 2002.
    • (2002) J. ACM , vol.49 , Issue.3 , pp. 434-453
    • Frumkin, M.A.1    der Wijngaart, R.F.V.2
  • 7
    • 0023379612 scopus 로고
    • Solving pdes on loosely-coupled parallel processors
    • W. D. Gropp. Solving pdes on loosely-coupled parallel processors. Parallel Computing, 5(1-2):165-173, 1987.
    • (1987) Parallel Computing , vol.5 , Issue.1-2 , pp. 165-173
    • Gropp, W.D.1
  • 12
    • 0001512318 scopus 로고
    • The organization of computations for uniform recurrence equations
    • R. M. Karp, R. E. Miller, and S. Winograd. The organization of computations for uniform recurrence equations. J. ACM, 14(3):563-590, 1967.
    • (1967) J. ACM , vol.14 , Issue.3 , pp. 563-590
    • Karp, R.M.1    Miller, R.E.2    Winograd, S.3
  • 13
    • 0043048462 scopus 로고    scopus 로고
    • An infeasible interiorpoint algorithm for solving primal and dual geometric programs
    • K. O. Kortanek, X. Xu, and Y. Ye. An infeasible interiorpoint algorithm for solving primal and dual geometric programs. Math. Program., 76(1):155-181, 1997.
    • (1997) Math. Program , vol.76 , Issue.1 , pp. 155-181
    • Kortanek, K.O.1    Xu, X.2    Ye, Y.3
  • 14
    • 24644456455 scopus 로고    scopus 로고
    • Automatic tiling of iterative stencil loops
    • Z. Li and Y. Song. Automatic tiling of iterative stencil loops. ACM Trans. Program. Lang. Syst., 26(6):975-1028, 2004.
    • (2004) ACM Trans. Program. Lang. Syst , vol.26 , Issue.6 , pp. 975-1028
    • Li, Z.1    Song, Y.2
  • 15
    • 20344396845 scopus 로고    scopus 로고
    • YALMIP : A toolbox for modeling and optimization in MATLAB
    • J. Löfberg. YALMIP : A toolbox for modeling and optimization in MATLAB. In Proceedings of the CACSD Conference, 2004.
    • (2004) Proceedings of the CACSD Conference
    • Löfberg, J.1
  • 17
    • 0022482205 scopus 로고
    • Partitioning and mapping algorithms into fixed size systolic arrays
    • 351, 12
    • D. I. Moldovan and J. A. B. Fortes. Partitioning and mapping algorithms into fixed size systolic arrays. IEEE Trans. Comput., 35(1)--12, 1986.
    • (1986) IEEE Trans. Comput
    • Moldovan, D.I.1    Fortes, J.A.B.2
  • 18
    • 2442670256 scopus 로고    scopus 로고
    • Available from
    • NAS Parallel Benchmarks. Available from http://www.netlib.org/parkbench/.
    • NAS Parallel Benchmarks
  • 19
    • 34548743372 scopus 로고    scopus 로고
    • PARKBENCH:, Available from
    • PARKBENCH: PARallel Kernels and BENCHmarks. Available from http://www.netlib.org/parkbench/.
    • PARallel Kernels and BENCHmarks
  • 20
    • 51249173427 scopus 로고
    • The mapping of linear recurrence equations on regular arrays
    • P. Quinton and V. Van Dongen. The mapping of linear recurrence equations on regular arrays. Journal of VLSI Signal Processing, 1(2):95-113, 1989.
    • (1989) Journal of VLSI Signal Processing , vol.1 , Issue.2 , pp. 95-113
    • Quinton, P.1    Van Dongen, V.2
  • 21
    • 0025446495 scopus 로고
    • Synthesizing systolic arrays from recurrence equations
    • June
    • S. V. Rajopadhye and R. M. Fujimoto. Synthesizing systolic arrays from recurrence equations. Parallel Computing, 14:163-189, June 1990.
    • (1990) Parallel Computing , vol.14 , pp. 163-189
    • Rajopadhye, S.V.1    Fujimoto, R.M.2
  • 30
    • 1542392248 scopus 로고    scopus 로고
    • Achieving scalable locality with time skewing
    • D. Wonnacott. Achieving scalable locality with time skewing. Int. J. Parallel Program., 30(3):181-221, 2002.
    • (2002) Int. J. Parallel Program , vol.30 , Issue.3 , pp. 181-221
    • Wonnacott, D.1
  • 31
    • 0000703719 scopus 로고    scopus 로고
    • On tiling as a loop transformation
    • J. Xue. On tiling as a loop transformation. Parallel Processing Letters, 7(4):409-424, 1997.
    • (1997) Parallel Processing Letters , vol.7 , Issue.4 , pp. 409-424
    • Xue, J.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.