Volume 26, Issue 6, 2004, Pages 975-1028

Automatic tiling of iterative stencil loops

Author keywords

Caches; Loop transformations; Optimizing compilers

Indexed keywords

CACHES; ITERATIVE STENCIL LOOPS; LOOP TRANSFORMATIONS; OPTIMIZING COMPILERS

EID: 24644456455     PISSN: 01640925     EISSN: None     Source Type: Journal    
DOI: 10.1145/1034774.1034777     Document Type: Article
Times cited: 56

References (57)
  • 5
    • 0023438847
    • Automatic translation of FORTRAN programs to vector form
    • ALLEN, J. R. AND KENNEDY, K. 1987. Automatic translation of FORTRAN programs to vector form. ACM Trans. Programm. Lang. Syst. 9, 4 (Oct.), 491-542.
    • (1987) ACM Trans. Programm. Lang. Syst. , vol.9 , Issue.4 OCT. , pp. 491-542
    • Allen, J.R.1    Kennedy, K.2
  • 8
    • 1242313972
    • A compiler framework for restructuring data declarations to enhance cache and TLB effectiveness
    • Toronto, Ont., Canada
    • BACON, D., CHOW, J.-H., JU, D., MUTHUKUMAR, K., AND SARKAR, V. 1994. A compiler framework for restructuring data declarations to enhance cache and TLB effectiveness. In Proceedings of CASCON'94 (Toronto, Ont., Canada).
    • (1994) Proceedings of CASCON'94
    • Bacon, D.1    Chow, J.-H.2    Ju, D.3    Muthukumar, K.4    Sarkar, V.5
  • 9
    • 0032313172
    • Non-linear and symbolic data dependence testing
    • BLUME, W. AND EIGENMANN, R. 1998. Non-linear and symbolic data dependence testing. IEEE Trans. Parall. Distrib. Syst. 9, 12 (Dec.), 1180-1194.
    • (1998) IEEE Trans. Parall. Distrib. Syst. , vol.9 , Issue.12 DEC. , pp. 1180-1194
    • Blume, W.1    Eigenmann, R.2
  • 10
    • 0032648736
    • Static tiling for heterogeneous computing platforms
    • BOULET, P., DONGARRA, J., ROBERT, Y., AND VIVIEN, F. 1999. Static tiling for heterogeneous computing platforms. Parall. Comput. 25, 547-568.
    • (1999) Parall. Comput. , vol.25 , pp. 547-568
    • Boulet, P.1    Dongarra, J.2    Robert, Y.3    Vivien, F.4
  • 30
    • 0001465739
    • Maximizing loop parallelism and improving data locality via loop fusion and distribution
    • Portland, OR, Aug. 1993. Lecture Notes in Computer Science, Springer-Verlag, Berlin, Germany
    • KENNEDY, K. AND MCKINLEY, K. S. 1993. Maximizing loop parallelism and improving data locality via loop fusion and distribution. In Proceedings of the Sixth Workshop on Languages and Compilers for Parallel Computing (Portland, OR, Aug. 1993). Lecture Notes in Computer Science, vol. 768, Springer-Verlag, Berlin, Germany.
    • (1993) Proceedings of the Sixth Workshop on Languages and Compilers for Parallel Computing , vol.768
    • Kennedy, K.1    McKinley, K.S.2
  • 35
    • 3142754802
    • Smallest-last ordering and clustering and graph coloring algorithms
    • Department of Computer Science and Engineering, Southern Methodist University, Dallas, TX
    • MATULA, D. AND BECK, L. 1981. Smallest-last ordering and clustering and graph coloring algorithms. Tech. rep. TR CSE 8104. Department of Computer Science and Engineering, Southern Methodist University, Dallas, TX.
    • (1981) Tech. Rep. , vol.TR CSE 8104
    • Matula, D.1    Beck, L.2
  • 36
  • 37
    • 0032064896
    • Interprocedural analysis for loop scheduling and data allocation
    • NGUYEN, T. AND LI, Z. 1998. Interprocedural analysis for loop scheduling and data allocation. Parall. Comput. 24, 3, 477-504.
    • (1998) Parall. Comput. , vol.24 , Issue.3 , pp. 477-504
    • Nguyen, T.1    Li, Z.2
  • 40
    • 0033076195
    • Augmenting loop tiling with data alignment for improved cache performance
    • PANDA, P., NAKAMURA, H., DUTT, N., AND NICOLAU, A. 1999. Augmenting loop tiling with data alignment for improved cache performance. IEEE Trans. Comput. 48, 2 (Feb.), 142-149.
    • (1999) IEEE Trans. Comput. , vol.48 , Issue.2 FEB. , pp. 142-149
    • Panda, P.1    Nakamura, H.2    Dutt, N.3    Nicolau, A.4
  • 42
    • 84976676720
    • A practical algorithm for exact array dependence analysis
    • PUGH, W. 1992. A practical algorithm for exact array dependence analysis. Commun. ACM 35, 8 (Aug.), 102-114.
    • (1992) Commun. ACM , vol.35 , Issue.8 AUG. , pp. 102-114
    • Pugh, W.1
  • 44
    • 17244382508
    • Exploiting monotone convergence functions in parallel programs
    • University of Maryland, College Park, MD
    • PUGH, W., ROSSER, E., AND SHPEISMAN, T. 1996. Exploiting monotone convergence functions in parallel programs. Tech. rep. CS-TR-3636. University of Maryland, College Park, MD.
    • (1996) Tech. Rep. , vol.CS-TR-3636
    • Pugh, W.1    Rosser, E.2    Shpeisman, T.3
  • 47
    • 0005045396
    • Ph.D. dissertation. Department of Computer Science, University of Maryland at College Park, MD
    • ROSSER, E. 1998. Fine-grained analysis of array computations. Ph.D. dissertation. Department of Computer Science, University of Maryland at College Park, MD.
    • (1998) Fine-grained Analysis of Array Computations
    • Rosser, E.1
  • 54
    • 0003553286
    • Ph.D. dissertation. Department of Computer Science, Stanford University, Stanford, CA
    • WOLF, M. 1992. Improving locality and parallelism in nested loops. Ph.D. dissertation. Department of Computer Science, Stanford University, Stanford, CA.
    • (1992) Improving Locality and Parallelism in Nested Loops
    • Wolf, M.1
  • 56
    • 1542392248
    • Achieving scalable locality with time skewing
    • WONNACOTT, D. 2002. Achieving scalable locality with time skewing. Int. J. Parall. Programm. 30, 3 (June), 181-221.
    • (2002) Int. J. Parall. Programm. , vol.30 , Issue.3 JUNE , pp. 181-221
    • Wonnacott, D.1
  • 57
    • 0442303278
    • Kluwer Academic Publishers, Dordrecht, The Netherlands
    • XUE, J. 2000. Loop Tiling for Parallelism. Kluwer Academic Publishers, Dordrecht, The Netherlands.
    • (2000) Loop Tiling for Parallelism
    • Xue, J.1


* This information was extracted by KISTI through analysis of Elsevier's SCOPUS database.