메뉴 건너뛰기




Volumn 26, Issue 6, 1998, Pages 641-670

Quantifying the Multi-Level Nature of Tiling Interactions

Author keywords

Compiler; Locality; Memory hierarchy; Parallelism; Tiling

Indexed keywords

COMPUTER ARCHITECTURE; COSTS; DATA STORAGE EQUIPMENT; HIERARCHICAL SYSTEMS; OPTIMIZATION; STORAGE ALLOCATION (COMPUTER);

EID: 0032308685     PISSN: 08857458     EISSN: None     Source Type: Journal    
DOI: 10.1023/A:1018782528453     Document Type: Article
Times cited : (44)

References (32)
  • 2
    • 84964748976 scopus 로고
    • Compiler blockability of numerical algorithms
    • November
    • Steve Carr and Ken Kennedy, Compiler blockability of numerical algorithms, J. Supercomputing, pp. 114-124 (November 1992).
    • (1992) J. Supercomputing , pp. 114-124
    • Carr, S.1    Kennedy, K.2
  • 4
    • 0028549474 scopus 로고
    • Improving the ratio of memory operations to floatingpoint operations in loops
    • November
    • Steve Carr and Ken Kennedy, Improving the ratio of memory operations to floatingpoint operations in loops, Trans. Progr. Lang. Syst. 16(6):1768-1810 (November 1994).
    • (1994) Trans. Progr. Lang. Syst. , vol.16 , Issue.6 , pp. 1768-1810
    • Carr, S.1    Kennedy, K.2
  • 6
    • 0026232450 scopus 로고
    • A loop transformation theory and an algorithm to maximize parallelism
    • Michael E. Wolf and Monica S. Lam, A loop transformation theory and an algorithm to maximize parallelism, IEEE Trans. Parallel Distrib. Syst. 2(4):452-471 (1991).
    • (1991) IEEE Trans. Parallel Distrib. Syst. , vol.2 , Issue.4 , pp. 452-471
    • Wolf, M.E.1    Lam, M.S.2
  • 7
    • 0026933251 scopus 로고
    • Some efficient solutions to the affine scheduling problem, Part I, one-dimensional time
    • October
    • Paul Feautrier, Some efficient solutions to the affine scheduling problem, Part I, one-dimensional time, IJPP 21(5):xx-xx (October 1992).
    • (1992) IJPP , vol.21 , Issue.5
    • Feautrier, P.1
  • 9
    • 2342480327 scopus 로고
    • Unrolling-based optimizations for modulo scheduling
    • December
    • Daniel Lavery and Wen-mei Hwu, Unrolling-based optimizations for modulo scheduling, 28th Int'l. Symp. Microarchit., pp. 126-141 (December 1995).
    • (1995) 28th Int'l. Symp. Microarchit. , pp. 126-141
    • Lavery, D.1    Hwu, W.-M.2
  • 13
    • 0030379246 scopus 로고    scopus 로고
    • Combining loop transformations considering caches and scheduling
    • December
    • Michael E. Wolf, Dror Maydan, and Ding-Kai Chen, Combining loop transformations considering caches and scheduling, 29th Int'l. Symp. Microarchit. (December 1996).
    • (1996) 29th Int'l. Symp. Microarchit.
    • Wolf, M.E.1    Maydan, D.2    Chen, D.-K.3
  • 14
    • 0002433589 scopus 로고
    • Iteration space tiling for memory hierarchies
    • Michael J. Wolfe, Iteration space tiling for memory hierarchies, Parallel Processing for Sci. Comput., pp. 357-361 (1987).
    • (1987) Parallel Processing for Sci. Comput. , pp. 357-361
    • Wolfe, M.J.1
  • 15
    • 0002238004 scopus 로고
    • Tiling multidimensional iteration spaces for nonshared memory machines
    • November
    • J. Ramanujam and P. Sadayappan, Tiling multidimensional iteration spaces for nonshared memory machines, Supercomputing (November 1991).
    • (1991) Supercomputing
    • Ramanujam, J.1    Sadayappan, P.2
  • 16
    • 0022874874 scopus 로고
    • Advanced compiler optimizations for supercomputers
    • December
    • David A. Padua and Michael J. Wolfe, Advanced compiler optimizations for supercomputers, Commun. ACM 29(12):1184-1201 (December 1986).
    • (1986) Commun. ACM , vol.29 , Issue.12 , pp. 1184-1201
    • Padua, D.A.1    Wolfe, M.J.2
  • 17
    • 0001366267 scopus 로고
    • Strategies for cache and local memory management by global program transformation
    • October
    • Dennis Gannon, William Jalby, and Kyle Gallivan, Strategies for cache and local memory management by global program transformation, J. Parallel and Distrib. Comput., Vol. 5, No. 5 (October 1988).
    • (1988) J. Parallel and Distrib. Comput. , vol.5 , Issue.5
    • Gannon, D.1    Jalby, W.2    Gallivan, K.3
  • 19
    • 0024935630 scopus 로고
    • More iteration space tiling
    • Michael J. Wolfe, More iteration space tiling, Supercomputing, pp. 655-664 (1989).
    • (1989) Supercomputing , pp. 655-664
    • Wolfe, M.J.1
  • 20
    • 0026137116 scopus 로고
    • The cache performance and optimizations of blocked algorithms
    • Palo Alto, California April
    • Monica S. Lam, Edward E. Rothberg, and Michael E. Wolf, The cache performance and optimizations of blocked algorithms, ASPLOS-IV , Palo Alto, California (April 1991).
    • (1991) ASPLOS-IV
    • Lam, M.S.1    Rothberg, E.E.2    Wolf, M.E.3
  • 21
  • 24
    • 0003934689 scopus 로고
    • Automatic partitioning of parallel loops and data arrays for distributed shared memory multiprocessors
    • Anant Agarwal, David Kranz, and Venkat Natarajan, Automatic partitioning of parallel loops and data arrays for distributed shared memory multiprocessors, Int'l. Conf. Parallel Computing (1993).
    • (1993) Int'l. Conf. Parallel Computing
    • Agarwal, A.1    Kranz, D.2    Natarajan, V.3
  • 25
    • 84949655044 scopus 로고
    • A general framework for iteration-reordering loop transformations, Technical Summary
    • Vivek Sarkar and Radhika Thekkath, A general framework for iteration-reordering loop transformations, Technical Summary, Progr. Lang. Design and Implementation (1992).
    • (1992) Progr. Lang. Design and Implementation
    • Sarkar, V.1    Thekkath, R.2
  • 26
    • 0029749714 scopus 로고    scopus 로고
    • Combining optimization for cache and instruction-level parallelism
    • Steve Carr, Combining optimization for cache and instruction-level parallelism, PACT '96, pp. 238-247 (1996).
    • (1996) PACT '96 , pp. 238-247
    • Carr, S.1
  • 27
    • 0001465739 scopus 로고
    • Maximizing loop parallelism and improving data locality via loop fusion and distribution
    • Ken Kennedy and Kathryn S. McKinley, Maximizing loop parallelism and improving data locality via loop fusion and distribution, Lang. Compilers for Parallel Computing (1993).
    • (1993) Lang. Compilers for Parallel Computing
    • Kennedy, K.1    McKinley, K.S.2
  • 28
    • 0030661485 scopus 로고    scopus 로고
    • Optimizing matrix multiply using PHiPAC: A portable, high-performance, ANSI C coding methodology
    • Jeff Bilmes, Krste Asanović, Chee-Whye Chin, and Jim Demmel, Optimizing matrix multiply using PHiPAC: a portable, high-performance, ANSI C coding methodology, Intl. Conf. Supercomputing (1997).
    • (1997) Intl. Conf. Supercomputing
    • Bilmes, J.1    Asanović, K.2    Chin, C.-W.3    Demmel, J.4
  • 29
    • 0002741087 scopus 로고    scopus 로고
    • Hierarchical tiling: A methodology for high performance
    • UCSD, Department of Computer Science and Engineering November
    • Larry Carter, Jeanne Ferrante, Susan Flynn Hummel, Bowen Alpern, and Kang Su Gatlin, Hierarchical tiling: A methodology for high performance, Technical Report CS96-508, UCSD, Department of Computer Science and Engineering (November 1996).
    • (1996) Technical Report CS96-508
    • Carter, L.1    Ferrante, J.2    Hummel, S.F.3    Alpern, B.4    Gatlin, K.S.5


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.