SCOPUS Information Retrieval Platform

International Journal of Parallel Programming

Volume 26, Issue 6, 1998, Pages 641-670

Quantifying the Multi-Level Nature of Tiling Interactions

(4) Mitchell, Nicholas (a); Högstedt, Karin (b); Carter, Larry (b); Ferrante, Jeanne (b)

a UNIVERSITY OF CALIFORNIA (United States)

b NONE

Author keywords

Compiler; Locality; Memory hierarchy; Parallelism; Tiling

Indexed keywords

COMPUTER ARCHITECTURE; COSTS; DATA STORAGE EQUIPMENT; HIERARCHICAL SYSTEMS; OPTIMIZATION; STORAGE ALLOCATION (COMPUTER);

MEMORY HIERARCHY; MULTILEVEL COST FUNCTIONS; PARALLELISM; TILING;

PARALLEL PROCESSING SYSTEMS;

EID: 0032308685 PISSN: 08857458 EISSN: None Source Type: Journal
DOI: 10.1023/A:1018782528453 Document Type: Article

Times cited : (44)

References (32)

1
- 85013942562
- A data locality optimizing algorithm
- Michael E. Wolf and Monica S. Lam, A data locality optimizing algorithm, Progr. Lang. Design Implementation (1991).
- (1991) Progr. Lang. Design Implementation
- Wolf, M.E.¹ Lam, M.S.²

2
- 84964748976
- Compiler blockability of numerical algorithms
- November
- Steve Carr and Ken Kennedy, Compiler blockability of numerical algorithms, J. Supercomputing, pp. 114-124 (November 1992).
- (1992) J. Supercomputing , pp. 114-124
- Carr, S.¹ Kennedy, K.²

3
- 85009364061
- Compiler optimizations for improving data locality
- San Jose, California, Oct.
- Steve Carr, Kathryn S. McKinley, and Chau-Wen Tseng, Compiler optimizations for improving data locality, Sixth Intl. Conf. Archit. Support Progr. Lang. Oper. Syst., San Jose, California, Oct. 1994.
- (1994) Sixth Intl. Conf. Archit. Support Progr. Lang. Oper. Syst.
- Carr, S.¹ McKinley, K.S.² Tseng, C.-W.³

4
- 0028549474
- Improving the ratio of memory operations to floating-point operations in loops
- November
- Steve Carr and Ken Kennedy, Improving the ratio of memory operations to floating-point operations in loops, Trans. Progr. Lang. Syst. 16(6):1768-1810 (November 1994).
- (1994) Trans. Progr. Lang. Syst. , vol.16 , Issue.6 , pp. 1768-1810
- Carr, S.¹ Kennedy, K.²

5
- 84976766536
- Scanning polyhedra with DO loops
- April
- Corinne Ancourt and François Irigoin, Scanning polyhedra with DO loops, Principles and Practice of Parallel Progr., pp. 39-50 (April 1991).
- (1991) Principles and Practice of Parallel Progr. , pp. 39-50
- Ancourt, C.¹ Irigoin, F.²

6
- 0026232450
- A loop transformation theory and an algorithm to maximize parallelism
- Michael E. Wolf and Monica S. Lam, A loop transformation theory and an algorithm to maximize parallelism, IEEE Trans. Parallel Distrib. Syst. 2(4):452-471 (1991).
- (1991) IEEE Trans. Parallel Distrib. Syst. , vol.2 , Issue.4 , pp. 452-471
- Wolf, M.E.¹ Lam, M.S.²

7
- 0026933251
- Some efficient solutions to the affine scheduling problem, Part I, one-dimensional time
- October
- Paul Feautrier, Some efficient solutions to the affine scheduling problem, Part I, one-dimensional time, IJPP 21(5):xx-xx (October 1992).
- (1992) IJPP , vol.21 , Issue.5
- Feautrier, P.¹

8
- 0029218667
- A unifying framework for iteration reordering transformations
- April
- Wayne Kelly and William Pugh, A unifying framework for iteration reordering transformations, IEEE First Int'l. Conf. Algorithms and Architectures for Parallel Processing (April 1995).
- (1995) IEEE First Int'l. Conf. Algorithms and Architectures for Parallel Processing
- Kelly, W.¹ Pugh, W.²

9
- 2342480327
- Unrolling-based optimizations for modulo scheduling
- December
- Daniel Lavery and Wen-mei Hwu, Unrolling-based optimizations for modulo scheduling, 28th Int'l. Symp. Microarchit., pp. 126-141 (December 1995).
- (1995) 28th Int'l. Symp. Microarchit. , pp. 126-141
- Lavery, D.¹ Hwu, W.-M.²

10
- 85009352487
- Tile size selection using cache organization and data layout
- June
- Stephanie Coleman and Kathryn S. McKinley, Tile size selection using cache organization and data layout, Progr. Lang. Design and Implementation (June 1995).
- (1995) Progr. Lang. Design and Implementation
- Coleman, S.¹ McKinley, K.S.²

11
- 84958797356
- Locality analysis for distributed shared-memory multiprocessors
- Vivek Sarkar, Guang R. Gao, and Shaohua Han, Locality analysis for distributed shared-memory multiprocessors, Lang. Compilers for Parallel Computing (1996).
- (1996) Lang. Compilers for Parallel Computing
- Sarkar, V.¹ Gao, G.R.² Han, S.³

12
- 84941604581
- Chap. 12, McGraw Hill Co.
- Dennis Gannon and Ko-Yang Wang, Applying AI Techniques to Program Optimization for Parallel Computers, Chap. 12, McGraw Hill Co. (1989).
- (1989) Applying AI Techniques to Program Optimization for Parallel Computers
- Gannon, D.¹ Wang, K.-Y.²

13
- 0030379246
- Combining loop transformations considering caches and scheduling
- December
- Michael E. Wolf, Dror Maydan, and Ding-Kai Chen, Combining loop transformations considering caches and scheduling, 29th Int'l. Symp. Microarchit. (December 1996).
- (1996) 29th Int'l. Symp. Microarchit.
- Wolf, M.E.¹ Maydan, D.² Chen, D.-K.³

14
- 0002433589
- Iteration space tiling for memory hierarchies
- Michael J. Wolfe, Iteration space tiling for memory hierarchies, Parallel Processing for Sci. Comput., pp. 357-361 (1987).
- (1987) Parallel Processing for Sci. Comput. , pp. 357-361
- Wolfe, M.J.¹

15
- 0002238004
- Tiling multidimensional iteration spaces for nonshared memory machines
- November
- J. Ramanujam and P. Sadayappan, Tiling multidimensional iteration spaces for nonshared memory machines, Supercomputing (November 1991).
- (1991) Supercomputing
- Ramanujam, J.¹ Sadayappan, P.²

16
- 0022874874
- Advanced compiler optimizations for supercomputers
- December
- David A. Padua and Michael J. Wolfe, Advanced compiler optimizations for supercomputers, Commun. ACM 29(12):1184-1201 (December 1986).
- (1986) Commun. ACM , vol.29 , Issue.12 , pp. 1184-1201
- Padua, D.A.¹ Wolfe, M.J.²

17
- 0001366267
- Strategies for cache and local memory management by global program transformation
- October
- Dennis Gannon, William Jalby, and Kyle Gallivan, Strategies for cache and local memory management by global program transformation, J. Parallel and Distrib. Comput., Vol. 5, No. 5 (October 1988).
- (1988) J. Parallel and Distrib. Comput. , vol.5 , Issue.5
- Gannon, D.¹ Jalby, W.² Gallivan, K.³

18
- 85026986651
- January
- François Irigoin and Rémi Triolet, Supernode partitioning, Principles of Progr. Lang., pp. 319-328 (January 1988).
- (1988) Principles of Progr. Lang. , pp. 319-328
- Irigoin, F.¹ Triolet, R.²

19
- 0024935630
- More iteration space tiling
- Michael J. Wolfe, More iteration space tiling, Supercomputing, pp. 655-664 (1989).
- (1989) Supercomputing , pp. 655-664
- Wolfe, M.J.¹

20
- 0026137116
- The cache performance and optimizations of blocked algorithms
- Palo Alto, California April
- Monica S. Lam, Edward E. Rothberg, and Michael E. Wolf, The cache performance and optimizations of blocked algorithms, ASPLOS-IV , Palo Alto, California (April 1991).
- (1991) ASPLOS-IV
- Lam, M.S.¹ Rothberg, E.E.² Wolf, M.E.³

21
- 0003207812
- Unimodular transformations of double loops
- Irvine, California August
- Utpal Banerjee, Unimodular transformations of double loops, in Progr. Lang. Compilers for Parallel Computing, Irvine, California (August 1990).
- (1990) Progr. Lang. Compilers for Parallel Computing
- Banerjee, U.¹

22
- 85027602455
- Optimizing for parallelism and data locality
- July
- Ken Kennedy and Kathryn S. McKinley, Optimizing for parallelism and data locality, Intl. Conf. Supercomputing (July 1992).
- (1992) Intl. Conf. Supercomputing
- Kennedy, K.¹ McKinley, K.S.²

23
- 0002678692
- On estimating and enhancing cache effectiveness
- Jeanne Ferrante, Vivek Sarkar, and Wendy Thrash, On estimating and enhancing cache effectiveness, Lang. Compilers for Parallel Computing (1991).
- (1991) Lang. Compilers for Parallel Computing
- Ferrante, J.¹ Sarkar, V.² Thrash, W.³

24
- 0003934689
- Automatic partitioning of parallel loops and data arrays for distributed shared memory multiprocessors
- Anant Agarwal, David Kranz, and Venkat Natarajan, Automatic partitioning of parallel loops and data arrays for distributed shared memory multiprocessors, Int'l. Conf. Parallel Computing (1993).
- (1993) Int'l. Conf. Parallel Computing
- Agarwal, A.¹ Kranz, D.² Natarajan, V.³

25
- 84949655044
- A general framework for iteration-reordering loop transformations, Technical Summary
- Vivek Sarkar and Radhika Thekkath, A general framework for iteration-reordering loop transformations, Technical Summary, Progr. Lang. Design and Implementation (1992).
- (1992) Progr. Lang. Design and Implementation
- Sarkar, V.¹ Thekkath, R.²

26
- 0029749714
- Combining optimization for cache and instruction-level parallelism
- Steve Carr, Combining optimization for cache and instruction-level parallelism, PACT '96, pp. 238-247 (1996).
- (1996) PACT '96 , pp. 238-247
- Carr, S.¹

27
- 0001465739
- Maximizing loop parallelism and improving data locality via loop fusion and distribution
- Ken Kennedy and Kathryn S. McKinley, Maximizing loop parallelism and improving data locality via loop fusion and distribution, Lang. Compilers for Parallel Computing (1993).
- (1993) Lang. Compilers for Parallel Computing
- Kennedy, K.¹ McKinley, K.S.²

28
- 0030661485
- Optimizing matrix multiply using PHiPAC: A portable, high-performance, ANSI C coding methodology
- Jeff Bilmes, Krste Asanović, Chee-Whye Chin, and Jim Demmel, Optimizing matrix multiply using PHiPAC: a portable, high-performance, ANSI C coding methodology, Intl. Conf. Supercomputing (1997).
- (1997) Intl. Conf. Supercomputing
- Bilmes, J.¹ Asanović, K.² Chin, C.-W.³ Demmel, J.⁴

29
- 0002741087
- Hierarchical tiling: A methodology for high performance
- UCSD, Department of Computer Science and Engineering November
- Larry Carter, Jeanne Ferrante, Susan Flynn Hummel, Bowen Alpern, and Kang Su Gatlin, Hierarchical tiling: A methodology for high performance, Technical Report CS96-508, UCSD, Department of Computer Science and Engineering (November 1996).
- (1996) Technical Report CS96-508
- Carter, L.¹ Ferrante, J.² Hummel, S.F.³ Alpern, B.⁴ Gatlin, K.S.⁵

30
- 0003496786
- Doug Burger and Todd Austin, The SimpleScalar architectural research tool set, Version 2.0, http://www.cs.wisc.edu/~mscalar/simplescalar.html
- The SimpleScalar Architectural Research Tool Set, Version 2.0
- Burger, D.¹ Austin, T.²

31
- 0030651937
- Determining the idle time of a tiling
- Karin Högstedt, Larry Carter, and Jeanne Ferrante, Determining the idle time of a tiling, Principles of Progr. Lang. (1997).
- (1997) Principles of Progr. Lang.
- Högstedt, K.¹ Carter, L.² Ferrante, J.³

32
- 0029235623
- Hierarchical tiling for improved superscalar performance
- April
- Larry Carter, Jeanne Ferrante, and S. Flynn Hummel, Hierarchical tiling for improved superscalar performance, Int'l. Parallel Processing Symp. (April 1995).
- (1995) Int'l. Parallel Processing Symp.
- Carter, L.¹ Ferrante, J.² Flynn Hummel, S.³

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.