SCOPUS Information Retrieval Platform

Journal of Parallel and Distributed Computing

Volume 58, Issue 2, 1999, Pages 190-235

A Matrix-Based Approach to Global Locality Optimization

(4) Kandemir, Mahmut a; Choudhary, Alok b; Ramanujam, J. c; Banerjee, Prith b

a Syracuse University (United States)

b Northwestern University (United States)

c Louisiana State University (United States)

Author keywords

Array restructuring; Data reuse; Data transformations; Locality; Loop transformations; Memory hierarchy; Parallelism

Indexed keywords

EID: 0008323676 PISSN: 07437315 EISSN: None Source Type: Journal
DOI: 10.1006/jpdc.1999.1552 Document Type: Article

Times cited: (13)

References (59)

1
- 0019567795
- On the performance enhancement of paging systems through program analysis and transformations
- A. Abu-Sufah, D. Kuck, and D. Lawrie, On the performance enhancement of paging systems through program analysis and transformations, IEEE Trans. Comput. C-30 (5) (1981), 341-356.
- (1981) IEEE Trans. Comput. , vol.C-30 , Issue.5 , pp. 341-356
- Abu-Sufah, A.¹ Kuck, D.² Lawrie, D.³

2
- 0004072686
- Addison-Wesley, Reading, MA
- A. V. Aho, R. Sethi, and J. Ullman, "Compilers: Principles, Techniques, and Tools," 2nd ed., Addison-Wesley, Reading, MA, 1986.
- (1986) "Compilers: Principles, Techniques, and Tools," 2nd Ed.
- Aho, A.V.¹ Sethi, R.² Ullman, J.³

3
- 0004096151
- Ph.D. dissertation, Stanford University, March 1997. [Technical Report CSL-TR-97-179, Computer Systems Laboratory, Stanford University]
- J. Anderson, "Automatic Computation and Data Decomposition for Multiprocessors," Ph.D. dissertation, Stanford University, March 1997. [Technical Report CSL-TR-97-179, Computer Systems Laboratory, Stanford University]
- Automatic Computation and Data Decomposition for Multiprocessors
- Anderson, J.¹

4
- 0029181140
- Data and computation transformations for multiprocessors
- J. Anderson, S. Amarasinghe, and M. Lam, Data and computation transformations for multiprocessors, in "Proc. 5th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP '95), July 1995."
- Proc. 5th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP '95), July 1995
- Anderson, J.¹ Amarasinghe, S.² Lam, M.³

5
- 0027870804
- Global optimizations for parallelism and locality on scalable parallel machines
- J. Anderson and M. Lam, Global optimizations for parallelism and locality on scalable parallel machines, in "Proc. SIGPLAN Conference on Programming Language Design and Implementation (PLDI '93), pages 112-125, June 1993."
- Proc. SIGPLAN Conference on Programming Language Design and Implementation (PLDI '93), Pages 112-125, June 1993
- Anderson, J.¹ Lam, M.²

6
- 79952145180
- A compiler framework for restructuring data declarations to enhance cache and TLB effectiveness
- D. Bacon, J-H. Chow, D. Ching, R. Ju, K. Muthukumar, and V. Sarkar, A compiler framework for restructuring data declarations to enhance cache and TLB effectiveness, in "Proc. CASCON '94 Conference, Toronto, Canada, November 1994."
- Proc. CASCON '94 Conference, Toronto, Canada, November 1994
- Bacon, D.¹ Chow, J.-H.² Ching, D.³ Ju, R.⁴ Muthukumar, K.⁵ Sarkar, V.⁶

7
- 0003207812
- Unimodular transformations of double loops
- (A. Nicolau et al., Eds.), MIT Press, Cambridge, MA
- U. Banerjee, Unimodular transformations of double loops, in "Proc. Advances in Languages and Compilers for Parallel Processing" (A. Nicolau et al., Eds.), MIT Press, Cambridge, MA, 1991.
- (1991) Proc. Advances in Languages and Compilers for Parallel Processing
- Banerjee, U.¹

8
- 0003862557
- Technical Report 94-14, Dept. of Computer Science, Leiden University
- A. Bik and H. Wijshoff, "On a Completion Method for Unimodular Matrices," Technical Report 94-14, Dept. of Computer Science, Leiden University, 1994.
- (1994) On a Completion Method for Unimodular Matrices
- Bik, A.¹ Wijshoff, H.²

9
- 0030382364
- Advanced program restructuring for high-performance computers with Polaris
- December
- W. Blume, R. Doallo, R. Eigenmann, J. Grout, J. Hoeflinger, T. Lawrence, J. Lee, D. Padua, Y. Paek, B. Pottenger, L. Rauchwerger, and P. Tu, Advanced program restructuring for high-performance computers with Polaris, IEEE Comput. (December 1996), 78-82.
- (1996) IEEE Comput. , pp. 78-82
- Blume, W.¹ Doallo, R.² Eigenmann, R.³ Grout, J.⁴ Hoeflinger, J.⁵ Lawrence, T.⁶ Lee, J.⁷ Padua, D.⁸ Paek, Y.⁹ Pottenger, B.¹⁰ Rauchwerger, L.¹¹ Tu, P.¹²

10
- 0005055365
- An overview of symbolic analysis techniques needed for the effective parallelization of the PERFECT benchmarks
- W. Blume and R. Eigenmann, An overview of symbolic analysis techniques needed for the effective parallelization of the PERFECT benchmarks, in "Proc. 1994 International Conference on Parallel Processing (ICPP '94), pages II.233-II.238, August, 1994."
- Proc. 1994 International Conference on Parallel Processing (ICPP '94), Pages II.233-II.238, August, 1994
- Blume, W.¹ Eigenmann, R.²

11
- 0025447908
- Improving register allocation for subscripted variables
- Assoc. Comput. Mach., New York
- D. Callahan, S. Carr, and K. Kennedy, Improving register allocation for subscripted variables, in "Proc. SIGPLAN Conference on Programming Language Design and Implementation (PLDI '90)," Assoc. Comput. Mach., New York, 1990.
- (1990) Proc. SIGPLAN Conference on Programming Language Design and Implementation (PLDI '90)
- Callahan, D.¹ Carr, S.² Kennedy, K.³

12
- 0028549474
- Improving the ratio of memory operations to floating-point operations in loops
- November
- S. Carr and K. Kennedy, Improving the ratio of memory operations to floating-point operations in loops, ACM Trans. Program. Lang. Syst. 16 (6) (November 1994), 1769-1810.
- (1994) ACM Trans. Program. Lang. Syst. , vol.16 , Issue.6 , pp. 1769-1810
- Carr, S.¹ Kennedy, K.²

13
- 0030651789
- Data-distribution support on distributed-shared memory multiprocessors
- R. Chandra, D. Chen, R. Cox, D. Maydan, N. Nedeljkovic, and J. M. Anderson, Data-distribution support on distributed-shared memory multiprocessors, in "Proc. Programming Language Design and Implementation (PLDI '97), Las Vegas, NV, 1997."
- Proc. Programming Language Design and Implementation (PLDI '97), Las Vegas, NV, 1997
- Chandra, R.¹ Chen, D.² Cox, R.³ Maydan, D.⁴ Nedeljkovic, N.⁵ Anderson, J.M.⁶

14
- 0029238937
- Optimal Evaluation of Array Expressions on Massively Parallel Machines
- January
- S. Chatterjee, J. Gilbert, R. Schreiber, and S. Teng, Optimal Evaluation of Array Expressions on Massively Parallel Machines, ACM Trans. Programming Lang. Systems 17 (1) (January 1995), 123-156.
- (1995) ACM Trans. Programming Lang. Systems , vol.17 , Issue.1 , pp. 123-156
- Chatterjee, S.¹ Gilbert, J.² Schreiber, R.³ Teng, S.⁴

15
- 84976859799
- Unifying data and control transformations for distributed shared memory machines
- M. Cierniak and W. Li, Unifying data and control transformations for distributed shared memory machines, in "Proc. SIGPLAN '95 Conference on Programming Language Design and Implementation (PLDI '95), June 1995."
- Proc. SIGPLAN '95 Conference on Programming Language Design and Implementation (PLDI '95), June 1995
- Cierniak, M.¹ Li, W.²

16
- 33646152155
- Technical Report 591. Dept. of Computer Science, University of Rochester, July
- M. Cierniak and W. Li, Recovering logical data and code structures, Technical Report 591. Dept. of Computer Science, University of Rochester, July 1995.
- (1995) Recovering Logical Data and Code Structures
- Cierniak, M.¹ Li, W.²

17
- 84976745804
- Tile size selection using cache organization and data layout
- S. Coleman and K. McKinley, Tile size selection using cache organization and data layout, in "Proc. SIGPLAN '95 Conference on Programming Language Design and Implementation (PLDI '95), June 1995."
- Proc. SIGPLAN '95 Conference on Programming Language Design and Implementation (PLDI '95), June 1995
- Coleman, S.¹ McKinley, K.²

18
- 0030295490
- Compiling affine nested loops: How to optimize the residual communications after the alignment phase
- M. Dion, C. Randriamaro, and Y. Robert, Compiling affine nested loops: how to optimize the residual communications after the alignment phase, Journal of Parallel and Distributed Computing (JPDC) 38 (2) (1996), 176-187.
- (1996) Journal of Parallel and Distributed Computing (JPDC) , vol.38 , Issue.2 , pp. 176-187
- Dion, M.¹ Randriamaro, C.² Robert, Y.³

19
- 85015240805
- On estimating and enhancing cache effectiveness
- J. Ferrante, V. Sarkar, and W. Thrash, On estimating and enhancing cache effectiveness, in "Proc. Languages and Compilers for Parallel Computing (LCPC '91), pages 328-343, 1991."
- Proc. Languages and Compilers for Parallel Computing (LCPC '91), Pages 328-343, 1991
- Ferrante, J.¹ Sarkar, V.² Thrash, W.³

20
- 0001366267
- Strategies for cache and local memory management by global program transformations
- D. Gannon, W. Jalby, and K. Gallivan, Strategies for cache and local memory management by global program transformations, Journal of Parallel and Distributed Computing 5 (1988), 587-616.
- (1988) Journal of Parallel and Distributed Computing , vol.5 , pp. 587-616
- Gannon, D.¹ Jalby, W.² Gallivan, K.³

21
- 0029430244
- A novel approach towards automatic data distribution
- J. Garcia, E. Ayguade, and J. Labarta, A novel approach towards automatic data distribution, in "Proc. Supercomputing '95, San Diego, December 1995."
- Proc. Supercomputing '95, San Diego, December 1995
- Garcia, J.¹ Ayguade, E.² Labarta, J.³

22
- 85030069977
- Symbolic analysis: A basis for parallelization, optimization, and scheduling of programs
- M. Haghighat and C. Polychronopoulos, Symbolic analysis: A basis for parallelization, optimization, and scheduling of programs, in "Proc. 6th Annual Workshop on Languages and Compilers for Parallel Computing (LCPC '93), Portland, OR, 1993."
- Proc. 6th Annual Workshop on Languages and Compilers for Parallel Computing (LCPC '93), Portland, OR, 1993
- Haghighat, M.¹ Polychronopoulos, C.²

23
- 0004302191
- Morgan Kaufmann Publishers, San Mateo, CA
- J. Hennessy and D. Patterson, "Computer Architecture: A Quantitative Approach," 2nd ed., Morgan Kaufmann Publishers, San Mateo, CA, 1995.
- (1995) "Computer Architecture: A Quantitative Approach," 2nd Ed.
- Hennessy, J.¹ Patterson, D.²

24
- 38249000489
- Communication-free partitioning of nested loops
- C.-H. Huang and P. Sadayappan, Communication-free partitioning of nested loops, J. Parallel Distrib. Comput. 19 (1993), 90-102.
- (1993) J. Parallel Distrib. Comput. , vol.19 , pp. 90-102
- Huang, C.-H.¹ Sadayappan, P.²

25
- 0029192199
- Reducing false sharing on shared memory multiprocessors through compile time data transformations
- T. Jeremiassen and S. Eggers, Reducing false sharing on shared memory multiprocessors through compile time data transformations, in "Proc. 5th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP '95), July, 1995."
- Proc. 5th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP '95), July, 1995
- Jeremiassen, T.¹ Eggers, S.²

26
- 85030071012
- Deleted in proof.
- Deleted in proof.

27
- 0031645299
- A hyperplane based approach for optimizing spatial locality in loop nests
- M. Kandemir, A. Choudhary, N. Shenoy, P. Banerjee, and J. Ramanujam, A hyperplane based approach for optimizing spatial locality in loop nests, in "Proc. 1998 ACM International Conference on Supercomputing (ICS '98), July 1998."
- Proc. 1998 ACM International Conference on Supercomputing (ICS '98), July 1998
- Kandemir, M.¹ Choudhary, A.² Shenoy, N.³ Banerjee, P.⁴ Ramanujam, J.⁵

28
- 0037722074
- A matrix-based approach to the global locality optimization problem
- M. Kandemir, A. Choudhary, J. Ramanujam, and P. Banerjee, A matrix-based approach to the global locality optimization problem, in "Proc. International Conference on Parallel Architectures and Compilation Techniques (PACT '98), October 14-17, 1998, Paris, France."
- Proc. International Conference on Parallel Architectures and Compilation Techniques (PACT '98), October 14-17, 1998, Paris, France
- Kandemir, M.¹ Choudhary, A.² Ramanujam, J.³ Banerjee, P.⁴

29
- 85030061212
- An iteration space transformation algorithm based on explicit data layout representation for optimizing locality
- M. Kandemir, J. Ramanujam, A. Choudhary, and P. Banerjee, An iteration space transformation algorithm based on explicit data layout representation for optimizing locality, in "Proc. Languages & Compilers for Parallel Computing (LCPC '98), Chapel Hill, NC, August 1998."
- Proc. Languages & Compilers for Parallel Computing (LCPC '98), Chapel Hill, NC, August 1998
- Kandemir, M.¹ Ramanujam, J.² Choudhary, A.³ Banerjee, P.⁴

30
- 0030662867
- A compiler algorithm for optimizing locality in loop nests
- M. Kandemir, J. Ramanujam, and A. Choudhary, A compiler algorithm for optimizing locality in loop nests, in "Proc. 11th ACM International Conference on Supercomputing (ICS '97), pages 269-276. Vienna, Austria, July 1997."
- Proc. 11th ACM International Conference on Supercomputing (ICS '97), Pages 269-276. Vienna, Austria, July 1997
- Kandemir, M.¹ Ramanujam, J.² Choudhary, A.³

31
- 0031334865
- Compiler algorithms for optimizing locality and parallelism on shared and distributed memory machines
- M. Kandemir, J. Ramanujam, and A. Choudhary, Compiler algorithms for optimizing locality and parallelism on shared and distributed memory machines, in "Proc. 1997 Int. Conf. Parallel Architectures and Compilation Techniques (PACT '97), pages 236-247, San Francisco, CA, November 1997."
- Proc. 1997 Int. Conf. Parallel Architectures and Compilation Techniques (PACT '97), Pages 236-247, San Francisco, CA, November 1997
- Kandemir, M.¹ Ramanujam, J.² Choudhary, A.³

32
- 0003904906
- Technical Report CS-TR-3445, CS Dept., University of Maryland, College Park, March
- W. Kelly, V. Maslov, W. Pugh, E. Rosser, T. Shpeisman, and D. Wonnacott, "The Omega Library Interface Guide," Technical Report CS-TR-3445, CS Dept., University of Maryland, College Park, March 1995.
- (1995) The Omega Library Interface Guide
- Kelly, W.¹ Maslov, V.² Pugh, W.³ Rosser, E.⁴ Shpeisman, T.⁵ Wonnacott, D.⁶

33
- 85030078148
- Automatic data layout for High Performance Fortran
- K. Kennedy and U. Kremer, Automatic data layout for High Performance Fortran, in "Proc. Supercomputing '95, San Diego, CA, December 1995."
- Proc. Supercomputing '95, San Diego, CA, December 1995
- Kennedy, K.¹ Kremer, U.²

34
- 0007890139
- Transformations of imperfectly nested loops
- I. Kodukula and K. Pingali, Transformations of imperfectly nested loops, in "Proc. Supercomputing, November 1996."
- Proc. Supercomputing, November 1996
- Kodukula, I.¹ Pingali, K.²

35
- 0030685988
- Data-centric multi-level blocking
- I. Kodukula, N. Ahmed, and K. Pingali, Data-centric multi-level blocking, in "Proc. Programming Language Design and Implementation (PLDI '97), June 1997."
- Proc. Programming Language Design and Implementation (PLDI '97), June 1997
- Kodukula, I.¹ Ahmed, N.² Pingali, K.³

36
- 0003493790
- Prentice-Hall, Englewood Cliffs, NJ
- B. Kolman, "Introductory Linear Algebra with Applications," Prentice-Hall, Englewood Cliffs, NJ, 1997.
- (1997) Introductory Linear Algebra with Applications
- Kolman, B.¹

37
- 0002447423
- The cache performance of blocked algorithms
- M. Lam, E. Rothberg, and M. Wolf, The cache performance of blocked algorithms, in "Proc. 4th Int. Conf. Architectural Support for Programming Languages and Operating Systems (ASPLOS '91), April 1991."
- Proc. 4th Int. Conf. Architectural Support for Programming Languages and Operating Systems (ASPLOS '91), April 1991
- Lam, M.¹ Rothberg, E.² Wolf, M.³

38
- 0003582055
- Technical Report TR 95-09-01, Dept. Computer Science and Engineering, University of Washington, Sept.
- S-T. Leung and J. Zahorjan, "Optimizing Data Locality by Array Restructuring," Technical Report TR 95-09-01, Dept. Computer Science and Engineering, University of Washington, Sept. 1995.
- (1995) Optimizing Data Locality by Array Restructuring
- Leung, S.-T.¹ Zahorjan, J.²

39
- 0003888396
- Ph.D. thesis, Cornell University, Ithaca, NY
- W. Li, "Compiling for NUMA Parallel Machines," Ph.D. thesis, Cornell University, Ithaca, NY, 1993.
- (1993) Compiling for NUMA Parallel Machines
- Li, W.¹

40
- 0026187669
- Compiling communication efficient programs for massively parallel machines
- J. Li and M. Chen, Compiling communication efficient programs for massively parallel machines, J. Parallel Distrib. Comput. 2 (3) (1991), 361-376.
- (1991) J. Parallel Distrib. Comput. , vol.2 , Issue.3 , pp. 361-376
- Li, J.¹ Chen, M.²

41
- 0026971052
- De-linearization: An efficient way to break multi-loop dependence equations
- V. Maslov, De-linearization: An efficient way to break multi-loop dependence equations, in "Proc. the SIGPLAN '92 Conference on Programming Language Design and Implementation (PLDI'92), San Francisco, CA, June 1992."
- Proc. the SIGPLAN '92 Conference on Programming Language Design and Implementation (PLDI'92), San Francisco, CA, June 1992
- Maslov, V.¹

42
- 84945709131
- The organization of matrices and matrix operations in a paged multiprogramming environment
- A. McKellar and E. Coffman, The organization of matrices and matrix operations in a paged multiprogramming environment, Comm. ACM 12 (3) (1969), 153-165.
- (1969) Comm. ACM , vol.12 , Issue.3 , pp. 153-165
- McKellar, A.¹ Coffman, E.²

43
- 0030190854
- Improving data locality with loop transformations
- K. McKinley, S. Carr, and C. W. Tseng, Improving data locality with loop transformations, ACM Trans. Programming Lang. Systems, 1996.
- (1996) ACM Trans. Programming Lang. Systems
- McKinley, K.¹ Carr, S.² Tseng, C.W.³

44
- 85030080063
- Non-singular data transformations: Definition, validity, applications
- M. O'Boyle and P. Knijnenburg, Non-singular data transformations: Definition, validity, applications, in "Proc. 6th Workshop on Compilers for Parallel Computers (CPC '96), pages 287-297, 1996."
- Proc. 6th Workshop on Compilers for Parallel Computers (CPC '96), Pages 287-297, 1996
- O'Boyle, M.¹ Knijnenburg, P.²

45
- 0036107271
- Integrating loop and data transformations for global optimisation
- M. O'Boyle and P. Knijnenburg, Integrating loop and data transformations for global optimisation, in "Proc. International Conference on Parallel Architectures and Compilation Techniques (PACT '98), October 14-17, 1998, Paris, France."
- Proc. International Conference on Parallel Architectures and Compilation Techniques (PACT '98), October 14-17, 1998, Paris, France
- O'Boyle, M.¹ Knijnenburg, P.²

46
- 0003690936
- Ph.D. thesis, Rice University, Houston, May
- A. Porterfield, "Software Methods for Improvement of Cache Performance on Supercomputer Applications," Ph.D. thesis, Rice University, Houston, May 1989.
- (1989) Software Methods for Improvement of Cache Performance on Supercomputer Applications
- Porterfield, A.¹

47
- 0026231056
- Compile-time techniques for data distribution in distributed memory machines
- Oct.
- J. Ramanujam and P. Sadayappan, Compile-time techniques for data distribution in distributed memory machines, IEEE Trans. Parallel Distrib. Systems 2 (4) (Oct. 1991), 472-482.
- (1991) IEEE Trans. Parallel Distrib. Systems , vol.2 , Issue.4 , pp. 472-482
- Ramanujam, J.¹ Sadayappan, P.²

48
- 0031622954
- Data transformations for eliminating conflict misses
- G. Rivera and C.-W. Tseng, Data transformations for eliminating conflict misses, in "Proc. the 1998 ACM SIGPLAN Conference on Programming Language Design and Implementation (PLDI '98), Montreal, Canada, June 1998."
- Proc. the 1998 ACM SIGPLAN Conference on Programming Language Design and Implementation (PLDI '98), Montreal, Canada, June 1998
- Rivera, G.¹ Tseng, C.-W.²

49
- 85030069920
- Locality analysis for distributed shared-memory multiprocessors
- V. Sarkar, G. R. Gao, and S. Han, Locality analysis for distributed shared-memory multiprocessors, in "Proc. 9th Workshop on Languages and Compilers for Parallel Computing (LCPC '96), Santa Clara, CA, August 1996."
- Proc. 9th Workshop on Languages and Compilers for Parallel Computing (LCPC '96), Santa Clara, CA, August 1996
- Sarkar, V.¹ Gao, G.R.² Han, S.³

50
- 0003690189
- Wiley, New York
- A. Schrijver, "Theory of Linear and Integer Programming," Wiley, New York, 1986.
- (1986) Theory of Linear and Integer Programming
- Schrijver, A.¹

51
- 85030063443
- Deleted in proof.
- Deleted in proof.

52
- 0030652844
- Automatic partitioning of data and computations on scalable shared memory multiprocessors
- S. Tandri and T. Abdelrahman, Automatic partitioning of data and computations on scalable shared memory multiprocessors, in "Proc. 1997 International Conference on Parallel Processing (ICPP '97), Bloomingdale, IL, pages 64-73, August 1997."
- Proc. 1997 International Conference on Parallel Processing (ICPP '97), Bloomingdale, IL, Pages 64-73, August 1997
- Tandri, S.¹ Abdelrahman, T.²

53
- 0029194311
- Unified compilation techniques for shared and distributed address space machines
- C.-W. Tseng, J. Anderson, S. Amarasinghe, and M. Lam, Unified compilation techniques for shared and distributed address space machines, in "Proc. ACM International Conference on Supercomputing (ICS '95), July 1995."
- Proc. ACM International Conference on Supercomputing (ICS '95), July 1995
- Tseng, C.-W.¹ Anderson, J.² Amarasinghe, S.³ Lam, M.⁴

54
- 0028446907
- False sharing and spatial locality in multiprocessor caches
- June
- J. Torrellas, M. S. Lam, and J. L. Hennessy, False sharing and spatial locality in multiprocessor caches, IEEE Trans. Comput. 43 (6) (June 1994), 651-663.
- (1994) IEEE Trans. Comput. , vol.43 , Issue.6 , pp. 651-663
- Torrellas, J.¹ Lam, M.S.² Hennessy, J.L.³

55
- 0029194338
- Evaluating the impact of advanced memory systems on compiler-parallelized codes
- E. Torrie, C-W. Tseng, M. Martonosi, and M. W. Hall, Evaluating the impact of advanced memory systems on compiler-parallelized codes, in "Proc. International Conference on Parallel Architectures and Compilation Techniques (PACT '95), June 1995."
- Proc. International Conference on Parallel Architectures and Compilation Techniques (PACT '95), June 1995
- Torrie, E.¹ Tseng, C.-W.² Martonosi, M.³ Hall, M.W.⁴

56
- 3142767727
- A data locality optimizing algorithm
- M. Wolf and M. Lam, A data locality optimizing algorithm, in "Proc. ACM SIGPLAN 91 Conf. Programming Language Design and Implementation (PLDI '91), pages 30-44, June 1991."
- Proc. ACM SIGPLAN 91 Conf. Programming Language Design and Implementation (PLDI '91), Pages 30-44, June 1991
- Wolf, M.¹ Lam, M.²

57
- 0030379246
- Combining loop transformations considering caches and scheduling
- M. Wolf, D. Maydan, and D.-K. Chen, Combining loop transformations considering caches and scheduling, in "Proc. MICRO '96, pages 274-286, 1996."
- Proc. MICRO '96, Pages 274-286, 1996
- Wolf, M.¹ Maydan, D.² Chen, D.-K.³

58
- 0024935630
- More iteration space tiling
- M. Wolfe, More iteration space tiling, in "Proc. Supercomputing '89, pages 655-664, November 1989."
- Proc. Supercomputing '89, Pages 655-664, November 1989
- Wolfe, M.¹

59
- 0003927035
- Addison-Wesley, Reading, MA
- M. Wolfe, "High Performance Compilers for Parallel Computing," Addison-Wesley, Reading, MA, 1996.
- (1996) High Performance Compilers for Parallel Computing
- Wolfe, M.¹

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.