-
1
-
-
0019567795
-
On the performance enhancement of paging systems through program analysis and transformations
-
A. Abu-Sufah, D. Kuck, and D. Lawrie, On the performance enhancement of paging systems through program analysis and transformations, IEEE Trans. Comput. C-30 (5) (1981), 341-356.
-
(1981)
IEEE Trans. Comput.
, vol.C-30
, Issue.5
, pp. 341-356
-
-
Abu-Sufah, A.1
Kuck, D.2
Lawrie, D.3
-
2
-
-
0004072686
-
-
Addison-Wesley, Reading, MA
-
A. V. Aho, R. Sethi, and J. Ullman, "Compilers: Principles, Techniques, and Tools," 2nd ed., Addison-Wesley, Reading, MA, 1986.
-
(1986)
"Compilers: Principles, Techniques, and Tools," 2nd Ed.
-
-
Aho, A.V.1
Sethi, R.2
Ullman, J.3
-
3
-
-
0004096151
-
-
Ph. D. dissertation, Stanford University, March 1997. [Technical ReportCSL-TR-97-179, Computer Systems Laboratory, Stanford University]
-
J. Anderson, "Automatic Computation and Data Decomposition for Multiprocessors," Ph. D. dissertation, Stanford University, March 1997. [Technical ReportCSL-TR-97-179, Computer Systems Laboratory, Stanford University]
-
Automatic Computation and Data Decomposition for Multiprocessors
-
-
Anderson, J.1
-
6
-
-
79952145180
-
A compiler framework for restructuring data declarations to enhance cache and TLB effectiveness
-
D. Bacon, J-H. Chow, D. Ching, R. Ju, K. Muthukumar, and V. Sarkar, a compiler framework for restructuring data declarations to enhance cache and TLB effectiveness, in "Proc. GASCON '94 Conference, Toronto, Canada, November 1994."
-
Proc. GASCON '94 Conference, Toronto, Canada, November 1994
-
-
Bacon, D.1
Chow, J.-H.2
Ching, D.3
Ju, R.4
Muthukumar, K.5
Sarkar, V.6
-
7
-
-
0003207812
-
Unimodular transformations of double loops
-
(A. Nicolau et al., Eds.), MIT Press, Cambridge, MA
-
U. Banerjee, Unimodular transformations of double loops, in "Proc. Advances in Languages and Compilers for Parallel Processing" (A. Nicolau et al., Eds.), MIT Press, Cambridge, MA, 1991.
-
(1991)
Proc. Advances in Languages and Compilers for Parallel Processing
-
-
Banerjee, U.1
-
9
-
-
0030382364
-
Advanced program restructuring for high-performance computers with Polaris
-
December
-
W. Blume, R. Doallo, R. Eigenmann, J. Grout, J. Hoeflinger, T. Lawrence, J. Lee, D. Padua, Y. Paek, B. Pottenger, L. Rauchwerger, and P. Tu, Advanced program restructuring for high-performance computers with Polaris, IEEE Comput. (December 1996), 78-82.
-
(1996)
IEEE Comput.
, pp. 78-82
-
-
Blume, W.1
Doallo, R.2
Eigenmann, R.3
Grout, J.4
Hoeflinger, J.5
Lawrence, T.6
Lee, J.7
Padua, D.8
Paek, Y.9
Pottenger, B.10
Rauchwerger, L.11
Tu, P.12
-
10
-
-
0005055365
-
An overview of symbolic analysis techniques needed for the effective parallelization of the PERFECT benchmarks
-
W. Blume and R. Eigenmann, An overview of symbolic analysis techniques needed for the effective parallelization of the PERFECT benchmarks, in "Proc. 1994 International Conference on Parallel Processing (ICPP '94), pages II.233-II.238, August, 1994."
-
Proc. 1994 International Conference on Parallel Processing (ICPP '94), Pages II.233-II.238, August, 1994
-
-
Blume, W.1
Eigenmann, R.2
-
11
-
-
0025447908
-
Improving register allocation for subscripted variables
-
Assoc. Comput. Mach., New York
-
D. Callahan, S. Carr, and K. Kennedy, Improving register allocation for subscripted variables, in "Proc. SIGPLAN Conference on Programming Language Design and Implementation (PLDI '90)," Assoc. Comput. Mach., New York, 1990.
-
(1990)
Proc. SIGPLAN Conference on Programming Language Design and Implementation (PLDI '90)
-
-
Callahan, D.1
Carr, S.2
Kennedy, K.3
-
12
-
-
0028549474
-
Improving the ratio of memory operations to floating-point operations in loops
-
November
-
S. Carr and K. Kennedy, Improving the ratio of memory operations to floating-point operations in loops, ACM Trans. Program. Lang. Syst. 16 (6) (November 1994), 1769-1810.
-
(1994)
ACM Trans. Program. Lang. Syst.
, vol.16
, Issue.6
, pp. 1769-1810
-
-
Carr, S.1
Kennedy, K.2
-
13
-
-
0030651789
-
Data-distribution support on distributed-shared memory multiprocessors
-
R. Chandra, D. Chen, R. Cox, D. Maydan, N. Nedeljkovic, and J. M. Anderson, Data-distribution support on distributed-shared memory multiprocessors, in "Proc. Programming Language Design and Implementation (PLDI '97), Las Vegas, NV, 1997."
-
Proc. Programming Language Design and Implementation (PLDI '97), Las Vegas, NV, 1997
-
-
Chandra, R.1
Chen, D.2
Cox, R.3
Maydan, D.4
Nedeljkovic, N.5
Anderson, J.M.6
-
14
-
-
0029238937
-
Optimal Evaluation of Array Expressions on Massively parallel Machines
-
January
-
S. Chatterjee, J. Gilbert, R. Schreiber, and S. Teng, Optimal Evaluation of Array Expressions on Massively parallel Machines, ACM Trans. Programming Lang. Systems 17 (1) (January 1995), 123-156.
-
(1995)
ACM Trans. Programming Lang. Systems
, vol.17
, Issue.1
, pp. 123-156
-
-
Chatterjee, S.1
Gilbert, J.2
Schreiber, R.3
Teng, S.4
-
16
-
-
33646152155
-
-
Technical Report 591. Dept. of Computer Science, University of Rochester, July
-
M. Cierniak and W. Li, Recovering logical data and code structures, Technical Report 591. Dept. of Computer Science, University of Rochester, July 1995.
-
(1995)
Recovering Logical Data and Code Structures
-
-
Cierniak, M.1
Li, W.2
-
18
-
-
0030295490
-
Compiling affine nested loops: How to optimize the residual communications after the alignment phase
-
M. Dion, C. Randriamaro, and Y. Robert, Compiling affine nested loops: how to optimize the residual communications after the alignment phase, Journal of Parallel and Distributed Computing (JPDC) 38 (2) (1996), 176-187.
-
(1996)
Journal of Parallel and Distributed Computing (JPDC)
, vol.38
, Issue.2
, pp. 176-187
-
-
Dion, M.1
Randriamaro, C.2
Robert, Y.3
-
19
-
-
85015240805
-
On estimating and enhancing cache effectiveness
-
J. Ferrante, V. Sarkar, and W. Thrash, On estimating and enhancing cache effectiveness, in "Proc. Languages and Compilers for Parallel Computing(LCPC '91), pages 328-343, 1991."
-
Proc. Languages and Compilers for Parallel Computing(LCPC '91), Pages 328-343, 1991
-
-
Ferrante, J.1
Sarkar, V.2
Thrash, W.3
-
20
-
-
0001366267
-
Strategies for cache and local memory management by global program transformations
-
D. Gannon, W. Jalby, and K. Gallivan, Strategies for cache and local memory management by global program transformations, Journal of Parallel and Distributed Computing 5 (1988), 587-616.
-
(1988)
Journal of Parallel and Distributed Computing
, vol.5
, pp. 587-616
-
-
Gannon, D.1
Jalby, W.2
Gallivan, K.3
-
21
-
-
0029430244
-
A novel approach towards automatic data distribution
-
J. Garcia, E. Ayguade, and J. Labarta, A novel approach towards automatic data distribution, in "Proc. Supercomputing '95, San Diego, December 1995."
-
Proc. Supercomputing '95, San Diego, December 1995
-
-
Garcia, J.1
Ayguade, E.2
Labarta, J.3
-
22
-
-
85030069977
-
Symbolic analysis: A basis for parallelization, optimization, and scheduling of programs
-
M. Haghighat and C. Polychronopoulos, Symbolic analysis: A basis for parallelization, optimization, and scheduling of programs, in "Proc. 6th Annual Workshop on Languages and Compilers for Parallel Computing (LCPC '93), Portland, OR, 1993."
-
Proc. 6th Annual Workshop on Languages and Compilers for Parallel Computing (LCPC '93), Portland, OR, 1993
-
-
Haghighat, M.1
Polychronopoulos, C.2
-
23
-
-
0004302191
-
-
Morgan Kaufmann Publishers, San Mateo, CA
-
J. Hennessy and D. Patterson, "Computer Architecture: A Quantitiative Approach," 2nd ed., Morgan Kaufmann Publishers, San Mateo, CA, 1995.
-
(1995)
"Computer Architecture: A Quantitiative Approach," 2nd Ed.
-
-
Hennessy, J.1
Patterson, D.2
-
26
-
-
85030071012
-
-
Deleted in proof.
-
Deleted in proof.
-
-
-
-
27
-
-
0031645299
-
A hyperplane based approach for optimizating spatial locality in loop nests
-
M. Kandemir, A. Choudhary, N. Shenoy, P. Banerjee, and J. Ramanujam, A hyperplane based approach for optimizating spatial locality in loop nests, in "Proc. 1998 ACM International Conference on Supercomputing (ICS '98), July 1998."
-
Proc. 1998 ACM International Conference on Supercomputing (ICS '98), July 1998
-
-
Kandemir, M.1
Choudhary, A.2
Shenoy, N.3
Banerjee, P.4
Ramanujam, J.5
-
28
-
-
0037722074
-
A matrix-based approach to the global locality optimization problem
-
M. Kandemir, A. Choudhary, J. Ramanujam, and P. Banerjee, A matrix-based approach to the global locality optimization problem, in "Proc. international Conference on parallel Architectures and Compilation Techniques (PACT '98), October 14-17, 1998, Paris, France."
-
Proc. International Conference on Parallel Architectures and Compilation Techniques (PACT '98), October 14-17, 1998, Paris, France
-
-
Kandemir, M.1
Choudhary, A.2
Ramanujam, J.3
Banerjee, P.4
-
29
-
-
85030061212
-
An iteration space transformation algorithm based on explicit data layout representation for optimizing locality
-
M. Kandemir, J. Ramanujam, A. Choudhary, and P. Banerjee, An iteration space transformation algorithm based on explicit data layout representation for optimizing locality, in "Proc. Languages & Compilers for Parallel Computing (LCPC '98), Chapel Hill, NC, August 1998."
-
Proc. Languages & Compilers for Parallel Computing (LCPC '98), Chapel Hill, NC, August 1998
-
-
Kandemir, M.1
Ramanujam, J.2
Choudhary, A.3
Banerjee, P.4
-
30
-
-
0030662867
-
A compiler algorithm for optimizing locality in loop nests
-
M. Kandemir, J. Ramanujam, and A. Choudhary, A compiler algorithm for optimizing locality in loop nests, in "Proc. 11th ACM International Conference on Supercomputing (ICS '97), pages 269-276. Vienna, Austria, July 1997."
-
Proc. 11th ACM International Conference on Supercomputing (ICS '97), Pages 269-276. Vienna, Austria, July 1997
-
-
Kandemir, M.1
Ramanujam, J.2
Choudhary, A.3
-
31
-
-
0031334865
-
Compiler algorithms for optimizing locality and parallelism on shared and distributed memory machines
-
M. Kandemir, J. Ramanujam, and A. Choudhary, Compiler algorithms for optimizing locality and parallelism on shared and distributed memory machines, in "Proc. 1997 Int. Conf. Parallel Architectures and Compilation Techniques (PACT '97), pages 236-247, San Francisco, CA, November 1997."
-
Proc. 1997 Int. Conf. Parallel Architectures and Compilation Techniques (PACT '97), Pages 236-247, San Francisco, CA, November 1997
-
-
Kandemir, M.1
Ramanujam, J.2
Choudhary, A.3
-
32
-
-
0003904906
-
-
Technical Report CS-TR-3445, CS Dept., University of Maryland, College Park, March
-
W. Kelly, V. Maslov, W. Pugh, E. Rosser, T. Shpeisman, and David Wonnacott, "The Omega Library Interface Guide," Technical Report CS-TR-3445, CS Dept., University of Maryland, College Park, March 1995.
-
(1995)
The Omega Library Interface Guide
-
-
Kelly, W.1
Maslov, V.2
Pugh, W.3
Rosser, E.4
Shpeisman, T.5
Wonnacott, D.6
-
33
-
-
85030078148
-
Automatic data layout for High Performance Fortan
-
K. Kennedy and U. Kremer, Automatic data layout for High Performance Fortan, in "Proc. Supercomputing '95, San Diego, CA, December 1995."
-
Proc. Supercomputing '95, San Diego, CA, December 1995
-
-
Kennedy, K.1
Kremer, U.2
-
38
-
-
0003582055
-
-
Technical Report TR 95-09-01, Dept. Computer Science and Engingeering, University of Washington, Sept.
-
S-T. Leung and J. Zahorjan, "Optimizing Data Locality by Array Restructuring," Technical Report TR 95-09-01, Dept. Computer Science and Engingeering, University of Washington, Sept. 1995.
-
(1995)
Optimizing Data Locality by Array Restructuring
-
-
Leung, S.-T.1
Zahorjan, J.2
-
39
-
-
0003888396
-
-
Ph.D. thesis, Cornell University, Ithaca, NY
-
W. Li, "Compiling for NUMA Parallel Machines," Ph.D. thesis, Cornell University, Ithaca, NY, 1993.
-
(1993)
Compiling for NUMA Parallel Machines
-
-
Li, W.1
-
40
-
-
0026187669
-
Compiling communication efficient programs for massively parallel machines
-
J. Li and M. Chen, Compiling communication efficient programs for massively parallel machines, J. Parallel Distrib. Comput. 2 (3) (1991), 361-376.
-
(1991)
J. Parallel Distrib. Comput.
, vol.2
, Issue.3
, pp. 361-376
-
-
Li, J.1
Chen, M.2
-
42
-
-
84945709131
-
The organization of matrices and matrix operations in a paged multiprogramming environment
-
A. McKeller and E. Coffman, The organization of matrices and matrix operations in a paged multiprogramming environment, Comm. CACM 12 (3) (1969), 153-165.
-
(1969)
Comm. CACM
, vol.12
, Issue.3
, pp. 153-165
-
-
McKeller, A.1
Coffman, E.2
-
44
-
-
85030080063
-
Non-singular data transoformations: Definition, validity, applications
-
M. O'Boyle and P. Knijnenburg, Non-singular data transoformations: Definition, validity, applications, in "Proc. 6th Workshop on Compilers for Parallel Computers (CPC '96), pages 287-297, 1996."
-
Proc. 6th Workshop on Compilers for Parallel Computers (CPC '96), Pages 287-297, 1996
-
-
O'Boyle, M.1
Knijnenburg, P.2
-
45
-
-
0036107271
-
Integrating loop and data transformations for global optimisation
-
M. O'Boyle and P. Knijnenburg, Integrating loop and data transformations for global optimisation, in "Proc. International Conference on Parallel Architectures and Compilation Techniques (PACT '98), October 14-17, 1998, Paris, France."
-
Proc. International Conference on Parallel Architectures and Compilation Techniques (PACT '98), October 14-17, 1998, Paris, France
-
-
O'Boyle, M.1
Knijnenburg, P.2
-
47
-
-
0026231056
-
Compile-time techniques for data distribution in distributed memory machines
-
Oct.
-
J. Ramanujam and P. Sadayappan, Compile-time techniques for data distribution in distributed memory machines, IEEE Trans. Parallel Distrib. Systems 2 (4) (Oct. 1991), 472-482.
-
(1991)
IEEE Trans. Parallel Distrib. Systems
, vol.2
, Issue.4
, pp. 472-482
-
-
Ramanujam, J.1
Sadayappan, P.2
-
48
-
-
0031622954
-
Data transformations for eliminating conflict misses
-
G. Rivera and C.-W. Tseng, Data transformations for eliminating conflict misses, in "Proc. the 1998 ACM SIGPLAN Conference on Programming Language Design and Implementation (PLDI '98), Montreal, Canada, June 1998."
-
Proc. the 1998 ACM SIGPLAN Conference on Programming Language Design and Implementation (PLDI '98), Montreal, Canada, June 1998
-
-
Rivera, G.1
Tseng, C.-W.2
-
49
-
-
85030069920
-
Locality analysis for distributed shared-Memory multiprocessors
-
V. Sarkar, G. R. Gao, and S. Han, Locality analysis for distributed shared-Memory multiprocessors, in "Proc. 9th Workshop on Languages and Compilers for Parallel Computing (LCPC '96), Santa Clara, CA, August 1996."
-
Proc. 9th Workshop on Languages and Compilers for Parallel Computing (LCPC '96), Santa Clara, CA, August 1996
-
-
Sarkar, V.1
Gao, G.R.2
Han, S.3
-
51
-
-
85030063443
-
-
Deleted in proof.
-
Deleted in proof.
-
-
-
-
52
-
-
0030652844
-
Automatic partitioning of data and computations on scalable shared memory multiprocessors
-
S. Tandri and T. Abdelrahman, Automatic partitioning of data and computations on scalable shared memory multiprocessors, in "Proc. 1997 International Conference on Parallel Processing (ICPP '97), Bloomingdale, IL, pages 64-73, August 1997."
-
Proc. 1997 International Conference on Parallel Processing (ICPP '97), Bloomingdale, IL, Pages 64-73, August 1997
-
-
Tandri, S.1
Abdelrahman, T.2
-
53
-
-
0029194311
-
Unified compilation techniques for shared and distributed address space machines
-
C.-W. Tseng, J. Anderson, S. Amarasinghe, and M. Lam, Unified compilation techniques for shared and distributed address space machines, in "Proc. ACM International Conference on Supercomputing (ICS '95), July 1995."
-
Proc. ACM International Conference on Supercomputing (ICS '95), July 1995
-
-
Tseng, C.-W.1
Anderson, J.2
Amarasinghe, S.3
Lam, M.4
-
54
-
-
0028446907
-
False sharing and spatial locality in multiprocessor caches
-
June
-
J. Torrellas, M. S. Lam, and J. L. Hennessey, False sharing and spatial locality in multiprocessor caches, IEEE Trans. Comput. 43 (6) (June 1994), 651-663.
-
(1994)
IEEE Trans. Comput.
, vol.43
, Issue.6
, pp. 651-663
-
-
Torrellas, J.1
Lam, M.S.2
Hennessey, J.L.3
-
55
-
-
0029194338
-
Evaluating the impact of advanced memory systems on compiler-parallelized codes
-
E. Torrie, C-W. Tseng, M. Martonosi, and M. W. Hall, Evaluating the impact of advanced memory systems on compiler-parallelized codes, in "Proc. International Conference on Parallel Architectures and Compilation Techniques (PACT '95), June 1995."
-
Proc. International Conference on Parallel Architectures and Compilation Techniques (PACT '95), June 1995
-
-
Torrie, E.1
Tseng, C.-W.2
Martonosi, M.3
Hall, M.W.4
-
57
-
-
0030379246
-
Combining loop transformations considering caches and scheduling
-
M. Wolf, D. Maydan, and D.-K. Chen, Combining loop transformations considering caches and scheduling, in "Proc. MICRO '96, pages 274-286, 1996."
-
Proc. MICRO '96, Pages 274-286, 1996
-
-
Wolf, M.1
Maydan, D.2
Chen, D.-K.3
|