-
1
-
-
0033700781
-
Synthesizing transformations for locality enhancement of imperfectly-nested loop nests
-
N. Ahmed, N. Mateev, K. Pingali, Synthesizing transformations for locality enhancement of imperfectly-nested loop nests, in: ICS, 2000, pp. 141-152
-
(2000)
ICS
, pp. 141-152
-
-
Ahmed, N.1
Mateev, N.2
Pingali, K.3
-
2
-
-
0003003638
-
A study of replacement algorithms for a virtual-storage computer
-
L.A. Belady A study of replacement algorithms for a virtual-storage computer IBM Systems Journal 5 2 1966 78 101
-
(1966)
IBM Systems Journal
, vol.5
, Issue.2
, pp. 78-101
-
-
Belady, L.A.1
-
4
-
-
14944358600
-
Compiler generated multithreading to alleviate memory latency
-
Multithreaded Processors and Chip-Multiprocessors
-
K. Beyls, and E. D'Hollander Compiler generated multithreading to alleviate memory latency Multithreaded Processors and Chip-Multiprocessors Journal of Universal Computer Science 6 10 2000 968 993 (special issue)
-
(2000)
Journal of Universal Computer Science
, vol.6
, Issue.10
, pp. 968-993
-
-
Beyls, K.1
D'Hollander, E.2
-
5
-
-
4644271073
-
Reuse distance as a metric for cache behavior
-
August
-
K. Beyls, E.H. D'Hollander, Reuse distance as a metric for cache behavior, in: Proceedings of PDCS'01, August 2001, pp. 617-662
-
(2001)
Proceedings of PDCS'01
, pp. 617-662
-
-
Beyls, K.1
D'Hollander, E.H.2
-
6
-
-
3042548199
-
An analytic model of locality and caching
-
Michigan State University, August
-
M. Brehob, R.J. Enbody, An analytic model of locality and caching, Technical Report MSU-CSE-99-31, Michigan State University, August 1999
-
(1999)
Technical Report
, vol.MSU-CSE-99-31
-
-
Brehob, M.1
Enbody, R.J.2
-
9
-
-
0030086672
-
Design of the HP PA 7200 CPU
-
Technical information from the laboratories of Hewlett-Packard Company.
-
K.K. Chan, C.C. Hay, J.R. Keller, G.P. Kurpanek, F.X. Schumacher, and J. Zheng Design of the HP PA 7200 CPU Technical information from the laboratories of Hewlett-Packard Company Hewlett-Packard Journal 47 1 1996 25 33
-
(1996)
Hewlett-Packard Journal
, vol.47
, Issue.1
, pp. 25-33
-
-
Chan, K.K.1
Hay, C.C.2
Keller, J.R.3
Kurpanek, G.P.4
Schumacher, F.X.5
Zheng, J.6
-
10
-
-
0032652980
-
Nonlinear array layouts for hierarchical memory systems
-
June
-
S. Chatterjee, V. Jain, A.R. Lebeck, S. Mundhra, M.S. Thottethodi, Nonlinear array layouts for hierarchical memory systems, in: 13th ACM International Conference on Supercomputing (ICS'99), June 1999
-
(1999)
13th ACM International Conference on Supercomputing (ICS'99)
-
-
Chatterjee, S.1
Jain, V.2
Lebeck, A.R.3
Mundhra, S.4
Thottethodi, M.S.5
-
11
-
-
0034832018
-
Exact analysis of the cache behavior of nested loops
-
S. Chatterjee, E. Parker, P. Hanlon, A.R. Lebeck, Exact analysis of the cache behavior of nested loops, in: PLDI, 2001, pp. 286-297
-
(2001)
PLDI
, pp. 286-297
-
-
Chatterjee, S.1
Parker, E.2
Hanlon, P.3
Lebeck, A.R.4
-
12
-
-
0032667164
-
Cache-conscious structure layout
-
May
-
T.M. Chilimbi, M.D. Hill, J.R. Larus, Cache-conscious structure layout, in: Proceedings of the ACM SIGPLAN'99 Conference on Programming Language Design and Implementation, May 1999, pp. 1-12
-
(1999)
Proceedings of the ACM SIGPLAN'99 Conference on Programming Language Design and Implementation
, pp. 1-12
-
-
Chilimbi, T.M.1
Hill, M.D.2
Larus, J.R.3
-
13
-
-
0029717349
-
Counting solutions to linear and nonlinear constraints through Ehrhart polynomials: Applications to analyze and transform scientific programs
-
ACM, May
-
P. Clauss, Counting solutions to linear and nonlinear constraints through Ehrhart polynomials: applications to analyze and transform scientific programs, in: ACM International Conference on Supercomputing, ACM, May 1996, pp. 278-285
-
(1996)
ACM International Conference on Supercomputing
, pp. 278-285
-
-
Clauss, P.1
-
15
-
-
0032066945
-
The Fortran parallel transformer and its programming environment
-
E. D'Hollander, F. Zhang, and Q. Wang The Fortran parallel transformer and its programming environment Journal of Information Science 106 7 1998 293 317
-
(1998)
Journal of Information Science
, vol.106
, Issue.7
, pp. 293-317
-
-
D'Hollander, E.1
Zhang, F.2
Wang, Q.3
-
16
-
-
0038716440
-
Predicting whole-program locality through reuse distance analysis
-
ACM
-
C. Ding, Y. Zhong, Predicting whole-program locality through reuse distance analysis, in: PLDI'03, ACM, 2003
-
(2003)
PLDI'03
-
-
Ding, C.1
Zhong, Y.2
-
17
-
-
33745797169
-
Reuse-distance-based miss-rate prediction on a per instruction basis
-
C. Fang, S. Carr, S. Onder, Z. Wang, Reuse-distance-based miss-rate prediction on a per instruction basis, in: Workshop on Memory System Performance (MSP), 2004
-
(2004)
Workshop on Memory System Performance (MSP)
-
-
Fang, C.1
Carr, S.2
Onder, S.3
Wang, Z.4
-
18
-
-
84957027384
-
Automatic parallelization in the polytope model
-
P. Feautrier, Automatic parallelization in the polytope model, in: The Data Parallel Programming Model, 1996, pp. 79-103
-
(1996)
The Data Parallel Programming Model
, pp. 79-103
-
-
Feautrier, P.1
-
19
-
-
1342264156
-
A compiler tool to predict memory hierarchy performance of scientific codes
-
B.B. Fraguela, R. Doallo, J. Touriño, and E.L. Zapata A compiler tool to predict memory hierarchy performance of scientific codes Parallel Computing 30 2004 225 248
-
(2004)
Parallel Computing
, vol.30
, pp. 225-248
-
-
Fraguela, B.B.1
Doallo, R.2
Touriño, J.3
Zapata, E.L.4
-
21
-
-
0034474789
-
MIST: An algorithm for memory miss traffic management
-
P. Grun, N. Dutt, A. Nicolau, MIST: An algorithm for memory miss traffic management, in: International Conference on Computer Aided Design, 2000, pp. 431-437
-
(2000)
International Conference on Computer Aided Design
, pp. 431-437
-
-
Grun, P.1
Dutt, N.2
Nicolau, A.3
-
22
-
-
0024903997
-
Evaluating associativity in CPU caches
-
M.D. Hill, and A.J. Smith Evaluating associativity in CPU caches IEEE Transactions on Computers 38 12 1989 1612 1630
-
(1989)
IEEE Transactions on Computers
, vol.38
, Issue.12
, pp. 1612-1630
-
-
Hill, M.D.1
Smith, A.J.2
-
25
-
-
0035209061
-
Software-assisted replacement mechanisms for embedded systems
-
November
-
P. Jain, S. Devadas, D. Engels, L. Rudolph, Software-assisted replacement mechanisms for embedded systems, in: International Conference on Computer Aided Design, November 2001, pp. 119-126
-
(2001)
International Conference on Computer Aided Design
, pp. 119-126
-
-
Jain, P.1
Devadas, S.2
Engels, D.3
Rudolph, L.4
-
26
-
-
0025429331
-
Improving direct-mapped cache performance by the addition of a small fully-associative cache and prefetch buffers
-
May
-
N.P. Jouppi, Improving direct-mapped cache performance by the addition of a small fully-associative cache and prefetch buffers, in: The 17th ISCA, May 1990, pp. 364-373
-
(1990)
The 17th ISCA
, pp. 364-373
-
-
Jouppi, N.P.1
-
27
-
-
0032680282
-
An integer linear programming approach for optimizing cache locality
-
ACM Press
-
M. Kandemir, P. Banerjee, A. Choudhary, J. Ramanujam, and E. Ayguade An integer linear programming approach for optimizing cache locality Proceedings of the 1999 Conference on Supercomputing, ACM SIGARCH, NY, June 1999 1999 ACM Press 500 509
-
(1999)
Proceedings of the 1999 Conference on Supercomputing, ACM SIGARCH, NY, June 1999
, pp. 500-509
-
-
Kandemir, M.1
Banerjee, P.2
Choudhary, A.3
Ramanujam, J.4
Ayguade, E.5
-
28
-
-
0037722074
-
A matrix-based approach to the global locality optimization problem
-
IEEE Computer Society Press
-
M. Kandemir, A. Choudhary, J. Ramanujam, and P. Banerjee A matrix-based approach to the global locality optimization problem Proceedings of the 1998 International Conference on Parallel Architectures and Compilation Techniques (PACT'98), Paris, France, October 12-18, 1998 1998 IEEE Computer Society Press 306 313
-
(1998)
Proceedings of the 1998 International Conference on Parallel Architectures and Compilation Techniques (PACT'98), Paris, France, October 12-18, 1998
, pp. 306-313
-
-
Kandemir, M.1
Choudhary, A.2
Ramanujam, J.3
Banerjee, P.4
-
30
-
-
0001465739
-
Maximizing loop parallelism and improving data locality via loop fusion and distribution
-
Springer Verlag Berlin
-
K. Kennedy, and K.S. McKinley Maximizing loop parallelism and improving data locality via loop fusion and distribution 1993 Workshop on Languages and Compilers for Parallel Computing, Number 768, Portland, Ore. 1993 Springer Verlag Berlin 301 320
-
(1993)
1993 Workshop on Languages and Compilers for Parallel Computing, Number 768, Portland, Ore.
, pp. 301-320
-
-
Kennedy, K.1
McKinley, K.S.2
-
31
-
-
0347304618
-
Data-centric multi-level blocking
-
I. Kodukula, N. Ahmed, K. Pingali, Data-centric multi-level blocking, in: PLDI, 1997, pp. 346-357
-
(1997)
PLDI
, pp. 346-357
-
-
Kodukula, I.1
Ahmed, N.2
Pingali, K.3
-
33
-
-
0026918402
-
Design and evaluation of a compiler algorithm for prefetching
-
T.C. Mowry, M.S. Lam, and A. Gupta Design and evaluation of a compiler algorithm for prefetching ACM SIGPLAN Notices 27 9 1992 62 73
-
(1992)
ACM SIGPLAN Notices
, vol.27
, Issue.9
, pp. 62-73
-
-
Mowry, T.C.1
Lam, M.S.2
Gupta, A.3
-
34
-
-
84898062812
-
-
Open64 compiler, < http://sourceforge.net/projects/open64>
-
Open64 Compiler
-
-
-
35
-
-
0029488249
-
Cache miss heuristics and preloading techniques for general-purpose programs
-
Ann Arbor, Michigan, November 29-December 1, IEEE Computer Society TC-MICRO and ACM SIGMICRO
-
T. Ozawa, Y. Kimura, S. Nishizaki, Cache miss heuristics and preloading techniques for general-purpose programs, in: MICRO'95, Ann Arbor, Michigan, November 29-December 1, 1995, IEEE Computer Society TC-MICRO and ACM SIGMICRO, pp. 243-248
-
(1995)
MICRO'95
, pp. 243-248
-
-
Ozawa, T.1
Kimura, Y.2
Nishizaki, S.3
-
36
-
-
0028132512
-
Counting solutions to Presburger formulas: How and why
-
W. Pugh Counting solutions to Presburger formulas: how and why ACM SIGPLAN Notices 29 6 1994 121 134
-
(1994)
ACM SIGPLAN Notices
, vol.29
, Issue.6
, pp. 121-134
-
-
Pugh, W.1
-
39
-
-
84949840377
-
Fast, accurate and flexible data locality analysis
-
IEEE Computer Society Press
-
J. Sánchez, and A. González Fast, accurate and flexible data locality analysis PACT'98, Paris, France, October 12-18, 1998 1998 IEEE Computer Society Press 124 129
-
(1998)
PACT'98, Paris, France, October 12-18, 1998
, pp. 124-129
-
-
Sánchez, J.1
González, A.2
-
40
-
-
0032659502
-
A locality sensitive multi-module cache with explicit management
-
ACM Press
-
J. Sanchez, and A. Gonzalez A locality sensitive multi-module cache with explicit management Proceedings of the 1999 Conference on Supercomputing, ACM SIGARCH, NY, June 20-25, 1999 1999 ACM Press 51 59
-
(1999)
Proceedings of the 1999 Conference on Supercomputing, ACM SIGARCH, NY, June 20-25, 1999
, pp. 51-59
-
-
Sanchez, J.1
Gonzalez, A.2
-
41
-
-
0033892359
-
EPIC: Explicitly parallel instruction computing
-
M.S. Schlansker, and B.R.R. Cover EPIC: Explicitly parallel instruction computing IEEE Computer 33 2 2000 37 45
-
(2000)
IEEE Computer
, vol.33
, Issue.2
, pp. 37-45
-
-
Schlansker, M.S.1
Cover, B.R.R.2
-
43
-
-
84943426389
-
Skewed-associative caches
-
Springer-Verlag, Proceedings of PARLE'93, Munich, Germany, June 14-17, 1993
-
A. Seznec, and F. Bodin Skewed-associative caches Proceedings of PARLE'93, Munich, Germany, June 14-17, 1993 Lecture Notes in Computer Science 1993 Springer-Verlag 305 316
-
(1993)
Lecture Notes in Computer Science
, pp. 305-316
-
-
Seznec, A.1
Bodin, F.2
-
44
-
-
84941198744
-
Compiler-directed resource management for active code regions
-
February
-
R. Sree, A. Settle, I. Bratt, D.A. Connors, Compiler-directed resource management for active code regions, in: Proceedings of the 7th Workshop on Interaction between Compilers and Computer Architecture, February 2003
-
(2003)
Proceedings of the 7th Workshop on Interaction between Compilers and Computer Architecture
-
-
Sree, R.1
Settle, A.2
Bratt, I.3
Connors, D.A.4
-
45
-
-
0029508817
-
A modified approach to data cache management
-
Ann Arbor, Michigan, November 29-December 1, IEEE Computer Society TC-MICRO and ACM SIGMICRO
-
G. Tyson, M. Farrens, J. Matthews, A.R. Pleszkun, A modified approach to data cache management, in: MICRO'95, Ann Arbor, Michigan, November 29-December 1, 1995, IEEE Computer Society TC-MICRO and ACM SIGMICRO, pp. 93-103
-
(1995)
MICRO'95
, pp. 93-103
-
-
Tyson, G.1
Farrens, M.2
Matthews, J.3
Pleszkun, A.R.4
-
48
-
-
14944379009
-
-
The Barvinok library
-
S. Verdoolaege, The Barvinok library, < http://freshmeat.net/projects/ barvinok>
-
-
-
Verdoolaege, S.1
-
49
-
-
24644501875
-
Analytical computation of Ehrhart polynomials: Enabling more compiler analyses and optimizations
-
CASES
-
S. Verdoolaege, R. Seghir, K. Beyls, V. Loechner, M. Bruynooghe, Analytical computation of Ehrhart polynomials: enabling more compiler analyses and optimizations, in: International Conference on Compilers, Architectures and Synthesis for Embedded Systems, CASES, 2004
-
(2004)
International Conference on Compilers, Architectures and Synthesis for Embedded Systems
-
-
Verdoolaege, S.1
Seghir, R.2
Beyls, K.3
Loechner, V.4
Bruynooghe, M.5
-
50
-
-
14944380022
-
Using the compiler to improve cache replacement decisions
-
September
-
Z. Wang, K. McKinley, A. Rosenberg, C. Weems, Using the compiler to improve cache replacement decisions, in: PACT'02, September 2002
-
(2002)
PACT'02
-
-
Wang, Z.1
McKinley, K.2
Rosenberg, A.3
Weems, C.4
-
53
-
-
0034581198
-
Modified LRU policies for improving second-level cache behavior
-
January 8-12
-
W.A. Wong, J.-L. Baer, Modified LRU policies for improving second-level cache behavior, in: HPCA-6, January 8-12, 2000, pp. 49-60
-
(2000)
HPCA-6
, pp. 49-60
-
-
Wong, W.A.1
Baer, J.-L.2
-
54
-
-
14944366408
-
Compiler-assisted cache replacement: Problem formulation and performance evaluation
-
October
-
H. Yang, R. Govindarajan, G.R. Gao, Z. Hu, Compiler-assisted cache replacement: problem formulation and performance evaluation, in: 16th International Workshop on Languages and Compilers for Parallel Computing (LCPC'03), October 2003
-
(2003)
16th International Workshop on Languages and Compilers for Parallel Computing (LCPC'03)
-
-
Yang, H.1
Govindarajan, R.2
Gao, G.R.3
Hu, Z.4
|