-
1
-
-
0014701246
-
"Evaluation Techniques for Storage Hierarchies"
-
R.L. Mattson, J. Gecsei, D. Slutz, and I.L. Traiger, "Evaluation Techniques for Storage Hierarchies," IBM Systems J., vol. 9, no. 2, pp. 78-117, 1970.
-
(1970)
IBM Systems J.
, vol.9
, Issue.2
, pp. 78-117
-
-
Mattson, R.L.1
Gecsei, J.2
Slutz, D.3
Traiger, I.L.4
-
2
-
-
0003789873
-
"Aspects of Cache Memory and Instruction Buffer Performance"
-
PhD thesis, Univ. of California, Berkeley, Nov
-
M.D. Hill, "Aspects of Cache Memory and Instruction Buffer Performance," PhD thesis, Univ. of California, Berkeley, Nov. 1987.
-
(1987)
-
-
Hill, M.D.1
-
8
-
-
4243397947
-
"Efficient Simulation of Multiple Cache Configurations Using Binomial Trees"
-
Technical Report CSE-TR-111-91, Computer Science and Eng. Division, Univ. of Michigan
-
R.A. Sugumar and S.G. Abraham, "Efficient Simulation of Multiple Cache Configurations Using Binomial Trees," Technical Report CSE-TR-111-91, Computer Science and Eng. Division, Univ. of Michigan, 1991.
-
(1991)
-
-
Sugumar, R.A.1
Abraham, S.G.2
-
9
-
-
0003465202
-
"The SimpleScalar Tool Set, Version 2.0"
-
Technical Report CS-TR-97-1342, Dept. of Computer Science, Univ. of Wisconsin, June
-
D. Burger and T. Austin, "The SimpleScalar Tool Set, Version 2.0," Technical Report CS-TR-97-1342, Dept. of Computer Science, Univ. of Wisconsin, June 1997.
-
(1997)
-
-
Burger, D.1
Austin, T.2
-
11
-
-
0003665539
-
"Quantifying Loop Nest Locality Using SPEC'95 and the Perfect Benchmarks"
-
Nov
-
K.S. McKinley and O. Temam, "Quantifying Loop Nest Locality Using SPEC'95 and the Perfect Benchmarks," ACM Trans. Computer Systems, vol. 17, pp. 288-336, Nov. 1999.
-
(1999)
ACM Trans. Computer Systems
, vol.17
, pp. 288-336
-
-
McKinley, K.S.1
Temam, O.2
-
12
-
-
84981274540
-
"Improving Effective Bandwidth through Compiler Enhancement of Global Cache Reuse"
-
Apr
-
C. Ding and K. Kennedy, "Improving Effective Bandwidth through Compiler Enhancement of Global Cache Reuse," Proc. Int'l Parallel and Distributed Processing Symp., Apr. 2001, http://www.ipdps.org.
-
(2001)
Proc. Int'l Parallel and Distributed Processing Symp.
-
-
Ding, C.1
Kennedy, K.2
-
13
-
-
84968854066
-
"Inter-Procedural Loop Fusion, Array Contraction and Rotation"
-
Sept
-
J. Ng, D. Kulkarni, W. Li, R. Cox, and S. Bobholz, "Inter-Procedural Loop Fusion, Array Contraction and Rotation," Proc. Int'l Conf. Parallel Architectures and Compilation Techniques, Sept. 2003.
-
(2003)
Proc. Int'l Conf. Parallel Architectures and Compilation Techniques
-
-
Ng, J.1
Kulkarni, D.2
Li, W.3
Cox, R.4
Bobholz, S.5
-
14
-
-
33745797169
-
"Reuse-Distance-Based Miss-Rate Prediction on a per Instruction Basis"
-
June
-
C. Fang, S. Carr, S. Onder, and Z. Wang, "Reuse-Distance-Based Miss-Rate Prediction on a per Instruction Basis," Proc. First ACM SIGPLAN Workshop Memory System Performance, June 2004.
-
(2004)
Proc. First ACM SIGPLAN Workshop Memory System Performance
-
-
Fang, C.1
Carr, S.2
Onder, S.3
Wang, Z.4
-
19
-
-
0003690938
-
"Software Methods for Improvement of Cache Performance"
-
PhD thesis, Dept. of Computer Science, Rice Univ., May
-
A. Porterfield, "Software Methods for Improvement of Cache Performance," PhD thesis, Dept. of Computer Science, Rice Univ., May 1989.
-
(1989)
-
-
Porterfield, A.1
-
21
-
-
0000579037
-
"Analysis of Interprocedural Side Effects in a Parallel Programming Environment"
-
Oct
-
D. Callahan, J. Cocke, and K. Kennedy, "Analysis of Interprocedural Side Effects in a Parallel Programming Environment," J. Parallel and Distributed Computing, vol. 5, pp. 517-550, Oct. 1988.
-
(1988)
J. Parallel and Distributed Computing
, vol.5
, pp. 517-550
-
-
Callahan, D.1
Cocke, J.2
Kennedy, K.3
-
22
-
-
0026186967
-
"An Implementation of Interprocedural Bounded Regular Section Analysis"
-
July
-
P. Havlak and K. Kennedy, "An Implementation of Interprocedural Bounded Regular Section Analysis," IEEE Trans. Parallel and Distributed Systems, vol. 2, pp. 350-360, July 1991.
-
(1991)
IEEE Trans. Parallel and Distributed Systems
, vol.2
, pp. 350-360
-
-
Havlak, P.1
Kennedy, K.2
-
23
-
-
0025229934
-
"An Efficient Data Dependence Analysis for Parallelizing Compilers"
-
Jan
-
Z. Li, P. Yew, and C. Zhu, "An Efficient Data Dependence Analysis for Parallelizing Compilers," IEEE Trans. Parallel and Distributed Systems, vol. 1, pp. 26-34, Jan. 1990.
-
(1990)
IEEE Trans. Parallel and Distributed Systems
, vol.1
, pp. 26-34
-
-
Li, Z.1
Yew, P.2
Zhu, C.3
-
25
-
-
0002678692
-
"On Estimating and Enhancing Cache Effectiveness"
-
U. Banerjee, D. Gelernter, A. Nicolau, and D. Padua, eds., Aug
-
J. Ferrante, V. Sarkar, and W. Thrash, "On Estimating and Enhancing Cache Effectiveness," Proc. Fourth Int'l Workshop Languages and Compilers for Parallel Computing, U. Banerjee, D. Gelernter, A. Nicolau, and D. Padua, eds., Aug. 1991.
-
(1991)
Proc. Fourth Int'l Workshop Languages and Compilers for Parallel Computing
-
-
Ferrante, J.1
Sarkar, V.2
Thrash, W.3
-
26
-
-
14944340143
-
"Software Methods to Improve Data Locality and Cache Behavior"
-
PhD thesis, Ghent Univ
-
K. Beyls, "Software Methods to Improve Data Locality and Cache Behavior," PhD thesis, Ghent Univ., 2004.
-
(2004)
-
-
Beyls, K.1
-
27
-
-
0034832018
-
"Exact Analysis of the Cache Behavior of Nested Loops"
-
S. Chatterjee, E. Parker, P.J. Hanlon, and A.R. Lebeck, "Exact Analysis of the Cache Behavior of Nested Loops," Proc. ACM SIGPLAN Conf. Programming Language Design and Implementation, 2001.
-
(2001)
Proc. ACM SIGPLAN Conf. Programming Language Design and Implementation
-
-
Chatterjee, S.1
Parker, E.2
Hanlon, P.J.3
Lebeck, A.R.4
-
28
-
-
0001714824
-
"Cache Miss Equations: A Compiler Framework for Analyzing and Tuning Memory Behavior"
-
S. Ghosh, M. Martonosi, and S. Malik, "Cache Miss Equations: A Compiler Framework for Analyzing and Tuning Memory Behavior," ACM Trans. Programming Languages and Systems, vol. 21, no. 4, 1999.
-
(1999)
ACM Trans. Programming Languages and Systems
, vol.21
, Issue.4
-
-
Ghosh, S.1
Martonosi, M.2
Malik, S.3
-
29
-
-
1842635044
-
"A Fast and Accurate Framework to Analyze and Optimize Cache Memory Behavior"
-
Mar
-
X. Vera, N. Bernudo, J. Llosa, and A. Gonzalez, "A Fast and Accurate Framework to Analyze and Optimize Cache Memory Behavior," ACM Trans. Programming Languages and Systems, vol. 26, Mar. 2004.
-
(2004)
ACM Trans. Programming Languages and Systems
, vol.26
-
-
Vera, X.1
Bernudo, N.2
Llosa, J.3
Gonzalez, A.4
-
30
-
-
3042664555
-
"Efficient and Accurate Analytical Modeling of Whole-Program Data Cache Behavior"
-
May
-
J. Xue and X. Vera, "Efficient and Accurate Analytical Modeling of Whole-Program Data Cache Behavior," IEEE Trans. Computers, vol. 53, no. 5, May 2004.
-
(2004)
IEEE Trans. Computers
, vol.53
, Issue.5
-
-
Xue, J.1
Vera, X.2
-
33
-
-
0004007719
-
"Improving Effective Bandwidth through Compiler Enhancement of Global and Dynamic Cache Reuse"
-
PhD thesis, Dept. of Computer Science, Rice Univ., Jan
-
C. Ding, "Improving Effective Bandwidth through Compiler Enhancement of Global and Dynamic Cache Reuse," PhD thesis, Dept. of Computer Science, Rice Univ., Jan. 2000.
-
(2000)
-
-
Ding, C.1
-
34
-
-
0037882892
-
"Reuse Distance Analysis for Scientific Programs"
-
Mar
-
Y. Zhong, C. Ding, and K. Kennedy, "Reuse Distance Analysis for Scientific Programs," Proc. Workshop Languages, Compilers, and Run-Time Systems for Scalable Computers, Mar. 2002.
-
(2002)
Proc. Workshop Languages, Compilers, and Run-Time Systems for Scalable Computers
-
-
Zhong, Y.1
Ding, C.2
Kennedy, K.3
|