-
2
-
-
84976293345
-
Calculating stack distances efficiently
-
G. Almasi, C. Cascaval, and D. Padua. Calculating stack distances efficiently. In Proceedings of the first ACM SIGPLAN Workshop on Memory System Performance, Berlin, Germany, 2002.
-
Proceedings of the First ACM SIGPLAN Workshop on Memory System Performance, Berlin, Germany, 2002
-
-
Almasi, G.1
Cascaval, C.2
Padua, D.3
-
5
-
-
84976768727
-
A static performance estimator to guide data partitioning decisions
-
V. Balasundaram, G. Fox, K. Kennedy, and U. Kremer. A static performance estimator to guide data partitioning decisions. In Proceedings of the Third ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, Williamsburg, VA, Apr. 1991.
-
Proceedings of the Third ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, Williamsburg, VA, Apr. 1991
-
-
Balasundaram, V.1
Fox, G.2
Kennedy, K.3
Kremer, U.4
-
9
-
-
0000493064
-
Estimating interlock and improving balance for pipelined machines
-
Aug.
-
D. Callahan, J. Cocke, and K. Kennedy. Estimating interlock and improving balance for pipelined machines. Journal of Parallel and Distributed Computing, 5(4):334-358, Aug. 1988.
-
(1988)
Journal of Parallel and Distributed Computing
, vol.5
, Issue.4
, pp. 334-358
-
-
Callahan, D.1
Cocke, J.2
Kennedy, K.3
-
10
-
-
0028549474
-
Improving the ratio of memory operations to floating-point operations in loops
-
S. Carr and K. Kennedy. Improving the ratio of memory operations to floating-point operations in loops. ACM Transactions on Programming Languages and Systems, 16(6):1768-1810, 1994.
-
(1994)
ACM Transactions on Programming Languages and Systems
, vol.16
, Issue.6
, pp. 1768-1810
-
-
Carr, S.1
Kennedy, K.2
-
11
-
-
0003510310
-
Compile-time performance prediction of scientific programs
-
PhD thesis, University of Illinois at Urbana-Champaign
-
G.C. Cascaval. Compile-time Performance Prediction of Scientific Programs. PhD thesis, University of Illinois at Urbana-Champaign, 2000.
-
(2000)
-
-
Cascaval, G.C.1
-
15
-
-
0038220597
-
Profitability computations on program flow graphs
-
Technical Report RC 5123, IBM
-
J. Cocke and K. Kennedy. Profitability computations on program flow graphs. Technical Report RC 5123, IBM, 1974.
-
(1974)
-
-
Cocke, J.1
Kennedy, K.2
-
16
-
-
0001483604
-
Communication optimizations for irregular scientific computations on distributed memory architectures
-
Sept.
-
R. Das, M. Uysal, J. Saltz, and Y.-S. Hwang. Communication optimizations for irregular scientific computations on distributed memory architectures. Journal of Parallel and Distributed Computing, 22(3):462-479, Sept. 1994.
-
(1994)
Journal of Parallel and Distributed Computing
, vol.22
, Issue.3
, pp. 462-479
-
-
Das, R.1
Uysal, M.2
Saltz, J.3
Hwang, Y.-S.4
-
17
-
-
0004007719
-
Improving effective bandwich through compiler enhancement of global and dynamic cache reuse
-
PhD thesis, Dept. of Computer Science, Rice University, January
-
C. Ding. Improving Effective Bandwich through Compiler Enhancement of Global and Dynamic Cache Reuse. PhD thesis, Dept. of Computer Science, Rice University, January 2000.
-
(2000)
-
-
Ding, C.1
-
20
-
-
84948749348
-
Workload design: Selecting representative program-input pairs
-
L. Eeckhout, H. Vandierendonck, and K. D. Bosschere. Workload design: selecting representative program-input pairs. In Proceedings of International Conference on Parallel Architectures and Compilation Techniques, Charlottesville, Virginia, 2002.
-
Proceedings of International Conference on Parallel Architectures and Compilation Techniques, Charlottesville, Virginia, 2002
-
-
Eeckhout, L.1
Vandierendonck, H.2
Bosschere, K.D.3
-
21
-
-
0038220600
-
Locality optimizations for adaptive irregular scientific codes
-
Technical report, Department of Computer Science, University of Maryland, College Park
-
H. Han and C. W. Tseng. Locality optimizations for adaptive irregular scientific codes. Technical report, Department of Computer Science, University of Maryland, College Park, 2000.
-
(2000)
-
-
Han, H.1
Tseng, C.W.2
-
22
-
-
0003789873
-
Aspects of cache memory and instruction buffer performance
-
PhD thesis, University of California, Berkeley, November
-
M. D. Hill. Aspects of cache memory and instruction buffer performance. PhD thesis, University of California, Berkeley, November 1987.
-
(1987)
-
-
Hill, M.D.1
-
27
-
-
84983965442
-
An empirical study of FORTRAN programs
-
D. Knuth. An empirical study of FORTRAN programs. Software - Practice and Experience, 1:105-133, 1971.
-
(1971)
Software - Practice and Experience
, vol.1
, pp. 105-133
-
-
Knuth, D.1
-
28
-
-
1842849819
-
Choosing representatives slices of program execution for microarchitecture simulations: A preliminary application to the data stream
-
Kluwer Academic Publishers
-
T. Lafage and A. Seznec. Choosing representatives slices of program execution for microarchitecture simulations: a preliminary application to the data stream. In Workload Characterization of Emerging Applications, Kluwer Academic Publishers, 2000.
-
(2000)
Workload Characterization of Emerging Applications
-
-
Lafage, T.1
Seznec, A.2
-
29
-
-
84866874941
-
An evaluation of the potential benefits of register allocation for array references
-
Z. Li, J. Gu, and G. Lee. An evaluation of the potential benefits of register allocation for array references. In Workshop on Interaction between Compilers and Computer Archictures in conjunction with the HPCA-2, San Jose, California, February 1996.
-
Workshop on Interaction Between Compilers and Computer Archictures in Conjunction with the HPCA-2, San Jose, California, February 1996
-
-
Li, Z.1
Gu, J.2
Lee, G.3
-
30
-
-
0014701246
-
Evaluation techniques for storage hierarchies
-
R. L. Mattson, J. Gecsei, D. Slutz, and I. L. Traiger. Evaluation techniques for storage hierarchies. IBM System Journal, 9(2):78-117, 1970.
-
(1970)
IBM System Journal
, vol.9
, Issue.2
, pp. 78-117
-
-
Mattson, R.L.1
Gecsei, J.2
Slutz, D.3
Traiger, I.L.4
-
31
-
-
0003665539
-
Quantifying loop nest locality using SPEC'95 and the perfect benchmarks
-
K. S. McKinley and O. Temam. Quantifying loop nest locality using SPEC'95 and the perfect benchmarks. ACM Transactions on Computer Systems, 17(4):288-336, 1999.
-
(1999)
ACM Transactions on Computer Systems
, vol.17
, Issue.4
, pp. 288-336
-
-
McKinley, K.S.1
Temam, O.2
-
32
-
-
0034818669
-
Tools for application-oriented performance tuning
-
J. Mellor-Crummey, R. Fowler, and D. B. Whalley. Tools for application-oriented performance tuning. In Proceedings of the 15th ACM International Conference on Supercomputing, Sorrento, Italy, 2001.
-
Proceedings of the 15th ACM International Conference on Supercomputing, Sorrento, Italy, 2001
-
-
Mellor-Crummey, J.1
Fowler, R.2
Whalley, D.B.3
-
34
-
-
0006946256
-
Efficient methods for calculating the success function of fixed space replacement policies
-
Technical Report LBL-12370, Lawrence Berkeley Laboratory
-
F. Olken. Efficient methods for calculating the success function of fixed space replacement policies. Technical Report LBL-12370, Lawrence Berkeley Laboratory, 1981.
-
(1981)
-
-
Olken, F.1
-
35
-
-
84967068696
-
An inter-reference gap model for temporal locality in program behavior
-
V. Phalke and B. Gopinath. An inter-reference gap model for temporal locality in program behavior. In Proceedings of ACM SIGMETRICS Conference on Measurement and Modeling of Computer Systems, Ottawa, Ontario, Canada, 1995.
-
Proceedings of ACM SIGMETRICS Conference on Measurement and Modeling of Computer Systems, Ottawa, Ontario, Canada, 1995
-
-
Phalke, V.1
Gopinath, B.2
-
36
-
-
0036953769
-
Automatically characterizing large scale program behavior
-
T. Sherwood, E. Perelman, G. Hamerly, and B. Calder. Automatically characterizing large scale program behavior. In Proceedings of International Conference on Architectural Support for Programming Languages and Operating Systems, San Jose, CA, 2002.
-
Proceedings of International Conference on Architectural Support for Programming Languages and Operating Systems, San Jose, CA, 2002
-
-
Sherwood, T.1
Perelman, E.2
Hamerly, G.3
Calder, B.4
-
38
-
-
0036036127
-
A compiler approach to fast hardware design space exploration in FPGA-based systems
-
B. So, M. W. Hall, and P. C. Diniz. A compiler approach to fast hardware design space exploration in FPGA-based systems. In Proceedings of ACM SIGPLAN Conference on Programming Language Design and Implementation, Berlin, Germany, 2002.
-
Proceedings of ACM SIGPLAN Conference on Programming Language Design and Implementation, Berlin, Germany, 2002
-
-
So, B.1
Hall, M.W.2
Diniz, P.C.3
-
40
-
-
0038039924
-
Compile-time composition of run-time data and iteration reorderings
-
M. M. Strout, L. Carter, and J. Ferrante. Compile-time composition of run-time data and iteration reorderings. In Proceedings of ACM SIGPLAN Conference on Programming Language Design and Implementation, San Diego, CA, 2003.
-
Proceedings of ACM SIGPLAN Conference on Programming Language Design and Implementation, San Diego, CA, 2003
-
-
Strout, M.M.1
Carter, L.2
Ferrante, J.3
-
41
-
-
0013009642
-
Multi-configuration simulation algorithms for the evaluation of computer architecture designs
-
Technical report, University of Michigan
-
R. A. Sugumar and S. G. Abraham. Multi-configuration simulation algorithms for the evaluation of computer architecture designs. Technical report, University of Michigan, 1993.
-
(1993)
-
-
Sugumar, R.A.1
Abraham, S.G.2
-
42
-
-
0037882891
-
Cache management by the compiler
-
PhD thesis, Dept. of Computer Science, Rice University
-
K. O. Thabit. Cache Management by the Compiler. PhD thesis, Dept. of Computer Science, Rice University, 1981.
-
(1981)
-
-
Thabit, K.O.1
-
44
-
-
1442313436
-
Reuse distance analysis for scientific programs
-
Y. Zhong, C. Ding, and K. Kennedy. Reuse distance analysis for scientific programs. In Proceedings of Workshop on Languages, Compilers, and Run-time Systems for Scalable Computers, Washington DC, March 2002.
-
Proceedings of Workshop on Languages, Compilers, and Run-Time Systems for Scalable Computers, Washington DC, March 2002
-
-
Zhong, Y.1
Ding, C.2
Kennedy, K.3
|