-
2
-
-
0028055525
-
Predictability of load/store instruction latencies
-
Austin, TX, December
-
S. G. Abraham, R. A. Sugumar, D. Windheiser, B. R. Rau, and R. Gupta. Predictability of load/store instruction latencies. In Proceedings of the 26th International Symposium on Microarchitecture, Austin, TX, December 1993.
-
(1993)
Proceedings of the 26th International Symposium on Microarchitecture
-
-
Abraham, S.G.1
Sugumar, R.A.2
Windheiser, D.3
Rau, B.R.4
Gupta, R.5
-
3
-
-
0342323355
-
Sage++: An object-oriented toolkit and class library for building Fortran and C++ restructuring tools
-
F. Bodin, P. Beckman, D. Gannon, J. Gotwals, S. Narayana, S. Srinivas, and B. Winnicka. Sage++: An object-oriented toolkit and class library for building Fortran and C++ restructuring tools. In Second Object-Oriented Numerics Conference, 1994.
-
(1994)
Second Object-Oriented Numerics Conference
-
-
Bodin, F.1
Beckman, P.2
Gannon, D.3
Gotwals, J.4
Narayana, S.5
Srinivas, S.6
Winnicka, B.7
-
4
-
-
0026267802
-
An effective on-chip preloading scheme to reduce data access penalty
-
Albuquerque, NM, November
-
J-L. Baer and T-F. Chen. An effective on-chip preloading scheme to reduce data access penalty. In Proceedings of Supercomputing '91, Albuquerque, NM, November 1991.
-
(1991)
Proceedings of Supercomputing '91
-
-
Baer, J.-L.1
Chen, T.-F.2
-
5
-
-
0003003638
-
A study of replacement algorithms for a virtual-storage computer
-
L. A. Belady. A study of replacement algorithms for a virtual-storage computer. IBM Systems Journal, 5(2):79-101, 1966.
-
(1966)
IBM Systems Journal
, vol.5
, Issue.2
, pp. 79-101
-
-
Belady, L.A.1
-
6
-
-
0010232351
-
-
Technical Report 1216, Dept. of Computer Science, University of Wisconsin at Madison, January
-
D. Burger, J. R. Goodman, and A. Kägi. The declining effectiveness of dynamic caching for general-purpose microprocessors. Technical Report 1216, Dept. of Computer Science, University of Wisconsin at Madison, January 1995.
-
(1995)
The Declining Effectiveness of Dynamic Caching for General-purpose Microprocessors
-
-
Burger, D.1
Goodman, J.R.2
Kägi, A.3
-
7
-
-
0029666646
-
Memory bandwidth limitations of future microprocessors
-
Philadelphia, PA, May
-
D. Burger, A. Kägi, and J. R. Goodman. Memory bandwidth limitations of future microprocessors. In Proceedings of the 23rd International Symposium on Computer Architecture, Philadelphia, PA, May 1996.
-
(1996)
Proceedings of the 23rd International Symposium on Computer Architecture
-
-
Burger, D.1
Kägi, A.2
Goodman, J.R.3
-
8
-
-
0029308368
-
Effective hardware based data prefetching
-
May
-
T. Chen and J. Baer. Effective hardware based data prefetching. IEEE Transactions on Computers, 44(5):609-623, May 1995.
-
(1995)
IEEE Transactions on Computers
, vol.44
, Issue.5
, pp. 609-623
-
-
Chen, T.1
Baer, J.2
-
9
-
-
0025447908
-
Improving register allocation for subscripted variables
-
White Plains, NY, June
-
D. Callahan, S. Carr, and K. Kennedy. Improving register allocation for subscripted variables. In Proceedings of the SIGPLAN '90 Conference on Programming Language Design and Implementation, White Plains, NY, June 1990.
-
(1990)
Proceedings of the SIGPLAN '90 Conference on Programming Language Design and Implementation
-
-
Callahan, D.1
Carr, S.2
Kennedy, K.3
-
10
-
-
0001041754
-
Improving the ratio of memory operations to floating-point operations in loops
-
July
-
S. Carr and K. Kennedy. Improving the ratio of memory operations to floating-point operations in loops. ACM Transactions on Programming Languages and Systems, 15(3):400-462, July 1994.
-
(1994)
ACM Transactions on Programming Languages and Systems
, vol.15
, Issue.3
, pp. 400-462
-
-
Carr, S.1
Kennedy, K.2
-
11
-
-
0442285964
-
-
Technical Report CRPC-TR95519-S, Center for Research on Parallel Computation, Rice University, October
-
K. Cooper, K. Kennedy, and N. McIntosh. An empirical study of cross-loop reuse in the NAS benchmarks. Technical Report CRPC-TR95519-S, Center for Research on Parallel Computation, Rice University, October 1995.
-
(1995)
An Empirical Study of Cross-loop Reuse in the NAS Benchmarks
-
-
Cooper, K.1
Kennedy, K.2
McIntosh, N.3
-
12
-
-
0347151974
-
Cross-loop reuse analysis and its application to cache optimizations
-
Santa Clara, CA, August
-
K. Cooper, K. Kennedy, and N. McIntosh. Cross-loop reuse analysis and its application to cache optimizations. In Proceedings of the Ninth Workshop on Languages and Compilers for Parallel Computing, Santa Clara, CA, August 1996.
-
(1996)
Proceedings of the Ninth Workshop on Languages and Compilers for Parallel Computing
-
-
Cooper, K.1
Kennedy, K.2
McIntosh, N.3
-
13
-
-
84976722352
-
Software prefetching
-
Santa Clara, CA, April
-
D. Callahan, K. Kennedy, and A. Porterfield. Software prefetching. In Proceedings of the Fourth International Conference on Architectural Support for Programming Languages and Operating Systems, pages 40-52, Santa Clara, CA, April 1991.
-
(1991)
Proceedings of the Fourth International Conference on Architectural Support for Programming Languages and Operating Systems
, pp. 40-52
-
-
Callahan, D.1
Kennedy, K.2
Porterfield, A.3
-
14
-
-
0025022825
-
Supercomputer performance evaluation and the Perfect benchmarks
-
Amsterdam, The Netherlands, June
-
G. Cybenko, L. Kipp, L. Pointer, and D. Kuck. Supercomputer performance evaluation and the Perfect benchmarks. In Proceedings of the 1990 ACM International Conference on Supercomputing, Amsterdam, The Netherlands, June 1990.
-
(1990)
Proceedings of the 1990 ACM International Conference on Supercomputing
-
-
Cybenko, G.1
Kipp, L.2
Pointer, L.3
Kuck, D.4
-
16
-
-
85009364061
-
Compiler optimizations for improving data locality
-
San Jose, CA, October
-
S. Carr, K. S. McKinley, and C. Tseng. Compiler optimizations for improving data locality. In Proceedings of the Sixth International Conference on Architectural Support for Programming Languages and Operating Systems, San Jose, CA, October 1994.
-
(1994)
Proceedings of the Sixth International Conference on Architectural Support for Programming Languages and Operating Systems
-
-
Carr, S.1
McKinley, K.S.2
Tseng, C.3
-
19
-
-
0027640963
-
Cache performance of the SPEC92 benchmark suite
-
August
-
J. D. Gee, M. D. Hill, D. N. Pnevmatikatos, and A. J. Smith. Cache performance of the SPEC92 benchmark suite. IEEE Micro, 13(4):17-27, August 1993.
-
(1993)
IEEE Micro
, vol.13
, Issue.4
, pp. 17-27
-
-
Gee, J.D.1
Hill, M.D.2
Pnevmatikatos, D.N.3
Smith, A.J.4
-
20
-
-
0001366267
-
Strategies for cache and local memory management by global program transformation
-
October
-
D. Gannon, W. Jalby, and K. Gallivan. Strategies for cache and local memory management by global program transformation. Journal of Parallel and Distributed Computing, 5(5):587-616, October 1988.
-
(1988)
Journal of Parallel and Distributed Computing
, vol.5
, Issue.5
, pp. 587-616
-
-
Gannon, D.1
Jalby, W.2
Gallivan, K.3
-
21
-
-
0003789873
-
-
PhD thesis, Computer Science Dept., University of California, Berkeley, Available as Technical Report UCB/CSD 87/381
-
M. D. Hill. Aspects of Cache Memory and Instruction Buffer Performance. PhD thesis, Computer Science Dept., University of California, Berkeley, 1987. Available as Technical Report UCB/CSD 87/381.
-
(1987)
Aspects of Cache Memory and Instruction Buffer Performance
-
-
Hill, M.D.1
-
22
-
-
0024173488
-
A case for direct-mapped caches
-
December
-
M. D. Hill. A case for direct-mapped caches. IEEE Computer, 21(12):25-40, December 1988.
-
(1988)
IEEE Computer
, vol.21
, Issue.12
, pp. 25-40
-
-
Hill, M.D.1
-
24
-
-
0024903997
-
Evaluating associativity in CPU caches
-
December
-
M. D. Hill and A. J. Smith. Evaluating associativity in CPU caches. IEEE Transactions on Computers, 38(12):1612-1630, December 1989.
-
(1989)
IEEE Transactions on Computers
, vol.38
, Issue.12
, pp. 1612-1630
-
-
Hill, M.D.1
Smith, A.J.2
-
26
-
-
0025429331
-
Improving direct-mapped cache performance by the addition of a small fully-associative cache and prefetch buffers
-
Seattle, WA, June
-
N. P. Jouppi. Improving direct-mapped cache performance by the addition of a small fully-associative cache and prefetch buffers. In Proceedings of the 17th International Symposium on Computer Architecture, pages 364-373, Seattle, WA, June 1990.
-
(1990)
Proceedings of the 17th International Symposium on Computer Architecture
, pp. 364-373
-
-
Jouppi, N.P.1
-
28
-
-
0442270363
-
Cache based computer systems
-
March
-
K. R. Kaplan and R. O. Winder. Cache based computer systems. IEEE Computer, 6(3):30-36, March 1973.
-
(1973)
IEEE Computer
, vol.6
, Issue.3
, pp. 30-36
-
-
Kaplan, K.R.1
Winder, R.O.2
-
29
-
-
0026137116
-
The cache performance and optimizations of blocked algorithms
-
Santa Clara, CA, April
-
M. Lam, E. Rothberg, and M. E. Wolf. The cache performance and optimizations of blocked algorithms. In Proceedings of the Fourth International Conference on Architectural Support for Programming Languages and Operating Systems, Santa Clara, CA, April 1991.
-
(1991)
Proceedings of the Fourth International Conference on Architectural Support for Programming Languages and Operating Systems
-
-
Lam, M.1
Rothberg, E.2
Wolf, M.E.3
-
30
-
-
84976833735
-
Design and evaluation of a compiler algorithm for prefetching
-
Boston, MA, October
-
T. Mowry, M. Lam, and A. Gupta. Design and evaluation of a compiler algorithm for prefetching. In Proceedings of the Fifth International Conference on Architectural Support for Programming Languages and Operating Systems, pages 62-73, Boston, MA, October 1992.
-
(1992)
Proceedings of the Fifth International Conference on Architectural Support for Programming Languages and Operating Systems
, pp. 62-73
-
-
Mowry, T.1
Lam, M.2
Gupta, A.3
-
32
-
-
0023708930
-
Performance tradeoffs in cache design
-
Honolulu, HI, June
-
S. Przybylski, M. Horowitz, and J. Hennessy. Performance tradeoffs in cache design. In Proceedings of the 15th International Symposium on Computer Architecture, pages 290-298, Honolulu, HI, June 1988.
-
(1988)
Proceedings of the 15th International Symposium on Computer Architecture
, pp. 290-298
-
-
Przybylski, S.1
Horowitz, M.2
Hennessy, J.3
-
33
-
-
2842613617
-
A different kind of RISC
-
August
-
D. Pountain. A different kind of RISC. BYTE, August 1994.
-
(1994)
BYTE
-
-
Pountain, D.1
-
34
-
-
17044383224
-
Intel boosts Pentium Pro to 200 MHz
-
November
-
MicroDesign Resources. Intel boosts Pentium Pro to 200 MHz. Microprocessor Report, 9(17), November 1995.
-
(1995)
Microprocessor Report
, vol.9
, Issue.17
-
-
-
36
-
-
0020177251
-
Cache memories
-
September
-
A. J. Smith. Cache memories. Computing Surveys, 14(3):473-530, September 1982.
-
(1982)
Computing Surveys
, vol.14
, Issue.3
, pp. 473-530
-
-
Smith, A.J.1
-
37
-
-
0004802504
-
Bibliography and readings on CPU cache memories and related topics
-
January
-
A. J. Smith. Bibliography and readings on CPU cache memories and related topics. Computer Architecture News, 14(1):22-42, January 1986.
-
(1986)
Computer Architecture News
, vol.14
, Issue.1
, pp. 22-42
-
-
Smith, A.J.1
-
38
-
-
84939323181
-
Line (block) size choice for CPU caches
-
September
-
A. J. Smith. Line (block) size choice for CPU caches. IEEE Transactions on Computers, C-36(9):1063-1075, September 1987.
-
(1987)
IEEE Transactions on Computers
, vol.C-36
, Issue.9
, pp. 1063-1075
-
-
Smith, A.J.1
-
39
-
-
0042028057
-
Second bibliography on cache memories
-
June
-
A. J. Smith. Second bibliography on cache memories. Computer Architecture News, 19(4):154-182, June 1991.
-
(1991)
Computer Architecture News
, vol.19
, Issue.4
, pp. 154-182
-
-
Smith, A.J.1
-
41
-
-
0029508817
-
A new approach to cache management
-
Ann Arbor, MI, November
-
G. Tyson, M. Farrens, J. Matthews, and A. Pleszkun. A new approach to cache management. In Proceedings of the 28th International Symposium on Microarchitecture, Ann Arbor, MI, November 1995.
-
(1995)
Proceedings of the 28th International Symposium on Microarchitecture
-
-
Tyson, G.1
Farrens, M.2
Matthews, J.3
Pleszkun, A.4
-
42
-
-
0027764718
-
To copy or not to copy: A compile-time technique for assessing when data copying should be used to eliminate cache conflicts
-
Portland, OR, November
-
O. Temam, E. Granston, and W. Jalby. To copy or not to copy: A compile-time technique for assessing when data copying should be used to eliminate cache conflicts. In Proceedings of Supercomputing '93, Portland, OR, November 1993.
-
(1993)
Proceedings of Supercomputing '93
-
-
Temam, O.1
Granston, E.2
Jalby, W.3
-
43
-
-
0346206647
-
SPEC Benchmark Suite: Designed for today's advanced systems
-
SPEC, Fall
-
J. Uniejewski. SPEC Benchmark Suite: Designed for today's advanced systems. SPEC Newsletter Volume 1, Issue 1, SPEC, Fall 1989.
-
(1989)
SPEC Newsletter
, vol.1
, Issue.1
-
-
Uniejewski, J.1
-
44
-
-
84910652234
-
A model for estimating trace-sample miss ratios
-
San Diego, CA, May
-
D. A. Wood, M. D. Hill, and R. E. Kessler. A model for estimating trace-sample miss ratios. In Proceedings of the ACM SIGMETRICS Conference on Measurement & Modeling Computer Systems, pages 79-89, San Diego, CA, May 1991.
-
(1991)
Proceedings of the ACM SIGMETRICS Conference on Measurement & Modeling Computer Systems
, pp. 79-89
-
-
Wood, D.A.1
Hill, M.D.2
Kessler, R.E.3