-
3
-
-
0037688265
-
-
Architecture and Language Implementation Group, University of Massachusetts, Amherst. Scale Compiler infrastructure
-
Architecture and Language Implementation Group, University of Massachusetts, Amherst. Scale Compiler infrastructure. In http://ali-www.cs.umass.edu/Scale.
-
-
-
-
4
-
-
0003465202
-
-
Technical Report 1342, Computer Sciences Department, Uniersity of Wisconsin, June
-
D. Burger and T. M. Austin. The simplescalar tool set version 2.0. Technical Report 1342, Computer Sciences Department, Uniersity of Wisconsin, June 1997.
-
(1997)
The Simplescalar Tool Set Version 2.0
-
-
Burger, D.1
Austin, T.M.2
-
6
-
-
0038702611
-
Simple and effective array prefetching for Java
-
Seattle, WA, Nov.
-
B. Cahoon and K. S. McKinley. Simple and effective array prefetching for Java. In ACM Java Grande, pages 86-95, Seattle, WA, Nov. 2002.
-
(2002)
ACM Java Grande
, pp. 86-95
-
-
Cahoon, B.1
McKinley, K.S.2
-
7
-
-
0026138044
-
Software prefetching
-
Santa Clara, CA, Apr.
-
D. Callahan, K. Kennedy, and A. Porterfield. Software prefetching. In Proceedings of the Fourth International Conference on Architectural Support for Programming Languages and Operating Systems, pages 40-52, Santa Clara, CA, Apr. 1991.
-
(1991)
Proceedings of the Fourth International Conference on Architectural Support for Programming Languages and Operating Systems
, pp. 40-52
-
-
Callahan, D.1
Kennedy, K.2
Porterfield, A.3
-
8
-
-
84976831704
-
Compiler optimizations for improving data locality
-
San Jose, CA, Oct.
-
S. Carr, K. S. McKinley, and C. Tseng. Compiler optimizations for improving data locality. In Proceedings of the Sixth International Conference on Architectural Support for Programming Languages and Operating Systems, pages 252-262, San Jose, CA, Oct. 1994.
-
(1994)
Proceedings of the Sixth International Conference on Architectural Support for Programming Languages and Operating Systems
, pp. 252-262
-
-
Carr, S.1
McKinley, K.S.2
Tseng, C.3
-
9
-
-
0003758490
-
Generalized correlation-based hardware prefetching
-
Technical Report EECEG951, Cornell University, Feb.
-
M. Charney and A. Reeves. Generalized correlation-based hardware prefetching. Technical Report EE_CEG_95_1, Cornell University, Feb. 1995.
-
(1995)
-
-
Charney, M.1
Reeves, A.2
-
11
-
-
0029308368
-
Effective hardware based data prefetching
-
May
-
T. Chen and J. Baer. Effective hardware based data prefetching. IEEE Transactions on Computers, 44(5):609-623, May 1995.
-
(1995)
IEEE Transactions on Computers
, vol.44
, Issue.5
, pp. 609-623
-
-
Chen, T.1
Baer, J.2
-
13
-
-
0034839033
-
Speculative precomputation: Long-range prefetching of delinquent loads
-
June
-
J. D. Collins, H. Wang, D. M. Tullsen, C. Hughes, Y.-F. Lee, D. Lavery, and J. P. Shen. Speculative precomputation: Long-range prefetching of delinquent loads. In Proceedings of the 28th International Symposium on Computer Architecture, pages 14-25, June 2001.
-
(2001)
Proceedings of the 28th International Symposium on Computer Architecture
, pp. 14-25
-
-
Collins, J.D.1
Wang, H.2
Tullsen, D.M.3
Hughes, C.4
Lee, Y.-F.5
Lavery, D.6
Shen, J.P.7
-
14
-
-
0036949391
-
A stateless, content-directed data prefetching mechanism
-
San Jose, CA, October
-
R. Cooksey, S. Jordan, and D. Grunwald. A stateless, content-directed data prefetching mechanism. In Proceedings of the Tenth Annual International Conference on Architectural Support for Programming Languages and Operating Systems, pages 279-290, San Jose, CA, October 2002.
-
(2002)
Proceedings of the Tenth Annual International Conference on Architectural Support for Programming Languages and Operating Systems
, pp. 279-290
-
-
Cooksey, R.1
Jordan, S.2
Grunwald, D.3
-
15
-
-
84965078406
-
Fixed and adaptive sequential prefetching in shared-memory multiprocessors
-
St Charles, IL
-
F. Dahlgren, M. Dubois, and P. Stenstrom. Fixed and adaptive sequential prefetching in shared-memory multiprocessors. In Proceedings of the 1993 International Conference on Parallel Processing, pages 56-63, St Charles, IL, 1993.
-
(1993)
Proceedings of the 1993 International Conference on Parallel Processing
, pp. 56-63
-
-
Dahlgren, F.1
Dubois, M.2
Stenstrom, P.3
-
16
-
-
0038702612
-
Effectiveness of hardware-based stride and sequential prefetching in shared-memory multiprocessors
-
Raleigh, NC, Jan.
-
F. Dahlgren and P. Stenstrom. Effectiveness of hardware-based stride and sequential prefetching in shared-memory multiprocessors. In First International Symposium on High Performance Computer Architecture, pages 68-77, Raleigh, NC, Jan. 1995.
-
(1995)
First International Symposium on High Performance Computer Architecture
, pp. 68-77
-
-
Dahlgren, F.1
Stenstrom, P.2
-
17
-
-
0031611719
-
Precise miss analysis for program transformations with caches of arbitrary associativity
-
San Jose, CA, Oct.
-
S. Ghosh, M. Martonosi, and S. Malik. Precise miss analysis for program transformations with caches of arbitrary associativity. In Proceedings of the Eighth International Conference on Architectural Support for Programming Languages and Operating Systems, pages 228-239, San Jose, CA, Oct. 1998.
-
(1998)
Proceedings of the Eighth International Conference on Architectural Support for Programming Languages and Operating Systems
, pp. 228-239
-
-
Ghosh, S.1
Martonosi, M.2
Malik, S.3
-
19
-
-
0038364294
-
Memory-side prefetching for linked data structures
-
Technical Report UIUCDCS-R-2001-2221, University of Illinios, Urbana Champagne, May
-
C. J. Hughes and S. Adve. Memory-side prefetching for linked data structures. Technical Report UIUCDCS-R-2001-2221, University of Illinios, Urbana Champagne, May 2001.
-
(2001)
-
-
Hughes, C.J.1
Adve, S.2
-
22
-
-
0025429331
-
Improving direct-mapped cache performance by the addition of a small fully-associative cache and prefetch buffers
-
Seattle, WA, June
-
N. P. Jouppi. Improving direct-mapped cache performance by the addition of a small fully-associative cache and prefetch buffers. In Proceedings of the 17th International Symposium on Computer Architecture, pages 364-373, Seattle, WA, June 1990.
-
(1990)
Proceedings of the 17th International Symposium on Computer Architecture
, pp. 364-373
-
-
Jouppi, N.P.1
-
23
-
-
0034581346
-
A prefetching technique for irregular accesses to linked data structures
-
Toulouse, France, Jan.
-
M. Karlsson, F. Dahlgren, and P. Sternstrom. A prefetching technique for irregular accesses to linked data structures. In Sixth International Symposium on High Performance Computer Architecture, page 206, Toulouse, France, Jan. 2000.
-
(2000)
Sixth International Symposium on High Performance Computer Architecture
, pp. 206
-
-
Karlsson, M.1
Dahlgren, F.2
Sternstrom, P.3
-
27
-
-
0025235788
-
An overview of the SPHINX speech recognition system
-
K.-F. Lee, H.-W. Hon, and R. Reddy. An overview of the SPHINX speech recognition system. In IEEE Transactions on Acoustics, Speech and Signal Precessing, volume 38(1), pages 35-44, 1990.
-
(1990)
IEEE Transactions on Acoustics, Speech and Signal Precessing
, vol.38
, Issue.1
, pp. 35-44
-
-
Lee, K.-F.1
Hon, H.-W.2
Reddy, R.3
-
29
-
-
0029509984
-
SPAID: Software prefetching in pointer- and call-intensive environments
-
Nov.
-
M. H. Lipasti, W. J. Schmidt, S. R. Kunkel, and R. R. Roediger. SPAID: Software prefetching in pointer- and call-intensive environments. In Proceedings of the 28th Annual IEEE/ACM International Symposium on Microachitecture, pages 231-236, Nov. 1995.
-
(1995)
Proceedings of the 28th Annual IEEE/ACM International Symposium on Microachitecture
, pp. 231-236
-
-
Lipasti, M.H.1
Schmidt, W.J.2
Kunkel, S.R.3
Roediger, R.R.4
-
31
-
-
0034839064
-
Tolerating memory latency through software-controlled pre-execution on simultaneous multithreading processors
-
June
-
C.-K. Luk. Tolerating memory latency through software-controlled pre-execution on simultaneous multithreading processors. In Proceedings of the 28th International Symposium on Computer Architecture, pages 40-51, June 2001.
-
(2001)
Proceedings of the 28th International Symposium on Computer Architecture
, pp. 40-51
-
-
Luk, C.-K.1
-
32
-
-
0030190854
-
Improving data locality with loop transformations
-
July
-
K. S. McKinley, S. Carr, and C. Tseng. Improving data locality with loop transformations. ACM Transactions on Programming Languages and Systems, 18(4):424-453, July 1996.
-
(1996)
ACM Transactions on Programming Languages and Systems
, vol.18
, Issue.4
, pp. 424-453
-
-
McKinley, K.S.1
Carr, S.2
Tseng, C.3
-
33
-
-
0026918402
-
Design and evaluation of a compiler algorithm for prefetching
-
Boston, MA, Oct.
-
T. Mowry, M. S. Lam, and A. Gupta. Design and evaluation of a compiler algorithm for prefetching. In Proceedings of the Fifth International Conference on Architectural Support for Programming Languages and Operating Systems, pages 62-73, Boston, MA, Oct. 1992.
-
(1992)
Proceedings of the Fifth International Conference on Architectural Support for Programming Languages and Operating Systems
, pp. 62-73
-
-
Mowry, T.1
Lam, M.S.2
Gupta, A.3
-
37
-
-
0035182089
-
Basic block distribution analysis to find periodic behavior and simulation points in applications
-
Barcelona, Spain, Sept.
-
T. Sherwood, E. Perelman, and B. Calder. Basic block distribution analysis to find periodic behavior and simulation points in applications. In Proceedings of the 2001 International Conference on Parallel Architectures and Compilation Techniques, pages 3-14, Barcelona, Spain, Sept. 2001.
-
(2001)
Proceedings of the 2001 International Conference on Parallel Architectures and Compilation Techniques
, pp. 3-14
-
-
Sherwood, T.1
Perelman, E.2
Calder, B.3
-
38
-
-
0034462352
-
Predictor-directed stream buffers
-
Monterey, California, Dec.
-
T. Sherwood, S. Sair, and B. Calder. Predictor-directed stream buffers. In Proceedings of the 33rd International Symposium on Microarchitecture, pages 42-53, Monterey, California, Dec. 2000.
-
(2000)
Proceedings of the 33rd International Symposium on Microarchitecture
, pp. 42-53
-
-
Sherwood, T.1
Sair, S.2
Calder, B.3
-
39
-
-
0030692465
-
Hybrid compiler/hardware prefetching for multiprocessors using low-overhead cache miss traps
-
Bloomington, IL, Aug.
-
J. Skeppstedt and M. Dubois. Hybrid compiler/hardware prefetching for multiprocessors using low-overhead cache miss traps. In Proceedings of the 1997 International Conference on Parallel Processing. pages 298-307, Bloomington, IL, Aug. 1997.
-
(1997)
Proceedings of the 1997 International Conference on Parallel Processing
, pp. 298-307
-
-
Skeppstedt, J.1
Dubois, M.2
-
40
-
-
0020177251
-
Cache memories
-
Sept.
-
A. J. Smith. Cache memories. Computing Surveys, 14(3):473-530, Sept. 1982.
-
(1982)
Computing Surveys
, vol.14
, Issue.3
, pp. 473-530
-
-
Smith, A.J.1
-
45
-
-
0036036096
-
Efficient discovery of regular stride patterns in irregular programs and its use in compiler prefetching
-
Berlin, Germany, June
-
Y. Wu. Efficient discovery of regular stride patterns in irregular programs and its use in compiler prefetching. In Proceedings of the SIGPLAN 2002 Conference on Programming Language Design and Implementation, pages 210-221, Berlin, Germany, June 2002.
-
(2002)
Proceedings of the SIGPLAN 2002 Conference on Programming Language Design and Implementation
, pp. 210-221
-
-
Wu, Y.1
-
47
-
-
0038702623
-
Speeding up irregular applications in shared memory multiprocessors: Memory binding and group prefetching
-
Santa Margherita Ligure, Italy, June
-
Z. Zhang and T. Torrellas. Speeding up irregular applications in shared memory multiprocessors: Memory binding and group prefetching. In Proceedings of the 22nd International Symposium on Computer Architecture, pages 1-19, Santa Margherita Ligure, Italy, June 1995.
-
(1995)
Proceedings of the 22nd International Symposium on Computer Architecture
, pp. 1-19
-
-
Zhang, Z.1
Torrellas, T.2
|