-
2
-
-
34548021671
-
Performance driven data cache prefetching in a dynamic software optimization system
-
J. Beyler and P. Clauss. Performance driven data cache prefetching in a dynamic software optimization system. In International Conference on Supercomputing, pages 202-209, 2007.
-
(2007)
International Conference on Supercomputing
, pp. 202-209
-
-
Beyler, J.1
Clauss, P.2
-
3
-
-
0003741020
-
-
PhD thesis, Princeton University, Princeton, NJ, USA
-
M. C. Carlisle. Olden: parallelizing programs with dynamic data structures on distributed-memory machines. PhD thesis, Princeton University, Princeton, NJ, USA, 1996.
-
(1996)
Olden: Parallelizing Programs with Dynamic Data Structures on Distributed-memory Machines
-
-
Carlisle, M.C.1
-
6
-
-
84948959230
-
Pointer cache assisted prefetching
-
Los Alamitos, CA, USA, IEEE Computer Society Press
-
J. Collins, S. Sair, B. Calder, and D. M. Tullsen. Pointer cache assisted prefetching. In MICRO 35: Proceedings of the 35th annual ACM/IEEE international symposium on Microarchitecture, pages 62-73, Los Alamitos, CA, USA, 2002. IEEE Computer Society Press.
-
(2002)
MICRO 35: Proceedings of the 35th Annual ACM/IEEE International Symposium on Microarchitecture
, pp. 62-73
-
-
Collins, J.1
Sair, S.2
Calder, B.3
Tullsen, D.M.4
-
7
-
-
0036949391
-
A stateless, content-directed data prefetching mechanism
-
New York, NY, USA, ACM Press
-
R. Cooksey, S. Jourdan, and D. Grunwald. A stateless, content-directed data prefetching mechanism. In ASPLOS-X: Proceedings of the 10th international conference on Architectural support for programming languages and operating systems, pages 279-290, New York, NY, USA, 2002. ACM Press.
-
(2002)
ASPLOS-X: Proceedings of the 10th International Conference on Architectural Support for Programming Languages and Operating Systems
, pp. 279-290
-
-
Cooksey, R.1
Jourdan, S.2
Grunwald, D.3
-
8
-
-
0030679080
-
Memory-system design considerations for dynamically-scheduled processors
-
New York, NY, USA, ACM Press
-
K. I. Farkas, P. Chow, N. P. Jouppi, and Z. Vranesic. Memory-system design considerations for dynamically-scheduled processors. In ISCA '97: Proceedings of the 24th annual international symposium on Computer architecture, pages 133-143, New York, NY, USA, 1997. ACM Press.
-
(1997)
ISCA '97: Proceedings of the 24th Annual International Symposium on Computer Architecture
, pp. 133-143
-
-
Farkas, K.I.1
Chow, P.2
Jouppi, N.P.3
Vranesic, Z.4
-
9
-
-
0034226001
-
SPEC CPU2000: Measuring CPU Performance in the New Millenium
-
July
-
J. Henning. SPEC CPU2000: Measuring CPU Performance in the New Millenium. IEEE Computer, 33(7):28-35, July 2000.
-
(2000)
IEEE Computer
, vol.33
, Issue.7
, pp. 28-35
-
-
Henning, J.1
-
10
-
-
40349103955
-
Memory prefetching using adaptive stream detection
-
Washington, DC, USA, IEEE Computer Society
-
I. Hur and C. Lin. Memory prefetching using adaptive stream detection. In MICRO 39: Proceedings of the 39th Annual IEEE/ACM International Symposium on Microarchitecture, pages 397-408, Washington, DC, USA, 2006. IEEE Computer Society.
-
(2006)
MICRO 39: Proceedings of the 39th Annual IEEE/ACM International Symposium on Microarchitecture
, pp. 397-408
-
-
Hur, I.1
Lin, C.2
-
11
-
-
1442337814
-
Stride prefetching by dynamically inspecting objects
-
New York, NY, USA, ACM Press
-
T. Inagaki, T. Onodera, H. Komatsu, and T. Nakatani. Stride prefetching by dynamically inspecting objects. In PLDI '03: Proceedings of the ACM SIGPLAN 2003 conference on Programming language design and implementation, pages 269-277, New York, NY, USA, 2003. ACM Press.
-
(2003)
PLDI '03: Proceedings of the ACM SIGPLAN 2003 Conference on Programming Language Design and Implementation
, pp. 269-277
-
-
Inagaki, T.1
Onodera, T.2
Komatsu, H.3
Nakatani, T.4
-
12
-
-
0030677583
-
Prefetching using markov predictors
-
New York, NY, USA, ACM Press
-
D. Joseph and D. Grunwald. Prefetching using markov predictors. In ISCA '97: Proceedings of the 24th annual international symposium on Computer architecture, pages 252-263, New York, NY, USA, 1997. ACM Press.
-
(1997)
ISCA '97: Proceedings of the 24th Annual International Symposium on Computer Architecture
, pp. 252-263
-
-
Joseph, D.1
Grunwald, D.2
-
13
-
-
0025429331
-
Improving direct-mapped cache performance by the addition of a small fully-associative cache and prefetch buffers
-
New York, NY, USA, ACM Press
-
N. P. Jouppi. Improving direct-mapped cache performance by the addition of a small fully-associative cache and prefetch buffers. In ISCA '90: Proceedings of the 17th annual international symposium on Computer Architecture, pages 364-373, New York, NY, USA, 1990. ACM Press.
-
(1990)
ISCA '90: Proceedings of the 17th Annual International Symposium on Computer Architecture
, pp. 364-373
-
-
Jouppi, N.P.1
-
14
-
-
3042569221
-
Physical experimentation with prefetching helper threads on intel's hyper-threaded processors
-
Washington, DC, USA, IEEE Computer Society
-
D. Kim, S. S. wei Liao, P. H. Wang, J. del Cuvillo, X. Tian, X. Zou, H. Wang, D. Yeung, M. Girkar, and J. P. Shen. Physical experimentation with prefetching helper threads on intel's hyper-threaded processors. In CGO '04: Proceedings of the international symposium on Code generation and optimization, page 27, Washington, DC, USA, 2004. IEEE Computer Society.
-
(2004)
CGO '04: Proceedings of the International Symposium on Code Generation and Optimization
, pp. 27
-
-
Kim, D.1
Wei Liao, S.S.2
Wang, P.H.3
Del Cuvillo, J.4
Tian, X.5
Zou, X.6
Wang, H.7
Yeung, D.8
Girkar, M.9
Shen, J.P.10
-
15
-
-
0029509984
-
Spaid: Software prefetching in pointer- and call-intensive environments
-
Los Alamitos, CA, USA, IEEE Computer Society Press
-
M. H. Lipasti, W. J. Schmidt, S. R. Kunkel, and R. R. Roediger. Spaid: software prefetching in pointer- and call-intensive environments. In MICRO 28: Proceedings of the 28th annual international symposium on Microarchitecture, pages 231-236, Los Alamitos, CA, USA, 1995. IEEE Computer Society Press.
-
(1995)
MICRO 28: Proceedings of the 28th Annual International Symposium on Microarchitecture
, pp. 231-236
-
-
Lipasti, M.H.1
Schmidt, W.J.2
Kunkel, S.R.3
Roediger, R.R.4
-
16
-
-
33749382556
-
Dynamic helper threaded prefetching on the sun ultrasparc cmp processor
-
Washington, DC, USA, IEEE Computer Society
-
J. Lu, A. Das, W.-C. Hsu, K. Nguyen, and S. G. Abraham. Dynamic helper threaded prefetching on the sun ultrasparc cmp processor. In MICRO 38: Proceedings of the 38th annual IEEE/ACM International Symposium on Microarchitecture, pages 93-104, Washington, DC, USA, 2005. IEEE Computer Society.
-
(2005)
MICRO 38: Proceedings of the 38th Annual IEEE/ACM International Symposium on Microarchitecture
, pp. 93-104
-
-
Lu, J.1
Das, A.2
Hsu, W.-C.3
Nguyen, K.4
Abraham, S.G.5
-
17
-
-
3042613777
-
Ispike: A post-link optimizer for the intel®itanium® architecture
-
Washington, DC, USA, IEEE Computer Society
-
C.-K. Luk, R. Muth, H. Patil, R. Cohn, and G. Lowney. Ispike: A post-link optimizer for the intel®itanium®architecture. In CGO '04: Proceedings of the international symposium on Code generation and optimization, page 15, Washington, DC, USA, 2004. IEEE Computer Society.
-
(2004)
CGO '04: Proceedings of the International Symposium on Code Generation and Optimization
, pp. 15
-
-
Luk, C.-K.1
Muth, R.2
Patil, H.3
Cohn, R.4
Lowney, G.5
-
18
-
-
0026918402
-
Design and evaluation of a compiler algorithm for prefetching
-
T. C. Mowry, M. S. Lam, and A. Gupta. Design and evaluation of a compiler algorithm for prefetching. SIGPLAN Notices, 27(9):62-73, 1992.
-
(1992)
SIGPLAN Notices
, vol.27
, Issue.9
, pp. 62-73
-
-
Mowry, T.C.1
Lam, M.S.2
Gupta, A.3
-
19
-
-
2342644731
-
Data cache prefetching using a global history buffer
-
Washington, DC, USA, IEEE Computer Society
-
K. J. Nesbit and J. E. Smith. Data cache prefetching using a global history buffer. In HPCA '04: Proceedings of the 10th International Symposium on High Performance Computer Architecture, page 96, Washington, DC, USA, 2004. IEEE Computer Society.
-
(2004)
HPCA '04: Proceedings of the 10th International Symposium on High Performance Computer Architecture
, pp. 96
-
-
Nesbit, K.J.1
Smith, J.E.2
-
20
-
-
0028294834
-
Evaluating stream buffers as a secondary cache replacement
-
Los Alamitos, CA, USA, IEEE Computer Society Press
-
S. Palacharla and R. E. Kessler. Evaluating stream buffers as a secondary cache replacement. In ISCA '94: Proceedings of the 21ST annual international symposium on Computer architecture, pages 24-33, Los Alamitos, CA, USA, 1994. IEEE Computer Society Press.
-
(1994)
ISCA '94: Proceedings of the 21ST Annual International Symposium on Computer Architecture
, pp. 24-33
-
-
Palacharla, S.1
Kessler, R.E.2
-
21
-
-
0031600692
-
Dependence based prefetching for linked data structures
-
New York, NY, USA, ACM Press
-
A. Roth, A. Moshovos, and G. S. Sohi. Dependence based prefetching for linked data structures. In ASPLOS-VIII: Proceedings of the eighth international conference on Architectural support for programming languages and operating systems, pages 115-126, New York, NY, USA, 1998. ACM Press.
-
(1998)
ASPLOS-VIII: Proceedings of the Eighth International Conference on Architectural Support for Programming Languages and Operating Systems
, pp. 115-126
-
-
Roth, A.1
Moshovos, A.2
Sohi, G.S.3
-
22
-
-
0032662988
-
Effective jump-pointer prefetching for linked data structures
-
Washington, DC, USA, IEEE Computer Society
-
A. Roth and G. S. Sohi. Effective jump-pointer prefetching for linked data structures. In ISCA '99: Proceedings of the 26th annual international symposium on Computer architecture, pages 111-121, Washington, DC, USA, 1999. IEEE Computer Society.
-
(1999)
ISCA '99: Proceedings of the 26th Annual International Symposium on Computer Architecture
, pp. 111-121
-
-
Roth, A.1
Sohi, G.S.2
-
23
-
-
79955884420
-
Quantifying load stream behavior
-
S. Sair, T. Sherwood, and B. Calder. Quantifying load stream behavior. In HPCA, pages 197-, 2002.
-
(2002)
HPCA
, pp. 197
-
-
Sair, S.1
Sherwood, T.2
Calder, B.3
-
24
-
-
0041358632
-
Prefetching in supercomputer instruction caches
-
Los Alamitos, CA, USA, IEEE Computer Society Press
-
J. E. Smith and W.-C. Hsu. Prefetching in supercomputer instruction caches. In Supercomputing '92: Proceedings of the 1992 ACM/IEEE conference on Supercomputing, pages 588-597, Los Alamitos, CA, USA, 1992. IEEE Computer Society Press.
-
(1992)
Supercomputing '92: Proceedings of the 1992 ACM/IEEE Conference on Supercomputing
, pp. 588-597
-
-
Smith, J.E.1
Hsu, W.-C.2
-
25
-
-
0036036096
-
Efficient discovery of regular stride patterns in irregular programs and its use in compiler prefetching
-
New York, NY, USA, ACM Press
-
Y. Wu. Efficient discovery of regular stride patterns in irregular programs and its use in compiler prefetching. In PLDI '02: Proceedings of the ACM SIGPLAN 2002 Conference on Programming language design and implementation, pages 210-221, New York, NY, USA, 2002. ACM Press.
-
(2002)
PLDI '02: Proceedings of the ACM SIGPLAN 2002 Conference on Programming Language Design and Implementation
, pp. 210-221
-
-
Wu, Y.1
-
26
-
-
0029199163
-
Speeding up irregular applications in shared-memory multiprocessors: Memory binding and group prefetching
-
New York, NY, USA, ACM Press
-
Z. Zhang and J. Torrellas. Speeding up irregular applications in shared-memory multiprocessors: memory binding and group prefetching. In ISCA '95: Proceedings of the 22nd annual international symposium on Computer architecture, pages 188-199, New York, NY, USA, 1995. ACM Press.
-
(1995)
ISCA '95: Proceedings of the 22nd Annual International Symposium on Computer Architecture
, pp. 188-199
-
-
Zhang, Z.1
Torrellas, J.2
-
27
-
-
22944462650
-
Enhancing memory-level parallelism via recovery-free value prediction
-
H. Zhou and T. M. Conte. Enhancing memory-level parallelism via recovery-free value prediction. IEEE Trans. Comput., 54(7):897-912, 2005.
-
(2005)
IEEE Trans. Comput.
, vol.54
, Issue.7
, pp. 897-912
-
-
Zhou, H.1
Conte, T.M.2
|