-
1
-
-
8344261610
-
Prefetch injection based on hardware monitoring and object metadata
-
A.-R. Adl-Tabatabai et al. Prefetch injection based on hardware monitoring and object metadata. In PLDI, 2004.
-
(2004)
PLDI
-
-
Adl-Tabatabai, A.-R.1
-
2
-
-
33646389168
-
Compiler directed contentaware prefetching for dynamic data structures
-
H. Al-Sukhni, I. Bratt, and D. A. Connors. Compiler directed contentaware prefetching for dynamic data structures. In PACT-12, 2003.
-
(2003)
PACT-12
-
-
Al-Sukhni, H.1
Bratt, I.2
Connors, D.A.3
-
3
-
-
64949108235
-
Structural variation detection using high-throughput sequencing
-
C. Alkan et al. Structural variation detection using high-throughput sequencing. In Pacific Symposium on Biocomputing, 2008.
-
(2008)
Pacific Symposium on Biocomputing
-
-
Alkan, C.1
-
4
-
-
0034844454
-
Data prefetching by dependence graph precomputation
-
M. Annavaram et al. Data prefetching by dependence graph precomputation. In ISCA-29, 2001.
-
(2001)
ISCA-29
-
-
Annavaram, M.1
-
5
-
-
0003758490
-
Generalized correlation-based hardware prefetching
-
Technical Report EE-CEG-95-1, Cornell Univ
-
M. J. Charney and A. P. Reeves. Generalized correlation-based hardware prefetching. Technical Report EE-CEG-95-1, Cornell Univ., 1995.
-
(1995)
-
-
Charney, M.J.1
Reeves, A.P.2
-
6
-
-
0034839033
-
Speculative precomputation: Long-range prefetching of delinquent loads
-
J. D. Collins et al. Speculative precomputation: long-range prefetching of delinquent loads. In ISCA-28, 2001.
-
(2001)
ISCA-28
-
-
Collins, J.D.1
-
9
-
-
0036949391
-
A stateless, content-directed data prefetching mechanism
-
R. Cooksey, S. Jourdan, and D. Grunwald. A stateless, content-directed data prefetching mechanism. In ASPLOS-X, 2002.
-
(2002)
ASPLOS-X
-
-
Cooksey, R.1
Jourdan, S.2
Grunwald, D.3
-
10
-
-
64949087147
-
-
Inside Intel Core Microarchitecture and Smart Memory Access, White Paper. Intel, Jul
-
J. Doweck. Inside Intel Core Microarchitecture and Smart Memory Access - White Paper. Intel, Jul 2006.
-
(2006)
-
-
Doweck, J.1
-
12
-
-
0007096183
-
Buffer block prefetching method
-
July
-
J. D. Gindele. Buffer block prefetching method. IBM Technical Disclosure Bulletin, 20(2):696-697, July 1977.
-
(1977)
IBM Technical Disclosure Bulletin
, vol.20
, Issue.2
, pp. 696-697
-
-
Gindele, J.D.1
-
13
-
-
0003278283
-
The microarchitecture of the Pentium 4 processor
-
Feb, Issue
-
G. Hinton et al. The microarchitecture of the Pentium 4 processor. Intel Technology Journal, Feb. 2001. Q1 2001 Issue.
-
(2001)
Intel Technology Journal
-
-
Hinton, G.1
-
14
-
-
0029666630
-
Informingmemory operations: Providing memory performance feedback in modern processors
-
M. Horowitz et al. Informingmemory operations: providing memory performance feedback in modern processors. In ISCA-23, 1996.
-
(1996)
ISCA-23
-
-
Horowitz, M.1
-
16
-
-
2342644731
-
Data cache prefetching using a global history buffer
-
K. J.Nesbit and J. E.Smith. Data cache prefetching using a global history buffer. In HPCA-10, 2004.
-
(2004)
HPCA-10
-
-
Nesbit, K.J.1
Smith, J.E.2
-
17
-
-
0030677583
-
Prefetching using Markov predictors
-
D. Joseph and D. Grunwald. Prefetching using Markov predictors. In ISCA-24, 1997.
-
(1997)
ISCA-24
-
-
Joseph, D.1
Grunwald, D.2
-
18
-
-
0025429331
-
Improving direct-mapped cache performance by the addition of a small fully-associative cache and prefetch buffers
-
N. Jouppi. Improving direct-mapped cache performance by the addition of a small fully-associative cache and prefetch buffers. In ISCA-17, 1990.
-
(1990)
ISCA-17
-
-
Jouppi, N.1
-
19
-
-
85008031236
-
MinneSPEC: A new SPEC benchmark workload for simulation-based computer architecture research
-
A. KleinOsowski and D. Lilja. MinneSPEC: A new SPEC benchmark workload for simulation-based computer architecture research. Comp Arch Letters, 2002.
-
(2002)
Comp Arch Letters
-
-
KleinOsowski, A.1
Lilja, D.2
-
20
-
-
0034851536
-
Dead-block prediction and dead-block correlating prefetchers
-
A.-C. Lai, C. Fide, and B. Falsafi. Dead-block prediction and dead-block correlating prefetchers. In ISCA-28, 2001.
-
(2001)
ISCA-28
-
-
Lai, A.-C.1
Fide, C.2
Falsafi, B.3
-
21
-
-
0020765079
-
A real-time garbage collector based on the lifetimes of objects
-
June
-
H. Lieberman and C. Hewitt. A real-time garbage collector based on the lifetimes of objects. ACM Communications, 26, June 1983.
-
(1983)
ACM Communications
, vol.26
-
-
Lieberman, H.1
Hewitt, C.2
-
22
-
-
33749403197
-
SPAID: Software prefetching in pointer- and callintensive environments
-
M. H. Lipasti et al. SPAID: Software prefetching in pointer- and callintensive environments. In MICRO-28, 1995.
-
(1995)
MICRO-28
-
-
Lipasti, M.H.1
-
23
-
-
0034839064
-
Tolerating memory latency through software-controlled preexecution in simultaneous multithreading processors
-
C.-K. Luk. Tolerating memory latency through software-controlled preexecution in simultaneous multithreading processors. In ISCA, 2001.
-
(2001)
ISCA
-
-
Luk, C.-K.1
-
24
-
-
0003101188
-
Compiler-based prefetching for recursive data structures
-
C.-K. Luk and T. C. Mowry. Compiler-based prefetching for recursive data structures. In ASPLOS-7, 1996.
-
(1996)
ASPLOS-7
-
-
Luk, C.-K.1
Mowry, T.C.2
-
25
-
-
84962144701
-
Balancing throughput and fairness in SMT processors
-
K. Luo, J. Gummaraju, and M. Franklin. Balancing throughput and fairness in SMT processors. In ISPASS, 2001.
-
(2001)
ISPASS
-
-
Luo, K.1
Gummaraju, J.2
Franklin, M.3
-
26
-
-
33644917917
-
Address-value delta (AVD) prediction: Increasing the effectiveness of runahead execution by exploting regular memory allocation patterns
-
O. Mutlu et al. Address-value delta (AVD) prediction: Increasing the effectiveness of runahead execution by exploting regular memory allocation patterns. In MICRO-38, 2005.
-
(2005)
MICRO-38
-
-
Mutlu, O.1
-
28
-
-
17644388982
-
Microlib: A case for the quantitative comparison of micro-architecture mechanisms
-
D. G. Perez et al. Microlib: A case for the quantitative comparison of micro-architecture mechanisms. In MICRO-37, 2004.
-
(2004)
MICRO-37
-
-
Perez, D.G.1
-
29
-
-
0029273301
-
Supporting dynamic data structures on distributed memory machines
-
Mar
-
A. Rogers et al. Supporting dynamic data structures on distributed memory machines. ACM TOPLAS, 17(2), Mar. 1995.
-
(1995)
ACM TOPLAS
, vol.17
, Issue.2
-
-
Rogers, A.1
-
30
-
-
0031600692
-
Dependence based prefetching for linked data structures
-
A. Roth, A. Moshovos, and G. S. Sohi. Dependence based prefetching for linked data structures. In ASPLOS-8, 1998.
-
(1998)
ASPLOS-8
-
-
Roth, A.1
Moshovos, A.2
Sohi, G.S.3
-
31
-
-
0032662988
-
Effective jump-pointer prefetching for linked data structures
-
A. Roth and G. S. Sohi. Effective jump-pointer prefetching for linked data structures. In ISCA-26, 1999.
-
(1999)
ISCA-26
-
-
Roth, A.1
Sohi, G.S.2
-
33
-
-
0034443570
-
Symbiotic job scheduling for a simultaneous multithreading processor
-
A. Snavely and D. M. Tullsen. Symbiotic job scheduling for a simultaneous multithreading processor. In ASPLOS-IX, 2000.
-
(2000)
ASPLOS-IX
-
-
Snavely, A.1
Tullsen, D.M.2
-
34
-
-
0036296856
-
Using a user-level memory thread for correlation prefetching
-
Y. Solihin, J. Lee, and J. Torrellas. Using a user-level memory thread for correlation prefetching. In ISCA-29, 2002.
-
(2002)
ISCA-29
-
-
Solihin, Y.1
Lee, J.2
Torrellas, J.3
-
35
-
-
33845894426
-
Spatial memory streaming
-
S. Somogyi et al. Spatial memory streaming. In ISCA-33, 2006.
-
(2006)
ISCA-33
-
-
Somogyi, S.1
-
36
-
-
34547655822
-
Feedback directed prefetching: Improving the performance and bandwidth-efficiency of hardware prefetchers
-
S. Srinath et al. Feedback directed prefetching: Improving the performance and bandwidth-efficiency of hardware prefetchers. In HPCA, 2007.
-
(2007)
HPCA
-
-
Srinath, S.1
-
37
-
-
16244410220
-
A static filter for reducing prefetch traffic
-
Technical Report CSE-TR-400-99, University of Michigan
-
V. Srinivasan et al. A static filter for reducing prefetch traffic. Technical Report CSE-TR-400-99, University of Michigan, 1999.
-
(1999)
-
-
Srinivasan, V.1
-
38
-
-
64949101828
-
-
J. Tendler, S. Dodson, S. Fields, H. Le, and B. Sinharoy. POWER4 system microarchitecture. IBM Technical White Paper, Oct. 2001.
-
J. Tendler, S. Dodson, S. Fields, H. Le, and B. Sinharoy. POWER4 system microarchitecture. IBM Technical White Paper, Oct. 2001.
-
-
-
-
39
-
-
0038345683
-
Guided region prefetching: A cooperative hardware/software approach
-
Z. Wang et al. Guided region prefetching: a cooperative hardware/software approach. In ISCA-30, 2003.
-
(2003)
ISCA-30
-
-
Wang, Z.1
-
40
-
-
0033705677
-
Push vs. pull: Data movement for linked data structures
-
C.-L. Yang and A. R. Lebeck. Push vs. pull: Data movement for linked data structures. In ICS-2000, 2000.
-
(2000)
ICS-2000
-
-
Yang, C.-L.1
Lebeck, A.R.2
-
41
-
-
84944748972
-
A hardware-based cache pollution filtering mechanism for aggressive prefetches
-
X. Zhuang and H.-H. S. Lee. A hardware-based cache pollution filtering mechanism for aggressive prefetches. In ICPP-32, 2003.
-
(2003)
ICPP-32
-
-
Zhuang, X.1
Lee, H.-H.S.2
-
42
-
-
0012525243
-
Benchmark health considered harmful
-
C. Zilles. Benchmark health considered harmful. Computer Architecture News, 29(3), 2001.
-
(2001)
Computer Architecture News
, vol.29
, Issue.3
-
-
Zilles, C.1
-
43
-
-
0034856097
-
Execution-based prediction using speculative slices
-
C. Zilles and G. Sohi. Execution-based prediction using speculative slices. In ISCA-28, 2001.
-
(2001)
ISCA-28
-
-
Zilles, C.1
Sohi, G.2
|