-
1
-
-
34548021671
-
Performance driven data cache prefetching in a dynamic software optimization system
-
J. C. Beyler and P. Clauss. Performance Driven Data Cache Prefetching in a Dynamic Software Optimization System. In ICS, 2007.
-
(2007)
ICS
-
-
Beyler, J.C.1
Clauss, P.2
-
2
-
-
0034839033
-
Speculative precomputation: Long-range prefetching of delinquent loads
-
J. D. Collins, H. Wang, D. M. Tullsen, C. Hughes, Y.-F. Lee, D. Lavery, and J. P. Shen. Speculative Precomputation: Long-range Prefetching of Delinquent Loads. In ISCA, 2001.
-
(2001)
ISCA
-
-
Collins, J.D.1
Wang, H.2
Tullsen, D.M.3
Hughes, C.4
Lee, Y.-F.5
Lavery, D.6
Shen, J.P.7
-
4
-
-
77952570425
-
StatStack: Efficient modeling of LRU caches
-
D. Eklov and E. Hagersten. StatStack: Efficient Modeling of LRU caches. In ISPASS, 2010.
-
(2010)
ISPASS
-
-
Eklov, D.1
Hagersten, E.2
-
5
-
-
84859463353
-
When prefetching works, when it doesn't, and why
-
Mar.
-
J. Lee, H. Kim, and R. Vuduc. When Prefetching Works, When It Doesn't, and Why. ACM TACO, 9(1), Mar. 2012.
-
(2012)
ACM TACO
, vol.9
, Issue.1
-
-
Lee, J.1
Kim, H.2
Vuduc, R.3
-
7
-
-
67650020024
-
The performance of runtime data cache prefetching in a dynamic optimization system
-
J. Lu, H. Chen, R. Fu, W.-C. Hsu, B. Othmer, P.-C. Yew, and D.-Y. Chen. The Performance of Runtime Data Cache Prefetching in a Dynamic Optimization System. In MICRO, 2003.
-
(2003)
MICRO
-
-
Lu, J.1
Chen, H.2
Fu, R.3
Hsu, W.-C.4
Othmer, B.5
Yew, P.-C.6
Chen, D.-Y.7
-
8
-
-
31944440969
-
Pin: Building customized program analysis tools with dynamic instrumentation
-
C.-K. Luk, R. Cohn, R. Muth, H. Patil, A. Klauser, G. Lowney, S. Wallace, V. J. Reddi, and K. Hazelwood. Pin: Building Customized Program Analysis Tools with Dynamic Instrumentation. In PLDI, 2005.
-
(2005)
PLDI
-
-
Luk, C.-K.1
Cohn, R.2
Muth, R.3
Patil, H.4
Klauser, A.5
Lowney, G.6
Wallace, S.7
Reddi, V.J.8
Hazelwood, K.9
-
9
-
-
3042613777
-
Ispike: A post-link optimizer for the intel itanium architecture
-
C.-K. Luk, R. Muth, H. Patil, R. Cohn, and G. Lowney. Ispike: A Post-link Optimizer for the Intel Itanium Architecture. In CGO, 2004.
-
(2004)
CGO
-
-
Luk, C.-K.1
Muth, R.2
Patil, H.3
Cohn, R.4
Lowney, G.5
-
10
-
-
0036375948
-
Profile-guided post-link stride prefetching
-
C.-K. Luk, R. Muth, H. Patil, R. Weiss, P. G. Lowney, and R. Cohn. Profile-Guided Post-Link Stride Prefetching. In ICS, 2002.
-
(2002)
ICS
-
-
Luk, C.-K.1
Muth, R.2
Patil, H.3
Weiss, R.4
Lowney, P.G.5
Cohn, R.6
-
11
-
-
67650568324
-
Scenario based optimization: A framework for statically enabling online optimizations
-
J. Mars and R. Hundt. Scenario Based Optimization: A Framework for Statically Enabling Online Optimizations. In CGO, pages 169-179, 2009.
-
(2009)
CGO
, pp. 169-179
-
-
Mars, J.1
Hundt, R.2
-
12
-
-
0026918402
-
Design and evaluation of a compiler algorithm for prefetching
-
T. C. Mowry, M. S. Lam, and A. Gupta. Design and evaluation of a compiler algorithm for prefetching. In ASPLOS, 1992.
-
(1992)
ASPLOS
-
-
Mowry, T.C.1
Lam, M.S.2
Gupta, A.3
-
16
-
-
70450285524
-
Scaling the bandwidth wall: Challenges in and avenues for CMP scaling
-
B. M. Rogers, A. Krishna, G. B. Bell, K. Vu, X. Jiang, and Y. Solihin. Scaling the Bandwidth Wall: Challenges in and Avenues for CMP Scaling. In ISCA, 2009.
-
(2009)
ISCA
-
-
Rogers, B.M.1
Krishna, A.2
Bell, G.B.3
Vu, K.4
Jiang, X.5
Solihin, Y.6
-
17
-
-
78650832741
-
Reducing cache pollution through detection and elimination of non-temporal memory accesses
-
A. Sandberg, D. Eklöv, and E. Hagersten. Reducing Cache Pollution Through Detection and Elimination of Non-Temporal Memory Accesses. In SC, 2010.
-
(2010)
SC
-
-
Sandberg, A.1
Eklöv, D.2
Hagersten, E.3
-
20
-
-
74049129459
-
A case for integrated processor-cache partitioning in chip multiprocessors
-
S. Srikantaiah, R. Das, A. K. Mishra, C. R. Das, and M. Kandemir. A case for integrated processor-cache partitioning in chip multiprocessors. In SC, 2009.
-
(2009)
SC
-
-
Srikantaiah, S.1
Das, R.2
Mishra, A.K.3
Das, C.R.4
Kandemir, M.5
-
21
-
-
0036036096
-
Efficient discovery of regular stride patterns in irregular programs and its use in compiler prefetching
-
Y. Wu. Efficient Discovery of Regular Stride Patterns in Irregular Programs and Its Use in Compiler Prefetching. In PLDI, 2002.
-
(2002)
PLDI
-
-
Wu, Y.1
-
22
-
-
84885986325
-
A self-repairing prefetcher in an event-driven dynamic optimization framework
-
W. Zhang, B. Calder, and D. M. Tullsen. A Self-Repairing Prefetcher in an Event-Driven Dynamic Optimization Framework. In CGO, 2006.
-
(2006)
CGO
-
-
Zhang, W.1
Calder, B.2
Tullsen, D.M.3
-
23
-
-
34547686261
-
Ubiquitous memory introspection
-
Q. Zhao, R. Rabbah, S. Amarasinghe, L. Rudolph, and W.-F. Wong. Ubiquitous Memory Introspection. In CGO, 2007
-
(2007)
CGO
-
-
Zhao, Q.1
Rabbah, R.2
Amarasinghe, S.3
Rudolph, L.4
Wong, W.-F.5
|