-
1
-
-
4544238422
-
Prefetch Injection Based on Hardware Monitoring and Object Metadata
-
A.-R. Adl-Tabatabai, R.L. Hudson, M.J. Serrano, and S. Subramoney, "Prefetch Injection Based on Hardware Monitoring and Object Metadata," Proc. ACM SIGPLAN '04 Conf. Programming Language Design and Implementation, pp. 267-276, 2004.
-
(2004)
Proc. ACM SIGPLAN '04 Conf. Programming Language Design and Implementation
, pp. 267-276
-
-
Adl-Tabatabai, A.-R.1
Hudson, R.L.2
Serrano, M.J.3
Subramoney, S.4
-
2
-
-
0032686330
-
Correlated Load-Address Predictors
-
M. Bekerman, S. Jourdan, R. Ronen, G. Kirshenboim, L. Rappoport, A. Yoaz, and U. Weiser, "Correlated Load-Address Predictors," Proc. 26th Int'l Symp. Computer Architecture, pp. 54-63, 1999.
-
(1999)
Proc. 26th Int'l Symp. Computer Architecture
, pp. 54-63
-
-
Bekerman, M.1
Jourdan, S.2
Ronen, R.3
Kirshenboim, G.4
Rappoport, L.5
Yoaz, A.6
Weiser, U.7
-
3
-
-
70450271824
-
CAVA: Hiding L2 Misses with Checkpoint-Assisted Value Prediction
-
Dec
-
L. Ceze, K. Strauss, J. Tuck, J. Renau, and J. Torrellas, "CAVA: Hiding L2 Misses with Checkpoint-Assisted Value Prediction," IEEE Computer Architecture Letters, vol. 3, Dec. 2004.
-
(2004)
IEEE Computer Architecture Letters
, vol.3
-
-
Ceze, L.1
Strauss, K.2
Tuck, J.3
Renau, J.4
Torrellas, J.5
-
4
-
-
0003758490
-
Correlation-Based Hardware Prefetching,
-
PhD thesis, Cornell Univ, Aug
-
M. Charney, "Correlation-Based Hardware Prefetching," PhD thesis, Cornell Univ., Aug. 1995.
-
(1995)
-
-
Charney, M.1
-
6
-
-
4644226058
-
Microarchitecture Optimizations for Exploiting Memory-Level Parallelism
-
Y. Chou, B. Fahs, and S. Abraham, "Microarchitecture Optimizations for Exploiting Memory-Level Parallelism," Proc. 31st Int'l Symp. Computer Architecture, pp. 76-87, 2004.
-
(2004)
Proc. 31st Int'l Symp. Computer Architecture
, pp. 76-87
-
-
Chou, Y.1
Fahs, B.2
Abraham, S.3
-
7
-
-
84948959230
-
Pointer Cache Assisted Prefetching
-
J.D. Collins, S. Sair, B. Calder, and D.M. Tullsen, "Pointer Cache Assisted Prefetching," Proc. 35th Int'l Symp. Microarchitecture, pp. 62-73, 2002.
-
(2002)
Proc. 35th Int'l Symp. Microarchitecture
, pp. 62-73
-
-
Collins, J.D.1
Sair, S.2
Calder, B.3
Tullsen, D.M.4
-
8
-
-
0036949391
-
A Stateless, Content-Directed Data Prefetching Mechanism
-
R. Cooksey, S. Jourdan, and D. Grunwald, "A Stateless, Content-Directed Data Prefetching Mechanism," Proc. 10th Int'l Conf. Architectural Support for Programming Languages and Operating Systems, pp. 279-290, 2002.
-
(2002)
Proc. 10th Int'l Conf. Architectural Support for Programming Languages and Operating Systems
, pp. 279-290
-
-
Cooksey, R.1
Jourdan, S.2
Grunwald, D.3
-
9
-
-
0030662863
-
Improving Data Cache Performance by Pre-Executing Instructions Under a Cache Miss
-
J. Dundas and T. Mudge, "Improving Data Cache Performance by Pre-Executing Instructions Under a Cache Miss," Proc. 1997 Int'l Conf. Supercomputing, pp. 68-75, 1997.
-
(1997)
Proc. 1997 Int'l Conf. Supercomputing
, pp. 68-75
-
-
Dundas, J.1
Mudge, T.2
-
12
-
-
0031641244
-
Power Considerations in the Design of the Alpha 21264 Microprocessor
-
M.K. Gowan, L.L. Biro, and D.B. Jackson, "Power Considerations in the Design of the Alpha 21264 Microprocessor," Proc. 35th Ann. Design Automation Conf., pp. 726-731, 1998.
-
(1998)
Proc. 35th Ann. Design Automation Conf
, pp. 726-731
-
-
Gowan, M.K.1
Biro, L.L.2
Jackson, D.B.3
-
14
-
-
0025429331
-
Improving Direct-Mapped Cache Performance by the Addition of a Small Fully-Associative Cache and Prefetch Buffers
-
N.P. Jouppi, "Improving Direct-Mapped Cache Performance by the Addition of a Small Fully-Associative Cache and Prefetch Buffers," Proc. 17th Int'l Symp. Computer Architecture, pp. 364-373, 1990.
-
(1990)
Proc. 17th Int'l Symp. Computer Architecture
, pp. 364-373
-
-
Jouppi, N.P.1
-
15
-
-
0034581346
-
A Prefetching Technique for Irregular Accesses to Linked Data Structures
-
M. Karlsson, F. Dahlgren, and P. Strenstrom, "A Prefetching Technique for Irregular Accesses to Linked Data Structures," Proc. Sixth Int'l Symp. High Performance Computer Architecture, pp. 206-217, 2000.
-
(2000)
Proc. Sixth Int'l Symp. High Performance Computer Architecture
, pp. 206-217
-
-
Karlsson, M.1
Dahlgren, F.2
Strenstrom, P.3
-
16
-
-
28444492331
-
Checkpointed Early Load Retirement
-
N. Kirman, M. Kirman, M. Chaudhuri, and J.F. Martínez, "Checkpointed Early Load Retirement," Proc. 11th Int'l Symp. High Performance Computer Architecture, pp. 16-27, 2005.
-
(2005)
Proc. 11th Int'l Symp. High Performance Computer Architecture
, pp. 16-27
-
-
Kirman, N.1
Kirman, M.2
Chaudhuri, M.3
Martínez, J.F.4
-
17
-
-
85008031236
-
MinneSPEC: A New SPEC Benchmark Workload for Simulation-Based Computer Architecture Research
-
June
-
A. KleinOsowski and D.J. Lilja, "MinneSPEC: A New SPEC Benchmark Workload for Simulation-Based Computer Architecture Research," IEEE Computer Architecture Letters, vol. 1, June 2002.
-
(2002)
IEEE Computer Architecture Letters
, vol.1
-
-
KleinOsowski, A.1
Lilja, D.J.2
-
19
-
-
0029509984
-
SPAID: Software Prefetching in Pointer- and Call-Intensive Environments
-
M.H. Lipasti, W.J. Schmidt, S.R. Kunkel, and R.R. Roediger, "SPAID: Software Prefetching in Pointer- and Call-Intensive Environments," Proc. 28th Int'l Symp. Microarchitecture, pp. 232-236, 1995.
-
(1995)
Proc. 28th Int'l Symp. Microarchitecture
, pp. 232-236
-
-
Lipasti, M.H.1
Schmidt, W.J.2
Kunkel, S.R.3
Roediger, R.R.4
-
20
-
-
0002666476
-
Value Locality and Load Value Prediction
-
M.H. Lipasti, C. Wilkerson, and J.P. Shen, "Value Locality and Load Value Prediction," Proc. Seventh Int'l Conf. Architectural Support for Programming Languages and Operating Systems, pp. 226-237, 1996.
-
(1996)
Proc. Seventh Int'l Conf. Architectural Support for Programming Languages and Operating Systems
, pp. 226-237
-
-
Lipasti, M.H.1
Wilkerson, C.2
Shen, J.P.3
-
22
-
-
30344476979
-
An Analysis of the Performance Impact of Wrong-Path Memory References on Out-of-Order and Runahead Execution Processors
-
Dec
-
O. Mutlu, H. Kim, D.N. Armstrong, and Y.N. Patt, "An Analysis of the Performance Impact of Wrong-Path Memory References on Out-of-Order and Runahead Execution Processors," IEEE Trans. Computers, vol. 54, no. 12, pp. 1556-1571, Dec. 2005.
-
(2005)
IEEE Trans. Computers
, vol.54
, Issue.12
, pp. 1556-1571
-
-
Mutlu, O.1
Kim, H.2
Armstrong, D.N.3
Patt, Y.N.4
-
23
-
-
33644917917
-
Address-Value Delta (AVD) Prediction: Increasing the Effectiveness of Runahead Execution by Exploiting Regular Memory Allocation Patterns
-
O. Mutlu, H. Kim, and Y.N. Patt, "Address-Value Delta (AVD) Prediction: Increasing the Effectiveness of Runahead Execution by Exploiting Regular Memory Allocation Patterns," Proc. 38th Int'l Symp. Microarchitecture, pp. 233-244, 2005.
-
(2005)
Proc. 38th Int'l Symp. Microarchitecture
, pp. 233-244
-
-
Mutlu, O.1
Kim, H.2
Patt, Y.N.3
-
24
-
-
27544498766
-
Techniques for Efficient Processing in Runahead Execution Engines
-
O. Mutlu, H. Kim, and Y.N. Patt, "Techniques for Efficient Processing in Runahead Execution Engines," Proc. 32nd Int'l Symp. Computer Architecture, pp. 370-381, 2005.
-
(2005)
Proc. 32nd Int'l Symp. Computer Architecture
, pp. 370-381
-
-
Mutlu, O.1
Kim, H.2
Patt, Y.N.3
-
25
-
-
84955506994
-
Runahead Execution: An Alternative to Very Large Instruction Windows for Out-of-Order Processors
-
O. Mutlu, J. Stark, C. Wilkerson, and Y.N. Patt, "Runahead Execution: An Alternative to Very Large Instruction Windows for Out-of-Order Processors," Proc. Ninth Int'l Symp. High Performance Computer Architecture, pp. 129-140, 2003.
-
(2003)
Proc. Ninth Int'l Symp. High Performance Computer Architecture
, pp. 129-140
-
-
Mutlu, O.1
Stark, J.2
Wilkerson, C.3
Patt, Y.N.4
-
26
-
-
1342282617
-
Runahead Execution: An Effective Alternative to Large Instruction Windows
-
Nov./Dec
-
O. Mutlu, J. Stark, C. Wilkerson, and Y.N. Patt, "Runahead Execution: An Effective Alternative to Large Instruction Windows," IEEE Micro, vol. 23, no. 6, pp. 20-25, Nov./Dec. 2003.
-
(2003)
IEEE Micro
, vol.23
, Issue.6
, pp. 20-25
-
-
Mutlu, O.1
Stark, J.2
Wilkerson, C.3
Patt, Y.N.4
-
27
-
-
33749399900
-
Reducing Load Latency through Memory Instruction Characterization,
-
PhD thesis, Univ. of Michigan
-
P. Racunas, "Reducing Load Latency through Memory Instruction Characterization," PhD thesis, Univ. of Michigan, 2003.
-
(2003)
-
-
Racunas, P.1
-
28
-
-
0029273301
-
Supporting Dynamic Data Structures on Distributed Memory Machines
-
Mar
-
A. Rogers, M.C. Carlisle, J. Reppy, and L. Hendren, "Supporting Dynamic Data Structures on Distributed Memory Machines," ACM Trans. Programming Languages and Systems, vol. 17, no. 2, pp. 233-263, Mar. 1995.
-
(1995)
ACM Trans. Programming Languages and Systems
, vol.17
, Issue.2
, pp. 233-263
-
-
Rogers, A.1
Carlisle, M.C.2
Reppy, J.3
Hendren, L.4
-
29
-
-
0031600692
-
Dependence Based Prefetching for Linked Data Structures
-
A. Roth, A. Moshovos, and G.S. Sohi, "Dependence Based Prefetching for Linked Data Structures," Proc. Eighth Int'l Conf. Architectural Support for Programming Languages and Operating Systems, pp. 115-126, 1998.
-
(1998)
Proc. Eighth Int'l Conf. Architectural Support for Programming Languages and Operating Systems
, pp. 115-126
-
-
Roth, A.1
Moshovos, A.2
Sohi, G.S.3
-
32
-
-
0036953769
-
Automatically Characterizing Large Scale Program Behavior
-
T. Sherwood, E. Perelman, G. Hamerly, and B. Calder, "Automatically Characterizing Large Scale Program Behavior," Proc. 10th Int'l Conf. Architectural Support for Programming Languages and Operating Systems, pp. 45-57, 2002.
-
(2002)
Proc. 10th Int'l Conf. Architectural Support for Programming Languages and Operating Systems
, pp. 45-57
-
-
Sherwood, T.1
Perelman, E.2
Hamerly, G.3
Calder, B.4
-
33
-
-
0036296856
-
Using a User-Level Memory Thread for Correlation Prefetching
-
Y. Solihin, J. Lee, and J. Torrellas, "Using a User-Level Memory Thread for Correlation Prefetching," Proc. 29th Int'l Symp. Computer Architecture, pp. 171-182, 2002.
-
(2002)
Proc. 29th Int'l Symp. Computer Architecture
, pp. 171-182
-
-
Solihin, Y.1
Lee, J.2
Torrellas, J.3
-
35
-
-
0003535436
-
POWER4 System Microarchitecture
-
paper, Oct
-
J. Tendler, S. Dodson, S. Fields, H. Le, and B. Sinharoy, "POWER4 System Microarchitecture," IBM technical white paper, Oct. 2001.
-
(2001)
IBM technical white
-
-
Tendler, J.1
Dodson, S.2
Fields, S.3
Le, H.4
Sinharoy, B.5
-
37
-
-
1342320051
-
The Memory Gap and the Future of High Performance Memories
-
Mar
-
M.V. Wilkes, "The Memory Gap and the Future of High Performance Memories," ACM Computer Architecture News, vol. 29, no. 1, pp. 2-7, Mar. 2001.
-
(2001)
ACM Computer Architecture News
, vol.29
, Issue.1
, pp. 2-7
-
-
Wilkes, M.V.1
-
38
-
-
0036036096
-
Efficient Discovery of Regular Stride Patterns in Irregular Programs and Its Use in Compiler Prefetching
-
Y. Wu, "Efficient Discovery of Regular Stride Patterns in Irregular Programs and Its Use in Compiler Prefetching," Proc. ACM SIGPLAN '02 Conf. Programming Language Design and Implementation, pp. 210-221, 2002.
-
(2002)
Proc. ACM SIGPLAN '02 Conf. Programming Language Design and Implementation
, pp. 210-221
-
-
Wu, Y.1
-
39
-
-
0003158656
-
Hitting the Memory Wall: Implications of the Obvious
-
Mar
-
W. Wulf and S. McKee, "Hitting the Memory Wall: Implications of the Obvious," ACM Computer Architecture News, vol. 23, no. 1, pp. 20-24, Mar. 1995.
-
(1995)
ACM Computer Architecture News
, vol.23
, Issue.1
, pp. 20-24
-
-
Wulf, W.1
McKee, S.2
-
41
-
-
1142293060
-
Enhancing Memory Level Parallelism via Recovery-Free Value Prediction
-
H. Zhou and T.M. Conte, "Enhancing Memory Level Parallelism via Recovery-Free Value Prediction," Proc. 17th Int'l Conf. Supercomputing, pp. 326-335, 2003.
-
(2003)
Proc. 17th Int'l Conf. Supercomputing
, pp. 326-335
-
-
Zhou, H.1
Conte, T.M.2
-
42
-
-
0012525243
-
Benchmark Health Considered Harmful
-
June
-
C.B. Zilles, "Benchmark Health Considered Harmful," Computer Architecture News, vol. 29, no. 3, pp. 4-5, June 2001.
-
(2001)
Computer Architecture News
, vol.29
, Issue.3
, pp. 4-5
-
-
Zilles, C.B.1
|