-
1
-
-
0003158656
-
Hitting the memory wall: Implications of the obvious
-
DOI: 10.1145/216585.216588.
-
W. A. Wulf and S. A. McKee. "Hitting the Memory Wall: Implications of the Obvious." ACM SIGARCH Computer Architecture News, v. 23 no. 1, 1995. DOI: 10.1145/216585.216588. xiii
-
(1995)
ACM SIGARCH Computer Architecture News
, vol.23
, Issue.1
-
-
Wulf, W.A.1
McKee, S.A.2
-
2
-
-
84878619560
-
TLB Improvements for Chip Multiprocessors: Inter-Core Cooperative Prefetchers and Shared Last-Level TLBs
-
DOI: 10.1145/2445572.2445574.
-
D. Lustig, A. Bhattacharjee, and M. Martonosi. "TLB Improvements for Chip Multiprocessors: Inter-Core Cooperative Prefetchers and Shared Last-Level TLBs." ACM Transactions on Architecture and Code Optimization, v. 10, no. 1, 2013. DOI: 10.1145/2445572.2445574. xiii
-
(2013)
ACM Transactions on Architecture and Code Optimization
, vol.10
, Issue.1
-
-
Lustig, D.1
Bhattacharjee, A.2
Martonosi, M.3
-
4
-
-
67651111633
-
Te Memory System: You Can't Avoid It, You Can't Ignore It, You Can't Fake It
-
DOI: 10.2200/S00201ED1V01Y200907CAC007.
-
B. Jacob. "Te Memory System: You Can't Avoid It, You Can't Ignore It, You Can't Fake It." Synthesis Lectures on Computer Architecture, v. 4, no. 1, 2009. DOI: 10.2200/S00201ED1V01Y200907CAC007. 2
-
(2009)
Synthesis Lectures on Computer Architecture
, vol.4
, Issue.1
, pp. 2
-
-
Jacob, B.1
-
5
-
-
0018106484
-
Sequential Program Prefetching in Memory Hierarchies
-
DOI: 10.1109/C-M.1978.218016. 7
-
A. J. Smith. "Sequential Program Prefetching in Memory Hierarchies." Computer, v. 11, no. 12, 1978. DOI: 10.1109/C-M.1978.218016. 7, 15
-
(1978)
Computer
, vol.11
, Issue.12
, pp. 15
-
-
Smith, A.J.1
-
6
-
-
2842517957
-
Te IBM System/360 Model 91: Machine Philosophy and Instruction-Handling
-
DOI: 10.1147/rd.111.0008. 8
-
D. W. Anderson, F. J. Sparacio, and R. M. Tomasulo. "Te IBM System/360 Model 91: Machine Philosophy and Instruction-Handling." IBM Journal of Research and Development, v. 11 no. 1, 1967. DOI: 10.1147/rd.111.0008. 8
-
(1967)
IBM Journal of Research and Development
, vol.11
, Issue.1
-
-
Anderson, D.W.1
Sparacio, F.J.2
Tomasulo, R.M.3
-
7
-
-
66749100028
-
Temporal Instruction Fetch Streaming
-
DOI: 10.1109/MICRO.2008.4771774. 8
-
M. Ferdman, T F Wenisch, A. Ailamaki, B. Falsaf, A. Moshovos. "Temporal Instruction Fetch Streaming." In Proc. of the 41st Annual ACM/IEEE International Symposium on Microarchitecture, 2008. DOI: 10.1109/MICRO.2008.4771774. 8, 10, 12
-
(2008)
Proc. of the 41st Annual ACM/IEEE International Symposium on Microarchitecture
, vol.10
, pp. 12
-
-
Ferdman, M.1
Wenisch, T.F.2
Ailamaki, A.3
Falsaf, B.4
Moshovos, A.5
-
10
-
-
34548767664
-
Enlarging Instruction Streams
-
DOI: 10.1109/TC.2007.70742. 8
-
O. J. Santana, A. Ramirez, and M. Valero. "Enlarging Instruction Streams." IEEE Trans-actions on Computers, v. 56, no. 10, 2007. DOI: 10.1109/TC.2007.70742. 8, 11
-
(2007)
IEEE Trans-actions on Computers
, vol.56
, Issue.10
, pp. 11
-
-
Santana, O.J.1
Ramirez, A.2
Valero, M.3
-
13
-
-
8344281427
-
Non-Sequential Instruction Cache Prefetching for Multiple-Issue Processors
-
DOI: 10.1142/S0129053399000065. 9
-
A. V. Veidenbaum, Q. Zhao, and A. Shameer. "Non-Sequential Instruction Cache Prefetching for Multiple-Issue Processors." International Journal of High Speed Computing, v. 10, no. 1, 1999. DOI: 10.1142/ S0129053399000065. 9
-
(1999)
International Journal of High Speed Computing
, vol.10
, Issue.1
-
-
Veidenbaum, A.V.1
Zhao, Q.2
Shameer, A.3
-
17
-
-
0034818890
-
Branch History Guided Instruction Prefetching
-
DOI: 10.1109/HPCA.2001.903271. 11
-
V. Srinivasan, E. S. Davidson, G. S. Tyson, M. J. Charney, and T. R. Puzak. "Branch History Guided Instruction Prefetching." In Proc. of the 7th International Symposium on High-Performance Computer Architecture, 2001. DOI: 10.1109/HPCA.2001.903271. 11
-
(2001)
Proc. of the 7th International Symposium on High-Performance Computer Architecture
-
-
Srinivasan, V.1
Davidson, E.S.2
Tyson, G.S.3
Charney, M.J.4
Puzak, T.R.5
-
20
-
-
2442585659
-
Call Graph Prefetching for Data-base Applications
-
DOI: 10.1145/945506.945509. 11
-
M. Annavaram, J. M. Patel, and E. S. Davidson. "Call Graph Prefetching for Data-base Applications." ACM Transactions on Computer Systems, v. 21, no. 4, 2003. DOI: 10.1145/945506.945509. 11
-
(2003)
ACM Transactions on Computer Systems
, vol.21
, Issue.4
-
-
Annavaram, M.1
Patel, J.M.2
Davidson, E.S.3
-
22
-
-
2342523901
-
Hardware Support for Prescient Instruction Prefetch
-
DOI: 10.1109/HPCA.2004.10028.12
-
T. M. Aamodt, P. Chow, P. Hammarlund, H. Wang, and J. P. Shen. "Hardware Support for Prescient Instruction Prefetch." Proc. of the 10th International Symposium on High-Per-formance Computer Architecture, 2004. DOI: 10.1109/HPCA.2004.10028.12
-
(2004)
Proc. of the 10th International Symposium on High-Per-formance Computer Architecture
-
-
Aamodt, T.M.1
Chow, P.2
Hammarlund, P.3
Wang, H.4
Shen, J.P.5
-
23
-
-
0032308865
-
Cooperative Prefetching: Compiler and Hardware Support for Efective Instruction Prefetching in Modern Processors
-
DOI: 10.1109/MICRO.1998.742780.12
-
C-K. Luk, T. C. Mowry. "Cooperative Prefetching: Compiler and Hardware Support for Efective Instruction Prefetching In Modern Processors." In Proc. of the 31st annual ACM/IEEE International Symposium on Microarchitecture, 1998. DOI: 10.1109/MICRO.1998.742780.12
-
(1998)
Proc. of the 31st Annual ACM/IEEE International Symposium on Microarchitecture
-
-
Luk, C.-K.1
Mowry, T.C.2
-
29
-
-
0026267802
-
An Efective On-Chip Preloading Scheme to Reduce Data Access Penalty
-
DOI: 10.1145/125826.125932. 15
-
J.-L. Baer and T.-F Chen. "An Efective On-Chip Preloading Scheme to Reduce Data Access Penalty." In Proc. of Supercomputing, 1991. DOI: 10.1145/125826.125932. 15, 16
-
(1991)
Proc. of Supercomputing
, vol.16
-
-
Baer, J.-L.1
Chen, T.-F.2
-
30
-
-
0038702612
-
Efectiveness of Hardware-Based Stride and Sequential Prefetching in Shared-Memory Multiprocessors
-
IEEE Symposium on High-Performance Computer Architecture DOI: 10.1109/HPCA.1995.386554. 16
-
F Dahlgren and P. Stenstrom. "Efectiveness of Hardware-Based Stride and Sequential Prefetching in Shared-Memory Multiprocessors." In Proc. of the 1st IEEE Symposium on High-Performance Computer Architecture, 1995. DOI: 10.1109/HPCA.1995.386554. 16
-
(1995)
Proc. of the 1st
-
-
Dahlgren, F.1
Stenstrom, P.2
-
31
-
-
79551718643
-
Access Map Pattern Matching for High Performance Data Cache Prefetch
-
Y. Ishii, M. Inaba and K. Hiraki. "Access Map Pattern Matching for High Performance Data Cache Prefetch." Journal of Instruction-Level Parallelism, v. 13, 2011. 16, 28
-
(2011)
Journal of Instruction-Level Parallelism
, vol.13
, Issue.16
, pp. 28
-
-
Ishii, Y.1
Inaba, M.2
Hiraki, K.3
-
32
-
-
0037340044
-
A Decoupled Predictor-Directed Stream Prefetch-ing Architecture
-
DOI: 10.1109/TC.2003.1183943.
-
S. Sair, T. Sherwood, and B. Calder. "A Decoupled Predictor-Directed Stream Prefetch-ing Architecture." IEEE Transactions on Computers, v. 52, no. 3, 2003. DOI: 10.1109/TC.2003.1183943. 16
-
(2003)
IEEE Transactions on Computers
, vol.52
, Issue.3
, pp. 16
-
-
Sair, S.1
Sherwood, T.2
Calder, B.3
-
34
-
-
0025429331
-
Improving Direct-Mapped Cache Performance by the Addition of a Small Fully-Associative Cache and Prefetch Bufers
-
DOI: 10.1145/325164.325162. 16,24
-
N. P. Jouppi. "Improving Direct-Mapped Cache Performance by the Addition of a Small Fully-Associative Cache and Prefetch Bufers." In Proc. of the 17th Annual International Symposium on Computer Architecture, 1990. DOI: 10.1145/325164.325162. 16, 24
-
(1990)
Proc. of the 17th Annual International Symposium on Computer Architecture
-
-
Jouppi, N.P.1
-
38
-
-
0016930686
-
Dynamic Improvement of Locality in Virtual Mem-ory Systems
-
DOI: 10.1109/TSE.1976.233801. 17
-
J-L. Baer, J-L., and G. R. Sager. "Dynamic Improvement of Locality in Virtual Mem-ory Systems." IEEE Transactions on Software Engineering, v. 1, 1976. DOI: 10.1109/TSE.1976.233801. 17
-
(1976)
IEEE Transactions on Software Engineering
, vol.1
-
-
Baer J -L, J.-L.1
Sager, G.R.2
-
41
-
-
0036038136
-
Dynamic Hot Data Stream Prefetching for Gener-al-Purpose Programs
-
DOI: 10.1145/512529.512554. 17, 19, 20, 24, 36
-
T. M. Chilimbi and M. Hirzel. "Dynamic Hot Data Stream Prefetching for Gener-al-Purpose Programs." In Proc. of the Conference on Programming Language Design and Implementation, 2002. DOI: 10.1145/512529.512554. 17, 19, 20, 24, 36
-
(2002)
Proc. of the Conference on Programming Language Design and Implementation
-
-
Chilimbi, T.M.1
Hirzel, M.2
-
42
-
-
27544508955
-
Temporal Streaming of Shared Memory
-
June DOI: 10.1109/ISCA.2005.50. 17, 20,. 22
-
T. F Wenisch, S. Somogyi, N. Hardavellas, J. Kim, A. Ailamaki, and B. Falsaf. "Temporal Streaming of Shared Memory." In Proc. of the 32nd Annual International Symposium on Computer Architecture, June 2005. DOI: 10.1109/ISCA.2005.50. 17, 20, 22
-
(2005)
Proc. of the 32nd Annual International Symposium on Computer Architecture
-
-
Wenisch, T.F.1
Somogyi, S.2
Hardavellas, N.3
Kim, J.4
Ailamaki, A.5
Falsaf, B.6
-
50
-
-
0033075109
-
Prefetching Using Markov Predictors
-
DOI: 10.1109/12.752653. 18
-
D. Joseph and D. Grunwald. "Prefetching Using Markov Predictors." IEEE Transactions on Computers, v. 48 no. 2, 1999. DOI: 10.1109/12.752653. 18
-
(1999)
IEEE Transactions on Computers
, vol.48
, Issue.2
-
-
Joseph, D.1
Grunwald, D.2
-
51
-
-
0034851536
-
Dead-Block Prediction and Dead-Block Correlating Prefetchers
-
DOI: 10.1145/379240.379259. 19, 20, 21
-
A.-C. Lai, C. Fide, and B. Falsaf. "Dead-Block Prediction and Dead-Block Correlating Prefetchers." In Proc. of the 28th Annual International Symposium on Computer Architecture, 2001. DOI: 10.1145/379240.379259. 19, 20, 21
-
(2001)
Proc. of the 28th Annual International Symposium on Computer Architecture
-
-
Lai, A.-C.1
Fide, C.2
Falsaf, B.3
-
53
-
-
0042850375
-
Correlation Prefetching with a User-Level Memory Tread
-
DOI: 10.1109/TPDS.2003.1206504. 20
-
Y. Solihin, J. Lee, and J. Torrellas. "Correlation Prefetching with a User-Level Memory Tread." IEEE Transactions on Parallel and Distributed Systems, v. 14, no. 6, 2003. DOI: 10.1109/TPDS.2003.1206504. 20
-
(2003)
IEEE Transactions on Parallel and Distributed Systems
, vol.14
, Issue.6
-
-
Solihin, Y.1
Lee, J.2
Torrellas, J.3
-
55
-
-
56449097232
-
Temporal Streams in Commercial Server Applications
-
DOI: 10.1109/IISWC.2008.4636095. 20, 22, 32
-
T.F. Wenisch, M. Ferdman, A. Ailamaki, B. Falsaf, and A. Moshovos. "Temporal Streams in Commercial Server Applications." In Proc. of the IEEE International Symposium on Workload Characterization, 2008. DOI: 10.1109/IISWC.2008.4636095. 20, 22, 32
-
(2008)
Proc. of the IEEE International Symposium on Workload Characterization
-
-
Wenisch, T.F.1
Ferdman, M.2
Ailamaki, A.3
Falsaf, B.4
Moshovos, A.5
-
60
-
-
0033691729
-
Selective, Accurate, and Timely Self-Invalidation Using Last-Touch Prediction
-
DOI: 10.1145/339647.339669. 20
-
A-C. Lai, and B. Falsaf. "Selective, Accurate, and Timely Self-Invalidation Using Last-Touch Prediction." In Proc. of the 27th Annual International Symposium on Computer Architecture, 2000. DOI: 10.1145/339647.339669. 20, 28
-
(2000)
Proc. of the 27th Annual International Symposium on Computer Architecture
, vol.28
-
-
Lai, A.-C.1
Falsaf, B.2
-
62
-
-
66749155879
-
Cache Bursts: A New Approach for Eliminating Dead Blocks and Increasing Cache Efciency
-
DOI: 10.1109/MICRO.2008.4771793. 20, 21
-
H. Liu, M. Ferdman, J. Huh, and D. Burger. "Cache Bursts: A New Approach for Eliminating Dead Blocks and Increasing Cache Efciency." In Proc. of the 41st Annual ACM/IEEE International Symposium on Microarchitecture, 2008. DOI: 10.1109/MICRO.2008.4771793. 20, 21
-
(2008)
Proc. of the 41st Annual ACM/IEEE International Symposium on Microarchitecture
-
-
Liu, H.1
Ferdman, M.2
Huh, J.3
Burger, D.4
-
64
-
-
0027149156
-
Modeling Live and Dead Lines in Cache Memory Systems
-
DOI: 10.1109/12.192209. 20
-
A. Mendelson, D. Tiebaut, and D. K. Pradhan. "Modeling Live and Dead Lines in Cache Memory Systems." IEEE Transactions on Computers, v. 4 2, n o. 1. DOI: 10.1109/12.192209. 20
-
IEEE Transactions on Computers
, vol.4
, Issue.2
, pp. 1
-
-
Mendelson, A.1
Tiebaut, D.2
Pradhan, D.K.3
-
67
-
-
64949123191
-
Practical Of-Chip Meta-Data for Temporal Memory Streaming
-
DOI: 10.1109/HPCA.2009.4798239. 22, 25
-
T. F. Wenisch, M. Ferdman, A. Ailamaki, B. Falsaf, A. Moshovos. "Practical Of-Chip Meta-Data for Temporal Memory Streaming." In Proc. of the 15th International Symposium on High Performance Computer Architecture, 2009. DOI: 10.1109/HPCA.2009.4798239. 22, 25
-
(2009)
Proc. of the 15th International Symposium on High Performance Computer Architecture
-
-
Wenisch, T.F.1
Ferdman, M.2
Ailamaki, A.3
Falsaf, B.4
Moshovos, A.5
-
68
-
-
2342644731
-
Data Cache Prefetching Using a Global History Bufer
-
DOI: 10.1109/HPCA.2004.10030. 22, 23, 24, 26, 27, 28
-
K. J. Nesbit and J. E. Smith. "Data Cache Prefetching Using a Global History Bufer." In Proc. of the 10th IEEE Symposium on High-Performance Computer Architecture, 2004. DOI: 10.1109/HPCA.2004.10030. 22, 23, 24, 26, 27, 28
-
(2004)
Proc. of the 10th IEEE Symposium on High-Performance Computer Architecture
-
-
Nesbit, K.J.1
Smith, J.E.2
-
69
-
-
10444284911
-
AC/DC: An Adaptive Data Cache Prefetcher
-
DOI: 10.1109/PACT.2004.1342548. 23, 24, 27
-
K. J. Nesbit, A. S. Dhodapkar, and J. E. Smith. "AC/DC: An Adaptive Data Cache Prefetcher." In Proc. of the 13th International Conference on Parallel Architectures and Compilation Techniques, 2004. DOI: 10.1109/PACT.2004.1342548. 23, 24, 27
-
(2004)
Proc. of the 13th International Conference on Parallel Architectures and Compilation Techniques
-
-
Nesbit, K.J.1
Dhodapkar, A.S.2
Smith, J.E.3
-
70
-
-
77949614728
-
Making Ad-dress-Correlated Prefetching Practical
-
DOI: 10.1109/MM.2010.21. 25
-
T. F. Wenisch, M. Ferdman, A. Ailamaki, B. Falsaf and A. Moshovos. "Making Ad-dress-Correlated Prefetching Practical." IEEE Micro, v. 30, no. 1, 2010. DOI: 10.1109/MM.2010.21. 25
-
(2010)
IEEE Micro
, vol.30
, Issue.1
-
-
Wenisch, T.F.1
Ferdman, M.2
Ailamaki, A.3
Falsaf, B.4
Moshovos, A.5
-
71
-
-
84877712179
-
Application Data Prefetching on the IBM Blue Gene/Q Supercomputer
-
DOI: 10.1109/SC.2012.19. 25
-
I. Chung, C. Kim, H.-F. Wen, and G. Cong. "Application Data Prefetching on the IBM Blue Gene/Q Supercomputer." In International Conference on High Performance Computing, Networking, Storage and Analysis, 2012. DOI: 10.1109/SC.2012.19. 25
-
(2012)
International Conference on High Performance Computing, Networking, Storage and Analysis
-
-
Chung, I.1
Kim, C.2
Wen, H.-F.3
Cong, G.4
-
74
-
-
79551697130
-
Storage Efcient Hardware Prefetching Using Delta Correlating Prediction Tables
-
M. Grannaes, M. Jahre, and L. Natvig. "Storage Efcient Hardware Prefetching Using Delta Correlating Prediction Tables." Journal of Instruction-Level Parallelism, v. 13, 2011. 28
-
(2011)
Journal of Instruction-Level Parallelism
, vol.13
, pp. 28
-
-
Grannaes, M.1
Jahre, M.2
Natvig, L.3
-
75
-
-
79551702603
-
Combining Local and Global History for High Perfor-mance Data Prefetching
-
M. Dimitrov and H. Zhou. "Combining Local and Global History for High Perfor-mance Data Prefetching." Journal of Instruction-Level Parallelism, v. 13, 2011. 28
-
(2011)
Journal of Instruction-Level Parallelism
, vol.13
, pp. 28
-
-
Dimitrov, M.1
Zhou, H.2
-
76
-
-
79551706790
-
Enhancements for Accurate and Timely Streaming Prefetcher
-
G. Liu, Z. Huang, J-K. Peir, X. Shi, and L. Peng. "Enhancements for Accurate and Timely Streaming Prefetcher." Journal of Instruction-Level Parallelism, v. 13, 2011. 28
-
(2011)
Journal of Instruction-Level Parallelism
, vol.13
, pp. 28
-
-
Liu, G.1
Huang, Z.2
Peir, J.-K.3
Shi, X.4
Peng, L.5
-
77
-
-
79751510498
-
Multi-Level Adaptive Prefetching Based on Performance Gradient Tracking
-
L. M. Ramos, J. L. Briz, P. E. Ibáñez, and V. Viñals. "Multi-Level Adaptive Prefetching Based on Performance Gradient Tracking." Journal of Instruction-Level Parallelism, v. 13, 2011. 28
-
(2011)
Journal of Instruction-Level Parallelism
, vol.13
, pp. 28
-
-
Ramos, L.M.1
Briz, J.L.2
Ibáñez, P.E.3
Viñals, V.4
-
78
-
-
79551700079
-
Data Prefetching by Exploiting Global and Local Access Pat-terns
-
A. Sharif and H-H. Lee. "Data Prefetching by Exploiting Global and Local Access Pat-terns. " Journal of Instruction-Level Parallelism, v. 13, 2011. 28
-
(2011)
Journal of Instruction-Level Parallelism
, vol.13
, pp. 28
-
-
Sharif, A.1
Lee, H.-H.2
-
82
-
-
33845894426
-
Spatial Memory Streaming
-
DOI: 10.1109/ISCA.2006.38. 29, 31, 32
-
S. Somogyi, T. F. Wenisch, A. Ailamaki, B. Falsaf, and A. Moshovos. "Spatial Memory Streaming." In Proc. of the 33rd Annual International Symposium on Computer Architecture, 2006. DOI: 10.1109/ISCA.2006.38. 29, 31, 32
-
(2006)
Proc. of the 33rd Annual International Symposium on Computer Architecture
-
-
Somogyi, S.1
Wenisch, T.F.2
Ailamaki, A.3
Falsaf, B.4
Moshovos, A.5
-
83
-
-
2342482320
-
Accurate and Complexity-Efective Spatial Pattern Prediction
-
Feb DOI: 10.1109/HPCA.2004.10010. 29, 30
-
C. F Chen, S.-H. Yang, B. Falsaf, and A. Moshovos. "Accurate and Complexity-Efective Spatial Pattern Prediction." In Proc. of the 10th IEEE Symposium on High-Performance Computer Architecture, Feb. 2004. DOI: 10.1109/HPCA.2004.10010. 29, 30
-
(2004)
Proc. of the 10th IEEE Symposium on High-Performance Computer Architecture
-
-
Chen, C.F.1
Yang, S.-H.2
Falsaf, B.3
Moshovos, A.4
-
85
-
-
79952940839
-
Spatial Memory Streaming
-
DOI: 10.1109/ISCA.2006.38. 29
-
S. Somogyi, T. F Wenisch, M. Ferdman, and B. Falsaf. "Spatial Memory Streaming." Journal of Instruction-Level Parallelism, v. 13, 2011. DOI: 10.1109/ISCA.2006.38. 29
-
(2011)
Journal of Instruction-Level Parallelism
, vol.13
-
-
Somogyi, S.1
Wenisch, T.F.2
Ferdman, M.3
Falsaf, B.4
-
86
-
-
0028324009
-
Decoupled Sectored Caches: Conciliating Low Tag Implementation Cost and Low Miss Ratio
-
DOI: 10.1145/191995.192072. 30
-
A. Seznec. "Decoupled Sectored Caches: Conciliating Low Tag Implementation Cost and Low Miss Ratio." In Proc. of the 21st Annual International Symposium on Computer Architecture, 1994. DOI: 10.1145/191995.192072. 30
-
(1994)
Proc. of the 21st Annual International Symposium on Computer Architecture
-
-
Seznec, A.1
-
87
-
-
0033672408
-
Gated-Vdd: A Circuit Technique to Reduce Leakage in Deep-Submicron Cache Memories
-
DOI: 10.1145/344166.344526. 30
-
M. D. Powell, S-H. Yang, B. Falsaf, K. Roy, and T N. Vijaykumar. "Gated-Vdd: A Circuit Technique to Reduce Leakage in Deep-Submicron Cache Memories." In Proc. of the International Symposium on Low Power Electronics and Design, 2000. DOI: 10.1145/344166.344526. 30
-
(2000)
Proc. of the International Symposium on Low Power Electronics and Design
-
-
Powell, M.D.1
Yang, S.-H.2
Falsaf, B.3
Roy, K.4
Vijaykumar, T.N.5
-
91
-
-
70450279104
-
Spatio-Temporal Memory Streaming
-
DOI: 10.1145/1555754.1555766. 32
-
S. Somogyi, T. F. Wenisch, A. Ailamaki, and B. Falsafi. "Spatio-Temporal Memory Streaming." In Proc. of the 36th Annual International Symposium on Computer Architecture, 2009. DOI: 10.1145/1555754.1555766. 32, 33
-
(2009)
Proc. of the 36th Annual International Symposium on Computer Architecture
, vol.33
-
-
Somogyi, S.1
Wenisch, T.F.2
Ailamaki, A.3
Falsafi, B.4
-
93
-
-
0034839033
-
Speculative Precomputation: Long-Range Prefetching of Delinquent Loads
-
DOI: 10.1145/379240.379248. 33
-
J. D. Collins, H. Wang, D. M. Tullsen, C. Hughes, Y.-F Lee, D. Lavery, and J. P. Shen. "Speculative Precomputation: Long-Range Prefetching of Delinquent Loads." In Proc. of the 28th Annual International Symposium on Computer Architecture, 2001. DOI: 10.1145/379240.379248. 33
-
(2001)
Proc. of the 28th Annual International Symposium on Computer Architecture
-
-
Collins, J.D.1
Wang, H.2
Tullsen, D.M.3
Hughes, C.4
Lee, Y.-F.5
Lavery, D.6
Shen, J.P.7
-
96
-
-
47349095223
-
Future Execution: A Prefetching Mechanism that Uses Multiple Cores to Speed Up Single Treads
-
DOI: 10.1145/1187976.1187979. 33
-
I. Ganusov and M. Burtscher. "Future Execution: A Prefetching Mechanism that Uses Multiple Cores to Speed Up Single Treads." ACM Transactions on Architecture and Code Optimization, v. 3, no. 4, 2006. DOI: 10.1145/1187976.1187979. 33
-
(2006)
ACM Transactions on Architecture and Code Optimization
, vol.3
, Issue.4
-
-
Ganusov, I.1
Burtscher, M.2
-
97
-
-
68849120952
-
Prefetching with Helper Threads for Loosely Coupled Multiprocessor Systems
-
DOI: 10.1109/TPDS.2008.224. 33
-
J. Lee, C. Jung, D. Lim, and Y. Solihin. "Prefetching with Helper Threads for Loosely Coupled Multiprocessor Systems." IEEE Transactions on Parallel and Distributed Systems, v. 20, no. 9, 2009. DOI: 10.1109/TPDS.2008. 224. 33
-
(2009)
IEEE Transactions on Parallel and Distributed Systems
, vol.20
, Issue.9
-
-
Lee, J.1
Jung, C.2
Lim, D.3
Solihin, Y.4
-
104
-
-
1342282617
-
Runahead execution: An Efective Alternative to Large Instruction Windows
-
DOI: 10.1109/MM.2003.1261383.34
-
O. Mutlu, J. Stark, C. Wilkerson, and Y. N. Patt. "Runahead execution: An Efective Alternative to Large Instruction Windows." IEEE Micro, v. 23, no. 6, 2003. DOI: 10.1109/MM.2003.1261383.34
-
(2003)
IEEE Micro
, vol.23
, Issue.6
-
-
Mutlu, O.1
Stark, J.2
Wilkerson, C.3
Patt, Y.N.4
-
106
-
-
33644903196
-
Efficient Runahead Execution: Power-Efcient Memory Latency Tolerance
-
DOI: 10.1109/MM.2006.10. 34
-
O. Mutlu, H. Kim, and Y N. Patt. "Efficient Runahead Execution: Power-Efcient Memory Latency Tolerance." IEEE Micro, v. 26, no. 1, 2006. DOI: 10.1109/MM.2006.10. 34
-
(2006)
IEEE Micro
, vol.26
, Issue.1
-
-
Mutlu, O.1
Kim, H.2
Patt, Y.N.3
-
107
-
-
12844269176
-
Continual Flow Pipelines
-
DOI: 10.1145/1024393.1024407. 34
-
S. T Srinivasan, R. Rajwar, H. Akkary, A. Gandhi, and M. Upton. "Continual Flow Pipelines." In Proc. of the International Conference on Architectural Support for Programming Languages and Operating Systems, 2004. DOI: 10.1145/1024393.1024407. 34
-
(2004)
Proc. of the International Conference on Architectural Support for Programming Languages and Operating Systems
-
-
Srinivasan, S.T.1
Rajwar, R.2
Akkary, H.3
Gandhi, A.4
Upton, M.5
-
109
-
-
77951007282
-
Extending Data Prefetching to Cope with Context Switch Misses
-
DOI: 10.1109/ICCD.2009.5413144. 34, 35
-
H. Cui and S. Suleyman. "Extending Data Prefetching to Cope with Context Switch Misses." In Proc. of the International Conference on Computer Design, 2009. DOI: 10.1109/ICCD.2009.5413144. 34, 35
-
(2009)
Proc. of the International Conference on Computer Design
-
-
Cui, H.1
Suleyman, S.2
-
111
-
-
84880308617
-
RECAP: A Region-Based Cure for the Common Cold (Cache)
-
DOI: 10.1145/2370816.2370887. 34
-
J. Zebchuk, H. W. Cain, X. Tong, V. Srinivasan and A. Moshovos. "RECAP: A Region-Based Cure for the Common Cold (Cache)." In Proc. of the 19th Annual International Symposium on High Performance Computer Architecture, 2013. DOI: 10.1145/2370816.2370887. 34
-
Proc. of the 19th Annual International Symposium on High Performance Computer Architecture
, vol.2013
-
-
Zebchuk, J.1
Cain, H.W.2
Tong, X.3
Srinivasan, V.4
Moshovos, A.5
-
118
-
-
84863379287
-
PACMan: Prefetch-Aware Cache Management for High Performance Caching
-
DOI: 10.1145/2155620.2155672. 35
-
C-J. Wu, A. Jaleel, M. Martonosi, S. Steely Jr, and J. Emer. "PACMan: Prefetch-Aware Cache Management for High Performance Caching." In Proc. of the 44th Annual ACM/IEEE International Symposium on Microarchitecture, 2011. DOI: 10.1145/2155620.2155672. 35
-
(2011)
Proc. of the 44th Annual ACM/IEEE International Symposium on Microarchitecture
-
-
Wu, C.-J.1
Jaleel, A.2
Martonosi, M.3
Steely Jr., S.4
Emer, J.5
-
119
-
-
79551699363
-
Efcient Prefetching with Hybrid Schemes and Use of Program Feedback to Adjust Prefetcher Aggressiveness
-
S. Verma, D. M. Koppelman, and L. Peng. "Efcient Prefetching with Hybrid Schemes and Use of Program Feedback to Adjust Prefetcher Aggressiveness." Journal of Instruction-Level Parallelism, v. 13, 2011. 35
-
(2011)
Journal of Instruction-Level Parallelism
, vol.13
, pp. 35
-
-
Verma, S.1
Koppelman, D.M.2
Peng, L.3
-
125
-
-
0032650093
-
Memory Forwarding: Enabling Aggressive Layout Optimizations by Guaranteeing the Safety of Data Relocation
-
DOI: 10.1145/300979.300987. 36
-
C.-K. Luk and T. C. Mowry. "Memory Forwarding: Enabling Aggressive Layout Optimizations by Guaranteeing the Safety of Data Relocation." In Proc. of the 26th Annual International Symposium on Computer Architecture, 1999. DOI: 10.1145/300979.300987. 36
-
(1999)
Proc. of the 26th Annual International Symposium on Computer Architecture
-
-
Luk, C.-K.1
Mowry, T.C.2
-
126
-
-
4644295630
-
Evaluating the Imagine Stream Architecture
-
J. H. Ahn, W. J. Dally, B. Khailany, U. J. Kapasi, and A. Das. "Evaluating the Imagine Stream Architecture." In Proc. of the 31st Annual International Symposium on Computer Architecture, 2004. 36
-
(2004)
Proc. of the 31st Annual International Symposium on Computer Architecture
, vol.36
-
-
Ahn, J.H.1
Dally, W.J.2
Khailany, B.3
Kapasi, U.J.4
Das, A.5
-
127
-
-
84877083867
-
Merrimac: Supercomputing with Streams
-
DOI: 10.1145/1048935.1050187. 36
-
W. J. Dally, F Labonte, A. Das, P. Hanrahan, J.-H. Ahn, J. Gummaraju, M. Erez, N. Jayasena, I. Buck, T. J. Knight, and U. J. Kapasi. "Merrimac: Supercomputing with Streams." In Proc. of 'Supercomputing, 2003. DOI: 10.1145/1048935.1050187. 36
-
(2003)
Proc. of 'Supercomputing
-
-
Dally, W.J.1
Labonte, F.2
Das, A.3
Hanrahan, P.4
Ahn, J.-H.5
Gummaraju, J.6
Erez, M.7
Jayasena, N.8
Buck, I.9
Knight, T.J.10
Kapasi, U.J.11
-
128
-
-
0036959649
-
A Stream Compiler for Communication-Exposed Architectures
-
DOI: 10.1145/605397.605428. 36
-
M. I. Gordon, W. Thies, M. Karczmarek, J. Lin, A. S. Meli, A. A. Lamb, C. Leger, J. Wong, H. Hoffmann, D. Maze, and S. Amarasinghe. "A Stream Compiler for Communication-Exposed Architectures." In Proc. of the 10th International Conference on Architectural Support for Programming Languages and Operating Systems, 2002. DOI: 10.1145/605397.605428. 36
-
(2002)
Proc. of the 10th International Conference on Architectural Support for Programming Languages and Operating Systems
-
-
Gordon, M.I.1
Thies, W.2
Karczmarek, M.3
Lin, J.4
Meli, A.S.5
Lamb, A.A.6
Leger, C.7
Wong, J.8
Hoffmann, H.9
Maze, D.10
Amarasinghe, S.11
-
130
-
-
80052528714
-
Dark Silicon and the End of Multicore Scaling
-
DOI: 10.1145/2000064.2000108. 40
-
H. Esmaeilzadeh, E. Blem, R. S. Amant, K. Sankaralingam, and D. Burger. "Dark Silicon and the End of Multicore Scaling." In Proc. of the 38th Annual International Symposium on Computer Architecture, 2011. DOI: 10.1145/2000064.2000108. 40
-
(2011)
Proc. of the 38th Annual International Symposium on Computer Architecture
-
-
Esmaeilzadeh, H.1
Blem, E.2
Amant, R.S.3
Sankaralingam, K.4
Burger, D.5
-
131
-
-
79961040286
-
Toward Dark Silicon in Servers
-
DOI: 10.1109/MM.2011.77.40
-
N. Hardavellas, M. Ferdman, B. Falsaf, and A. Ailamaki. "Toward Dark Silicon in Servers." In IEEE Micro, v. 31, no. 4, 2011. DOI: 10.1109/MM.2011.77.40.
-
(2011)
IEEE Micro
, vol.31
, Issue.4
-
-
Hardavellas, N.1
Ferdman, M.2
Falsaf, B.3
Ailamaki, A.4
|