-
1
-
-
0034825713
-
Performance of Hardware Compressed Main Memory
-
B. Abali, H. Franke, X. Shen, D. E. Poff, and T. B. Smith, "Performance of Hardware Compressed Main Memory," in 7th HPCA, 2001.
-
(2001)
7th HPCA
-
-
Abali, B.1
Franke, H.2
Shen, X.3
Poff, D.E.4
Smith, T.B.5
-
3
-
-
34547676257
-
Interactions between compression and prefetching in chip multiprocessors
-
A. R. Alameldeen and D. A. Wood, "Interactions between compression and prefetching in chip multiprocessors," in 13th HPCA, 2007.
-
(2007)
13th HPCA
-
-
Alameldeen, A.R.1
Wood, D.A.2
-
5
-
-
0026917364
-
Reducing memory latency via non-blocking and prefetching caches
-
T.-F. Chen and J.-L. Baer, "Reducing memory latency via non-blocking and prefetching caches," in 5th ASPLOS, 1992.
-
(1992)
5th ASPLOS
-
-
Chen, T.-F.1
Baer, J.-L.2
-
6
-
-
0036038136
-
Dynamic hot data stream prefetching for general-purpose programs
-
T. M. Chilimbi and M. Hirzel, "Dynamic hot data stream prefetching for general-purpose programs," in PLDI, 2002.
-
(2002)
PLDI
-
-
Chilimbi, T.M.1
Hirzel, M.2
-
7
-
-
47349132413
-
Low-Cost Epoch-Based Correlation Prefetching for Commercial Applications
-
Y. Chou, "Low-Cost Epoch-Based Correlation Prefetching for Commercial Applications," in 40th Micro, 2007.
-
(2007)
40th Micro
-
-
Chou, Y.1
-
8
-
-
0027621679
-
Practical prefetching via data compression
-
K. Curewitz, P. Krishnan, and J. Vitter, "Practical prefetching via data compression," ACM SIGMOD Record, vol. 22, no. 2, pp. 266, 1993.
-
(1993)
ACM SIGMOD Record
, vol.22
, Issue.2
, pp. 266
-
-
Curewitz, K.1
Krishnan, P.2
Vitter, J.3
-
9
-
-
70450233836
-
Stream chaining: Exploiting multiple levels of correlation in data prefetching
-
P. Diaz and M. Cintra, "Stream chaining: Exploiting multiple levels of correlation in data prefetching," in 36th ISCA, 2009.
-
(2009)
36th ISCA
-
-
Diaz, P.1
Cintra, M.2
-
10
-
-
27544435752
-
A Robust Main-Memory Compression Scheme
-
M. Ekman and P. Stenstrom, "A Robust Main-Memory Compression Scheme," in 32nd ISCA, 2005.
-
(2005)
32nd ISCA
-
-
Ekman, M.1
Stenstrom, P.2
-
11
-
-
36949027123
-
Last-touch correlated data streaming
-
M. Ferdman and B. Falsafi, "Last-touch correlated data streaming," in ISPASS, 2007.
-
(2007)
ISPASS
-
-
Ferdman, M.1
Falsafi, B.2
-
13
-
-
77956977035
-
Stride directed prefetching in scalar processors
-
J. Fu, J. H. Patel, and B. L. Janssens, "Stride directed prefetching in scalar processors," in 25th MICRO, 1992.
-
(1992)
25th MICRO
-
-
Fu, J.1
Patel, J.H.2
Janssens, B.L.3
-
15
-
-
0035187053
-
Exploring the design space of future CMPs
-
J. Huh, D. Burger, and S. W. Keckler, "Exploring the design space of future CMPs," in PACT, 2001.
-
(2001)
PACT
-
-
Huh, J.1
Burger, D.2
Keckler, S.W.3
-
16
-
-
0030677583
-
Prefetching using Markov predictors
-
D. Joseph and D. Grunwald, "Prefetching using Markov predictors," in 24th ISCA, 1997.
-
(1997)
24th ISCA
-
-
Joseph, D.1
Grunwald, D.2
-
17
-
-
0025429331
-
Improving direct-mapped cache performance by the addition of a small fully-associative cache and prefetch buffers
-
N. P. Jouppi, "Improving direct-mapped cache performance by the addition of a small fully-associative cache and prefetch buffers," in 17th ISCA, 1990.
-
(1990)
17th ISCA
-
-
Jouppi, N.P.1
-
18
-
-
0036287598
-
Going the distance for TLB prefetching: An application-driven study
-
G. B. Kandiraju and A. Sivasubramaniam, "Going the distance for TLB prefetching: An application-driven study," in 29th ISCA, 2002.
-
(2002)
29th ISCA
-
-
Kandiraju, G.B.1
Sivasubramaniam, A.2
-
19
-
-
70450235471
-
Architecting phase change memory as a scalable DRAM alternative
-
B. C. Lee, E. Ipek, O. Mutlu, and D. Burger, "Architecting phase change memory as a scalable DRAM alternative," in 36th ISCA, 2009.
-
(2009)
36th ISCA
-
-
Lee, B.C.1
Ipek, E.2
Mutlu, O.3
Burger, D.4
-
21
-
-
77954995885
-
Debunking the 100x GPU vs. CPU Myth: An Evaluation of Throughput Computing on CPU and GPU
-
V. Lee, C. Kim, J. Chhugani, M. Deisher, D. Kim et al., "Debunking the 100x GPU vs. CPU Myth: An Evaluation of Throughput Computing on CPU and GPU," in 37th ISCA, 2010.
-
(2010)
37th ISCA
-
-
Lee, V.1
Kim, C.2
Chhugani, J.3
Deisher, M.4
Kim, D.5
-
22
-
-
79960875021
-
Emerging Applications for Multi/Many-Core Processor
-
V. Lee, Y. Chen, and P. Dubey, "Emerging Applications for Multi/Many-Core Processor," in 38th ISCA, 2011.
-
(2011)
38th ISCA
-
-
Lee, V.1
Chen, Y.2
Dubey, P.3
-
23
-
-
0036469676
-
Simics: A full system simulation platform
-
P. Magnusson, M. Christensson, J. Eskilson, D. Forsgren, G. Hållberg et al., "Simics: A full system simulation platform," Computer, pp. 50-58, 2002.
-
(2002)
Computer
, pp. 50-58
-
-
Magnusson, P.1
Christensson, M.2
Eskilson, J.3
Forsgren, D.4
Hållberg, G.5
-
24
-
-
33748870886
-
Multifacet's general execution-driven multiprocessor simulator (GEMS) toolset
-
M. Martin, D. Sorin, B. Beckmann, M. Marty, M. Xu et al., "Multifacet's general execution-driven multiprocessor simulator (GEMS) toolset," ACM SIGARCH Computer Architecture News, vol. 33, no. 4, pp. 92-99, 2005.
-
(2005)
ACM SIGARCH Computer Architecture News
, vol.33
, Issue.4
, pp. 92-99
-
-
Martin, M.1
Sorin, D.2
Beckmann, B.3
Marty, M.4
Xu, M.5
-
27
-
-
2342644731
-
Data cache prefetching using a Global History Buffer
-
K. J. Nesbit and J. E. Smith, "Data cache prefetching using a Global History Buffer," in 10th HPCA, 2004.
-
(2004)
10th HPCA
-
-
Nesbit, K.J.1
Smith, J.E.2
-
28
-
-
84866840089
-
-
OpenMP, http://www.openmp.org/.
-
-
-
-
29
-
-
84956111432
-
Lattice BGK models for Navier-Stokes equation
-
Y. Qian, D. d'Humieres, and P. Lallemand, "Lattice BGK models for Navier-Stokes equation," EPL (Europhysics Letters), vol. 17, pp. 479, 1992.
-
(1992)
EPL (Europhysics Letters)
, vol.17
, pp. 479
-
-
Qian, Y.1
D'Humieres, D.2
Lallemand, P.3
-
30
-
-
70450273507
-
Scalable high performance main memory system using phase-change memory technology
-
M. K. Qureshi, V. Srinivasan, and J. A. Rivers, "Scalable high performance main memory system using phase-change memory technology," in 36th ISCA, 2009.
-
(2009)
36th ISCA
-
-
Qureshi, M.K.1
Srinivasan, V.2
Rivers, J.A.3
-
31
-
-
55449106208
-
Phase-change random access memory: A scalable technology
-
S. Raoux, G. Burr, M. Breitwisch, C. Rettner, Y. C. Chen et al., "Phase-change random access memory: A scalable technology," IBM Journal of Research and Development, vol. 52, no. 4-5, pp. 465-479, 2008.
-
(2008)
IBM Journal of Research and Development
, vol.52
, Issue.4-5
, pp. 465-479
-
-
Raoux, S.1
Burr, G.2
Breitwisch, M.3
Rettner, C.4
Chen, Y.C.5
-
33
-
-
70450077484
-
Designing efficient sorting algorithms for manycore GPUs
-
N. Satish, M. Harris, and M. Garland, "Designing efficient sorting algorithms for manycore GPUs," in IPDPS, 2009.
-
(2009)
IPDPS
-
-
Satish, N.1
Harris, M.2
Garland, M.3
-
34
-
-
33847108092
-
Coterminous locality and coterminous group data prefetching on chip-multiprocessors
-
X. Shi, Z. Yang, J.-K. Peir, L. Peng, Y.-K. Chen et al., "Coterminous locality and coterminous group data prefetching on chip-multiprocessors," in 20th IPDPS, 2006.
-
(2006)
20th IPDPS
-
-
Shi, X.1
Yang, Z.2
Peir, J.-K.3
Peng, L.4
Chen, Y.-K.5
-
35
-
-
0036296856
-
Using a user-level memory thread for correlation prefetching
-
Y. Solihin, J. Lee, and J. Torrellas, "Using a user-level memory thread for correlation prefetching," in 29th ISCA, 2002.
-
(2002)
29th ISCA
-
-
Solihin, Y.1
Lee, J.2
Torrellas, J.3
-
36
-
-
33845894426
-
Spatial memory streaming
-
S. Somogyi, T. F. Wenisch, A. Ailamaki, B. Falsafi, and A. Moshovos, "Spatial memory streaming," in 33rd ISCA, 2006.
-
(2006)
33rd ISCA
-
-
Somogyi, S.1
Wenisch, T.F.2
Ailamaki, A.3
Falsafi, B.4
Moshovos, A.5
-
37
-
-
70450279104
-
Spatio-temporal memory streaming
-
S. Somogyi, T. F. Wenisch, A. Ailamaki, and B. Falsafi, "Spatio-temporal memory streaming," in 36th ISCA, 2009.
-
(2009)
36th ISCA
-
-
Somogyi, S.1
Wenisch, T.F.2
Ailamaki, A.3
Falsafi, B.4
-
38
-
-
34547655822
-
Feedback directed prefetching: Improving the performance and bandwidth-efficiency of hardware prefetchers
-
S. Srinath, O. Mutlu, H. Kim, and Y. N. Patt, "Feedback directed prefetching: Improving the performance and bandwidth-efficiency of hardware prefetchers," in 13th HPCA, 2007.
-
(2007)
13th HPCA
-
-
Srinath, S.1
Mutlu, O.2
Kim, H.3
Patt, Y.N.4
-
39
-
-
0038138424
-
POWER4 system microarchitecture
-
J. Tendler, S. Dodson, S. Fields, H. Le, and B. Sinharoy, "POWER4 system microarchitecture," IBM Technical White Paper, Oct. 2001.
-
(2001)
IBM Technical White Paper
-
-
Tendler, J.1
Dodson, S.2
Fields, S.3
Le, H.4
Sinharoy, B.5
-
40
-
-
84866851700
-
-
POSIX thread, https://computing.llnl.gov/tutorials/pthreads/.
-
POSIX Thread
-
-
-
41
-
-
0035266001
-
IBM memory expansion technology (MXT)
-
R. Tremaine, P. Franaszek, J. Robinson, C. Schulz, T. Smith et al., "IBM memory expansion technology (MXT)," IBM Journal of Research and Development, vol. 45, no. 2, pp. 271-285, 2001.
-
(2001)
IBM Journal of Research and Development
, vol.45
, Issue.2
, pp. 271-285
-
-
Tremaine, R.1
Franaszek, P.2
Robinson, J.3
Schulz, C.4
Smith, T.5
-
43
-
-
27544508955
-
Temporal Streaming of Shared Memory
-
T. F. Wenisch, S. Somogyi, N. Hardavellas, J. Kim, A. Ailamaki et al., "Temporal Streaming of Shared Memory," in 32nd ISCA, 2005.
-
(2005)
32nd ISCA
-
-
Wenisch, T.F.1
Somogyi, S.2
Hardavellas, N.3
Kim, J.4
Ailamaki, A.5
-
44
-
-
56449097232
-
Temporal streams in commercial server applications
-
T. F. Wenisch, M. Ferdman, A. Ailamaki, B. Falsafi, and A. Moshovos, "Temporal streams in commercial server applications," in IEEE International Symposium on Workload Characterization, 2008.
-
(2008)
IEEE International Symposium on Workload Characterization
-
-
Wenisch, T.F.1
Ferdman, M.2
Ailamaki, A.3
Falsafi, B.4
Moshovos, A.5
-
45
-
-
64949123191
-
Practical off-chip meta-data for temporal memory streaming
-
T. F. Wenisch, M. Ferdman, A. Ailamaki, B. Falsafi, and A. Moshovos, "Practical off-chip meta-data for temporal memory streaming," in 15th HPCA, 2009.
-
(2009)
15th HPCA
-
-
Wenisch, T.F.1
Ferdman, M.2
Ailamaki, A.3
Falsafi, B.4
Moshovos, A.5
-
47
-
-
0038364440
-
Frequent value locality and its applications
-
J. Yang and R. Gupta, "Frequent value locality and its applications," ACM Trans. on Embedded Computing Systems, vol. 1, no. 1, pp. 79-105, 2002.
-
(2002)
ACM Trans. on Embedded Computing Systems
, vol.1
, Issue.1
, pp. 79-105
-
-
Yang, J.1
Gupta, R.2
-
49
-
-
36949014308
-
PTLsim: A Cycle Accurate Full System x86-64 Microarchitectural Simulator
-
M. T. Yourst, "PTLsim: A Cycle Accurate Full System x86-64 Microarchitectural Simulator," in ISPASS, 2007.
-
(2007)
ISPASS
-
-
Yourst, M.T.1
-
50
-
-
0034443222
-
Frequent value locality and value-centric data cache design
-
Y. Zhang, J. Yang, and R. Gupta, "Frequent value locality and value-centric data cache design," in 9th ASPLOS, 2000.
-
(2000)
9th ASPLOS
-
-
Zhang, Y.1
Yang, J.2
Gupta, R.3
-
52
-
-
0012525243
-
Benchmark health considered harmful
-
C. Zilles, "Benchmark health considered harmful," ACM SIGARCH Computer Architecture News, vol. 29, no. 3, pp. 4-5, 2001.
-
(2001)
ACM SIGARCH Computer Architecture News
, vol.29
, Issue.3
, pp. 4-5
-
-
Zilles, C.1
-
53
-
-
0017493286
-
A universal algorithm for sequential data compression
-
J. Ziv and A. Lempel, "A universal algorithm for sequential data compression," IEEE Transactions on Information Theory, vol. 23, no. 3, pp. 337-343, 1977.
-
(1977)
IEEE Transactions on Information Theory
, vol.23
, Issue.3
, pp. 337-343
-
-
Ziv, J.1
Lempel, A.2
-
54
-
-
0018019231
-
Compression of individual sequences via variable-rate coding
-
J. Ziv and A. Lempel, "Compression of individual sequences via variable-rate coding," IEEE Transactions on Information Theory, vol. 24, no. 5, pp. 530-536, 1978.
-
(1978)
IEEE Transactions on Information Theory
, vol.24
, Issue.5
, pp. 530-536
-
-
Ziv, J.1
Lempel, A.2