-
2
-
-
33947715600
-
IPC considered harmful for multiprocessor workloads
-
A. Alameldeen and D. Wood, "IPC considered harmful for multiprocessor workloads," IEEE Micro, vol. 26, no. 4, 2006.
-
(2006)
IEEE Micro
, vol.26
, Issue.4
-
-
Alameldeen, A.1
Wood, D.2
-
3
-
-
47349112480
-
Scavenger: A new last level cache architecture with global block priority
-
A. Basu, N. Kirman, M. Kirman, M. Chaudhuri, and J. Martinez, "Scavenger: A new last level cache architecture with global block priority," in Proc. of the 40th annual IEEE/ACM Intl Symp. on Microarchitecture, 2007.
-
(2007)
Proc. of the 40th Annual IEEE/ACM Intl Symp. on Microarchitecture
-
-
Basu, A.1
Kirman, N.2
Kirman, M.3
Chaudhuri, M.4
Martinez, J.5
-
4
-
-
0003003638
-
A study of replacement algorithms for a virtualstorage computer
-
L. A. Belady, "A study of replacement algorithms for a virtualstorage computer," IBM Syst. J., vol. 5, no. 2, 1966.
-
(1966)
IBM Syst. J.
, vol.5
, Issue.2
-
-
Belady, L.A.1
-
5
-
-
63549095070
-
The PARSEC benchmark suite: Characterization and architectural implications
-
C. Bienia, S. Kumar, J. P. Singh, and K. Li, "The PARSEC benchmark suite: Characterization and architectural implications," in Proc. of the 17th Intl. Conf. on Parallel Architectures and Compilation Techniques, 2008.
-
(2008)
Proc. of the 17th Intl. Conf. on Parallel Architectures and Compilation Techniques
-
-
Bienia, C.1
Kumar, S.2
Singh, J.P.3
Li, K.4
-
6
-
-
0014814325
-
Space/time trade-offs in hash coding with allowable errors
-
B. H. Bloom, "Space/time trade-offs in hash coding with allowable errors," Commun. ACM, vol. 13, no. 7, 1970.
-
(1970)
Commun. ACM
, vol.13
, Issue.7
-
-
Bloom, B.H.1
-
8
-
-
33846703999
-
Disintermediated active communication
-
A. Bracy, K. Doshi, and Q. Jacobson, "Disintermediated active communication," Comput. Archit. Lett., vol. 5, no. 2, 2006.
-
(2006)
Comput. Archit. Lett.
, vol.5
, Issue.2
-
-
Bracy, A.1
Doshi, K.2
Jacobson, Q.3
-
12
-
-
35348862407
-
BulkSC: Bulk enforcement of sequential consistency
-
L. Ceze, J. Tuck, P. Montesinos, and J. Torrellas, "BulkSC: bulk enforcement of sequential consistency," in Proc. of the 34th annual Intl. Symp. on Computer architecture, 2007.
-
(2007)
Proc. of the 34th Annual Intl. Symp. on Computer Architecture
-
-
Ceze, L.1
Tuck, J.2
Montesinos, P.3
Torrellas, J.4
-
13
-
-
33845866604
-
Bulk disambiguation of speculative threads in multiprocessors
-
L. Ceze, J. Tuck, J. Torrellas, and C. Cascaval, "Bulk disambiguation of speculative threads in multiprocessors," in Proc. of the 33rd annual Intl. Symp. on Computer Architecture, 2006.
-
(2006)
Proc. of the 33rd Annual Intl. Symp. on Computer Architecture
-
-
Ceze, L.1
Tuck, J.2
Torrellas, J.3
Cascaval, C.4
-
16
-
-
57849167541
-
An efficient hardware-based multi-hash scheme for high speed IP lookup
-
S. Demetriades, M. Hanna, S. Cho, and R. Melhem, "An efficient hardware-based multi-hash scheme for high speed IP lookup," in Proc. of the 16th IEEE Symp. on High Performance Interconnects, 2008.
-
(2008)
Proc. of the 16th IEEE Symp. on High Performance Interconnects
-
-
Demetriades, S.1
Hanna, M.2
Cho, S.3
Melhem, R.4
-
17
-
-
49549096253
-
A sub-1W to 2W low-power IA processor for mobile internet devices and ultra-mobile PCs in 45nm hi-K metal gate CMOS
-
G. Gerosa et al., "A sub-1W to 2W low-power IA processor for mobile internet devices and ultra-mobile PCs in 45nm hi-K metal gate CMOS," in IEEE Intl. Solid-State Circuits Conf., 2008.
-
(2008)
IEEE Intl. Solid-State Circuits Conf.
-
-
Gerosa, G.1
-
19
-
-
4644359934
-
Transactional memory coherence and consistency
-
L. Hammond, V. Wong, M. Chen, B. D. Carlstrom, J. D. Davis, B. Hertzberg, M. K. Prabhu, H. Wijaya, C. Kozyrakis, and K. Olukotun, "Transactional memory coherence and consistency," in Proc. of the 31st annual Intl. Symp. on Computer Architecture, 2004.
-
(2004)
Proc. of the 31st Annual Intl. Symp. on Computer Architecture
-
-
Hammond, L.1
Wong, V.2
Chen, M.3
Carlstrom, B.D.4
Davis, J.D.5
Hertzberg, B.6
Prabhu, M.K.7
Wijaya, H.8
Kozyrakis, C.9
Olukotun, K.10
-
21
-
-
0024903997
-
Evaluating associativity in cpu caches
-
M. D. Hill and A. J. Smith, "Evaluating associativity in cpu caches," IEEE Trans. Comput., vol. 38, no. 12, 1989.
-
(1989)
IEEE Trans. Comput.
, vol.38
, Issue.12
-
-
Hill, M.D.1
Smith, A.J.2
-
22
-
-
77952123736
-
A 48-core IA-32 message-passing processor with DVFS in 45nm CMOS
-
J. Howard et al., "A 48-core IA-32 message-passing processor with DVFS in 45nm CMOS," in IEEE Intl. Solid-State Circuits Conf., 2010.
-
(2010)
IEEE Intl. Solid-State Circuits Conf.
-
-
Howard, J.1
-
23
-
-
63549149925
-
Adaptive insertion policies for managing shared caches
-
A. Jaleel, W. Hasenplaugh, M. Qureshi, J. Sebot, S. Steely, Jr., and J. Emer, "Adaptive insertion policies for managing shared caches," in Proc. of the 17th intl. conf. on Parallel Architectures and Compilation Techniques, 2008.
-
(2008)
Proc. of the 17th Intl. Conf. on Parallel Architectures and Compilation Techniques
-
-
Jaleel, A.1
Hasenplaugh, W.2
Qureshi, M.3
Sebot, J.4
Steely Jr., S.5
Emer, J.6
-
24
-
-
77954998134
-
High performance cache replacement using re-reference interval prediction (RRIP)
-
A. Jaleel, K. Theobald, S. C. S. Jr, and J. Emer, "High performance cache replacement using re-reference interval prediction (RRIP)," in Proc. of the 37th annual Intl. Symp. on Computer Architecture, 2010.
-
(2010)
Proc. of the 37th Annual Intl. Symp. on Computer Architecture
-
-
Jaleel, A.1
Theobald, K.2
Jr, S.C.S.3
Emer, J.4
-
25
-
-
0025429331
-
Improving direct-mapped cache performance by the addition of a small fully-associative cache and prefetch buffers
-
N. P. Jouppi, "Improving direct-mapped cache performance by the addition of a small fully-associative cache and prefetch buffers," in Proc. of the 17th annual Intl. Symp. on Computer Architecture, 1990.
-
(1990)
Proc. of the 17th Annual Intl. Symp. on Computer Architecture
-
-
Jouppi, N.P.1
-
26
-
-
2342640788
-
Using prime numbers for cache indexing to eliminate conflict misses
-
M. Kharbutli, K. Irwin, Y. Solihin, and J. Lee, "Using prime numbers for cache indexing to eliminate conflict misses," in Proc. of the 10th Intl. Symp. on High Performance Computer Architecture, 2004.
-
(2004)
Proc. of the 10th Intl. Symp. on High Performance Computer Architecture
-
-
Kharbutli, M.1
Irwin, K.2
Solihin, Y.3
Lee, J.4
-
27
-
-
0036949388
-
An adaptive, nonuniform cache structure for wire-delay dominated on-chip caches
-
C. Kim, D. Burger, and S. W. Keckler, "An adaptive, nonuniform cache structure for wire-delay dominated on-chip caches," in Proc. of the 10th intl. conf. on Architectural Support for Programming Languages and Operating Systems, 2002.
-
(2002)
Proc. of the 10th Intl. Conf. on Architectural Support for Programming Languages and Operating Systems
-
-
Kim, C.1
Burger, D.2
Keckler, S.W.3
-
30
-
-
76749146060
-
McPAT: An integrated power, area, and timing modeling framework for multicore and manycore architectures
-
S. Li, J. H. Ahn, R. D. Strong, J. B. Brockman, D. M. Tullsen, and N. P. Jouppi, "McPAT: an integrated power, area, and timing modeling framework for multicore and manycore architectures," in Proc. of the 42nd annual IEEE/ACM Intl. Symp. on Microarchitecture, 2009.
-
(2009)
Proc. of the 42nd Annual IEEE/ACM Intl. Symp. on Microarchitecture
-
-
Li, S.1
Ahn, J.H.2
Strong, R.D.3
Brockman, J.B.4
Tullsen, D.M.5
Jouppi, N.P.6
-
31
-
-
31944440969
-
Pin: Building customized program analysis tools with dynamic instrumentation
-
C.-K. Luk, R. Cohn, R. Muth, H. Patil, A. Klauser, G. Lowney, S. Wallace, V. J. Reddi, and K. Hazelwood, "Pin: building customized program analysis tools with dynamic instrumentation," in Proc. of the ACM SIGPLAN conf. on Programming Language Design and Implementation, 2005.
-
(2005)
Proc. of the ACM SIGPLAN Conf. on Programming Language Design and Implementation
-
-
Luk, C.-K.1
Cohn, R.2
Muth, R.3
Patil, H.4
Klauser, A.5
Lowney, G.6
Wallace, S.7
Reddi, V.J.8
Hazelwood, K.9
-
38
-
-
47349104267
-
Implementing signatures for transactional memory
-
D. Sanchez, L. Yen, M. D. Hill, and K. Sankaralingam, "Implementing signatures for transactional memory," in Proc. of the 40th annual IEEE/ACM Intl. Symp. on Microarchitecture, 2007.
-
(2007)
Proc. of the 40th Annual IEEE/ACM Intl. Symp. on Microarchitecture
-
-
Sanchez, D.1
Yen, L.2
Hill, M.D.3
Sankaralingam, K.4
-
40
-
-
77952200539
-
A 40nm 16-core 128-thread CMT SPARC SoC processor
-
J. Shin et al., "A 40nm 16-core 128-thread CMT SPARC SoC processor," in IEEE Intl. Solid-State Circuits Conf., 2010.
-
(2010)
IEEE Intl. Solid-State Circuits Conf.
-
-
Shin, J.1
-
42
-
-
71149094440
-
The bulk multicore architecture for improved programmability
-
J. Torrellas, L. Ceze, J. Tuck, C. Cascaval, P. Montesinos, W. Ahn, and M. Prvulovic, "The bulk multicore architecture for improved programmability," Commun. ACM, vol. 52, no. 12, 2009.
-
(2009)
Commun. ACM
, vol.52
, Issue.12
-
-
Torrellas, J.1
Ceze, L.2
Tuck, J.3
Cascaval, C.4
Montesinos, P.5
Ahn, W.6
Prvulovic, M.7
-
43
-
-
77952179543
-
The implementation of POWER7: A highly parallel and scalable multi-core high-end server processor
-
D. Wendel et al., "The implementation of POWER7: A highly parallel and scalable multi-core high-end server processor," in IEEE Intl. Solid-State Circuits Conf., 2010.
-
(2010)
IEEE Intl. Solid-State Circuits Conf.
-
-
Wendel, D.1
-
45
-
-
0031232542
-
Two fast and highassociativity cache schemes
-
C. Zhang, X. Zhang, and Y. Yan, "Two fast and highassociativity cache schemes," IEEE Micro, vol. 17, no. 5, 1997.
-
(1997)
IEEE Micro
, vol.17
, Issue.5
-
-
Zhang, C.1
Zhang, X.2
Yan, Y.3
|