-
1
-
-
44349142233
-
Integration challenges and trade-offs for terascale architectures
-
August
-
M. Azimi et al. Integration challenges and trade-offs for terascale architectures. Intel Tech. J., 11(3):173-184, August 2007.
-
(2007)
Intel Tech. J.
, vol.11
, Issue.3
, pp. 173-184
-
-
Azimi, M.1
-
2
-
-
49249086142
-
Larrabee: A many-core x86 architecture for visual computing
-
August
-
L. Seiler et al. Larrabee: a many-core x86 architecture for visual computing. ACM Trans. Graph., 27(3):1-15, August 2008.
-
(2008)
ACM Trans. Graph.
, vol.27
, Issue.3
, pp. 1-15
-
-
Seiler, L.1
-
5
-
-
27544495466
-
Victim replication: Maximizing capacity while hiding wire delay in tiled chip multiprocessors
-
M. Zhang et al. Victim replication: maximizing capacity while hiding wire delay in tiled chip multiprocessors. ISCA, 2005.
-
(2005)
ISCA
-
-
Zhang, M.1
-
6
-
-
27544432313
-
Optimizing replication, communication, and capacity allocation in CMPs
-
Z. Chishti et al. Optimizing replication, communication, and capacity allocation in CMPs. ISCA, 2005.
-
(2005)
ISCA
-
-
Chishti, Z.1
-
7
-
-
33845903561
-
Cooperative caching for chip multiprocessors
-
J. Chang and G. S. Sohi. Cooperative caching for chip multiprocessors. ISCA, 2006.
-
(2006)
ISCA
-
-
Chang, J.1
Sohi, G.S.2
-
8
-
-
34548008288
-
ASR: Adaptive selective replication for CMP caches
-
B. Beckmann et al. ASR: Adaptive selective replication for CMP caches. MICRO, 2006.
-
(2006)
MICRO
-
-
Beckmann, B.1
-
9
-
-
64949187933
-
Adaptive spill-receive for robust high-performance caching in CMPs
-
M. K. Qureshi. Adaptive spill-receive for robust high-performance caching in CMPs. HPCA, 2009.
-
(2009)
HPCA
-
-
Qureshi, M.K.1
-
10
-
-
70350601187
-
Reactive NUCA: Near-optimal block placement and replication in distributed caches
-
N. Hardavellas et al. Reactive NUCA: near-optimal block placement and replication in distributed caches. ISCA, 2009.
-
(2009)
ISCA
-
-
Hardavellas, N.1
-
11
-
-
77951427968
-
Bank-aware dynamic cache partitioning for multicore architectures
-
D. Kaseridis et al. Bank-aware dynamic cache partitioning for multicore architectures. ICPP, 2009.
-
(2009)
ICPP
-
-
Kaseridis, D.1
-
14
-
-
21644472427
-
Managing wire delay in large chip-multiprocessor caches
-
B. Beckmann and D. Wood. Managing wire delay in large chip-multiprocessor caches. MICRO, 2004.
-
(2004)
MICRO
-
-
Beckmann, B.1
Wood, D.2
-
15
-
-
32844471317
-
A NUCA substrate for flexible CMP cache sharing
-
J. Huh et al. A NUCA substrate for flexible CMP cache sharing. ICS, 2005.
-
(2005)
ICS
-
-
Huh, J.1
-
16
-
-
77952567684
-
Utility-based partitioning of shared caches
-
M. K. Qureshi et al. Utility-based partitioning of shared caches. MICRO, 2006.
-
(2006)
MICRO
-
-
Qureshi, M.K.1
-
17
-
-
35348900723
-
Virtual hierarchies to support server consolidation
-
M. R. Marty et al. Virtual hierarchies to support server consolidation. ISCA, 2007.
-
(2007)
ISCA
-
-
Marty, M.R.1
-
18
-
-
35348816719
-
Virtual private caches
-
K. J. Nesbit et al. Virtual private caches. ISCA, 2007.
-
(2007)
ISCA
-
-
Nesbit, K.J.1
-
19
-
-
77952567861
-
StimulusCache: Boosting performance of chip multiprocessors with excess cache
-
H. Lee et al. StimulusCache: Boosting performance of chip multiprocessors with excess cache. HPCA, 2010.
-
(2010)
HPCA
-
-
Lee, H.1
-
20
-
-
77955009892
-
Elastic cooperative caching: An autonomous dynamically adaptive memory hierarchy for chip multiprocessors
-
E. Herrero et al. Elastic cooperative caching: An autonomous dynamically adaptive memory hierarchy for chip multiprocessors. ISCA, 2010.
-
(2010)
ISCA
-
-
Herrero, E.1
-
21
-
-
0038138424
-
POWER4 system microarchitecture
-
October
-
J. Tendler et al. POWER4 system microarchitecture. IBM Technical White Paper, October 2001.
-
(2001)
IBM Technical White Paper
-
-
Tendler, J.1
-
22
-
-
27544463701
-
Near-optimal worst-case throughput routing for two-dimensional mesh networks
-
D. Seo et al. Near-optimal worst-case throughput routing for two-dimensional mesh networks. ISCA, 2005.
-
(2005)
ISCA
-
-
Seo, D.1
-
23
-
-
0031630017
-
Lamport clocks: Verifying a directory cache-coherence protocol
-
M. Plakal et al. Lamport clocks: Verifying a directory cache-coherence protocol. SPAA, 1998.
-
(1998)
SPAA
-
-
Plakal, M.1
-
24
-
-
0001617669
-
Reducing memory and traffic requirements for scalable directory-based cache coherence schemes
-
A. Gupta et al. Reducing memory and traffic requirements for scalable directory-based cache coherence schemes. ICPP, 1990.
-
(1990)
ICPP
-
-
Gupta, A.1
-
25
-
-
77749317534
-
Two-phase trace-driven simulation (TPTS): A fast multicore processor architecture simulation approach
-
March
-
H. Lee et al. Two-phase trace-driven simulation (TPTS): A fast multicore processor architecture simulation approach. Software: Practice and Experience (SPE), March 2010.
-
(2010)
Software: Practice and Experience (SPE)
-
-
Lee, H.1
-
26
-
-
63549095070
-
The PARSEC benchmark suite: Characterization and architectural implications
-
C. Bienia et al. The PARSEC benchmark suite: Characterization and architectural implications. PACT, 2008.
-
(2008)
PACT
-
-
Bienia, C.1
-
27
-
-
79955912201
-
Symbiotic job scheduling for a simultaneous multithreading processor
-
A. Snavely et al. Symbiotic job scheduling for a simultaneous multithreading processor. ASPLOS, 2000.
-
(2000)
ASPLOS
-
-
Snavely, A.1