[1] C. Johnson et al. A wire-speed Power™ processor: 2.3GHz 45nm SOI with 16 cores and 64 threads. ISSCC 2010.
[2] L. Seiler et al. Larrabee: a many-core x86 architecture for visual computing. IEEE Micro, 29(1):10-21, 2009.
[4] J. Cong et al. Architecture support for accelerator-rich CMPs. DAC 2012.
[6] M. J. Lyons et al. The Accelerator Store: a shared memory framework for accelerator-based systems. ACM Trans. Architecture and Code Optimization, 8(4):48, 2012.
[7] C. F. Fajardo et al. Buffer-Integrated-Cache: a cost-effective SRAM architecture for handheld and embedded platforms. DAC 2011.
[8] J. Cong et al. An energy-efficient adaptive hybrid cache. ISLPED 2011.
[9] J. Cong et al. Combined loop transformation and hierarchy allocation for data reuse optimization. ICCAD 2011.
[10] A. Bui et al. Platform characterization for domain-specific computing. ASPDAC 2012.
[11] B. M. Beckmann et al. ASR: adaptive selective replication for CMP caches. MICRO 2006.
[12] S. Cho and L. Jin. Managing distributed, shared L2 caches through OS-level page allocation. MICRO 2006.
[13] J. Cong et al. A shared Buffer-in-NUCA management scheme for accelerator-rich CMPs. University of California, Los Angeles, Computer Science Department, Technical Report 120012, 2012.
[14] M. Qureshi and Y. Patt. Utility-based cache partitioning: a low-overhead, high-performance, runtime mechanism to partition shared caches. MICRO 2006.
[15] HP Cacti, http://quid.hpl.hp.com:9081/cacti/.
[16] P. S. Magnusson et al. Simics: a full system simulation platform. IEEE Computer, 35(2):50-58, 2002.
[17] M. M. K. Martin et al. Multifacet's general execution-driven multiprocessor simulator toolset. ACM SIGARCH Computer Architecture News, 33(4):92-99, 2005.
[19] S. Li et al. McPAT: an integrated power, area, and timing modeling framework for multicore and manycore architectures. MICRO 2009.