-
1
-
-
80052528714
-
Dark silicon and the end of multicore scaling
-
H. Esmaeilzadeh, E. Blem, R. St. Amant, K. Sankaralingam, and D. Burger, "Dark silicon and the end of multicore scaling, " in IEEE Int. Symp. on Comp. Arch. (ISCA), 2011, pp. 365-376.
-
(2011)
IEEE Int. Symp. on Comp. Arch. (ISCA)
, pp. 365-376
-
-
Esmaeilzadeh, H.1
Blem, E.2
Amant, R.St.3
Sankaralingam, K.4
Burger, D.5
-
2
-
-
77951154340
-
The GPU computing era
-
Mar.-Apr.
-
J. Nickolls and W.J. Dally, "The GPU Computing Era, " IEEE Micro, vol. 30, no. 2, pp. 56 -69, Mar-Apr 2010.
-
(2010)
IEEE Micro
, vol.30
, Issue.2
, pp. 56-69
-
-
Nickolls, J.1
Dally, W.J.2
-
3
-
-
84879718328
-
-
Online
-
NVIDIA. What is CUDA? [Online]. http://www.nvidia.com/object/cuda-home- new.html.
-
What is CUDA?
-
-
-
5
-
-
84859702950
-
AMD fusion APU:Llano
-
Mar.-Apr.
-
A. Branover, D. Foley, and M. Steinman, "AMD Fusion APU:Llano, " IEEE Micro, vol. 32, no. 2, pp. 28-37, Mar-Apr 2012.
-
(2012)
IEEE Micro
, vol.32
, Issue.2
, pp. 28-37
-
-
Branover, A.1
Foley, D.2
Steinman, M.3
-
6
-
-
79955708930
-
A fully integrated multi-CPU, GPU and memory controller 32nm processor
-
M. Yuffe, E. Knoll, M. Mehalel, J. Shor, and T. Kurts, "A fully integrated multi-CPU, GPU and memory controller 32nm processor, " in IEEE Int. Solid-State Circuits Conf. (ISSCC), 2011, pp. 264-266.
-
(2011)
IEEE Int. Solid-State Circuits Conf. (ISSCC)
, pp. 264-266
-
-
Yuffe, M.1
Knoll, E.2
Mehalel, M.3
Shor, J.4
Kurts, T.5
-
7
-
-
76749140917
-
Qilin: Exploiting parallelism on heterogeneous multiprocessors with adaptive mapping
-
Chi-Keung Luk, Sunpyo Hong, and Hyesoon Kim, "Qilin: Exploiting parallelism on heterogeneous multiprocessors with adaptive mapping, " in IEEE/ACM International Symposium on Microarchitecture (MICRO), 2009, pp. 45-55.
-
(2009)
IEEE/ACM International Symposium on Microarchitecture (MICRO)
, pp. 45-55
-
-
Luk, C.-K.1
Hong, S.2
Kim, H.3
-
8
-
-
67650046428
-
Merge: A programming model for heterogeneous multi-core systems
-
M. Linderman, J. Collins, H. Wang, and Meng T., "Merge: a programming model for heterogeneous multi-core systems", ACM Int. Conf. on Architectural Support for Programming Languages and Operating Systems (ASPLOS), 2008, pp. 287-296.
-
(2008)
ACM Int. Conf. on Architectural Support for Programming Languages and Operating Systems (ASPLOS)
, pp. 287-296
-
-
Linderman, M.1
Collins, J.2
Wang, H.3
Meng, T.4
-
9
-
-
35448978324
-
EXOCHI: Architecture and programming environment for a heterogeneous multi-core multithreaded system
-
P. Wang et al., "EXOCHI: Architecture and Programming Environment for a Heterogeneous Multi-core Multithreaded System, " in ACM SIGPLAN Conf. on Programming Language Design and Implementation (PLDI), pp. 156-166.
-
ACM SIGPLAN Conf. on Programming Language Design and Implementation (PLDI)
, pp. 156-166
-
-
Wang, P.1
-
10
-
-
79952090633
-
Power-Efficient work distribution method for CPU-GPU heterogeneous system
-
G. Wang and X. Ren, "Power-Efficient Work Distribution Method for CPU-GPU Heterogeneous System, " in IEEE Parallel and Dist. Proc. with Applications (ISPA), 2010, pp. 122-129.
-
(2010)
IEEE Parallel and Dist. Proc. with Applications (ISPA)
, pp. 122-129
-
-
Wang, G.1
Ren, X.2
-
12
-
-
36949001469
-
An analysis of efficient multi-core global power management policies: Maximizing performance for a given power budget
-
C. Isci, A. Buyuktosunoglu, C.-Y. Cher, P. Bose, and M. Martonosi, "An Analysis of Efficient Multi-Core Global Power Management Policies: Maximizing Performance for a Given Power Budget, " in IEEE Int. Symp. on Microarch. (MICRO), 2006, pp. 347 -358.
-
(2006)
IEEE Int. Symp. on Microarch. (MICRO)
, pp. 347-358
-
-
Isci, C.1
Buyuktosunoglu, A.2
Cher, C.-Y.3
Bose, P.4
Martonosi, M.5
-
13
-
-
84863037228
-
Improving throughput of power-constrained GPUs using dynamic voltage/frequency and core scaling
-
J. Lee, V. Sathish, M. Schulte, K. Compton, and N.S. Kim, "Improving Throughput of Power-Constrained GPUs Using Dynamic Voltage/Frequency and Core Scaling, " in IEEE/ACM Int. Conf. on Parallel Architectures and Compilation Techniques (PACT), 2011, pp. 111-120.
-
(2011)
IEEE/ACM Int. Conf. on Parallel Architectures and Compilation Techniques (PACT)
, pp. 111-120
-
-
Lee, J.1
Sathish, V.2
Schulte, M.3
Compton, K.4
Kim, N.S.5
-
15
-
-
84859464490
-
The gem5 simulator
-
Aug.
-
N. Binkert et al., "The gem5 simulator, " ACM SIGARCH Comp. Arch. News, vol. 39, no. 2, pp. 1-7, Aug 2011.
-
(2011)
ACM SIGARCH Comp. Arch. News
, vol.39
, Issue.2
, pp. 1-7
-
-
Binkert, N.1
-
16
-
-
70349169075
-
Analyzing CUDA workloads using a detailed GPU simulator
-
A. Bakhoda, G. Yuan, W. W. L. Fung, H. Wong, and T. M. Aamodt, "Analyzing CUDA Workloads using a Detailed GPU Simulator, " in IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), 2009.
-
(2009)
IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS)
-
-
Bakhoda, A.1
Yuan, G.2
Fung, W.W.L.3
Wong, H.4
Aamodt, T.M.5
-
17
-
-
84867527289
-
-
Online
-
NVIDIA. GeForce GTS 260M Specification. [Online]. http://www.geforce.com/ hardware/notebook-gpus/geforce-gts-260m/specifications.
-
NVIDIA
-
-
-
19
-
-
76749146060
-
McPAT: An integrated power, area, and timing modeling framework for multicore and manycore architectures
-
Sheng Li et al., "McPAT: an integrated power, area, and timing modeling framework for multicore and manycore architectures, " in IEEE Int. Symp. on Microarch. (MICRO), 2009, pp. 469-480.
-
(2009)
IEEE Int. Symp. on Microarch. (MICRO)
, pp. 469-480
-
-
Li, S.1
-
21
-
-
33750600861
-
New generation of predictive technology model for sub-45nm early design exploration
-
Nov.
-
W. Zhao and Y. Cao, "New generation of Predictive Technology Model for sub-45nm early design exploration, " IEEE T. on Electron Devices, vol. 53, no. 11, pp. 2816-2823, Nov 2006.
-
(2006)
IEEE T. on Electron Devices
, vol.53
, Issue.11
, pp. 2816-2823
-
-
Zhao, W.1
Cao, Y.2
|