-
1
-
-
84880271648
-
-
Arizona state university predictive technology model.
-
Arizona state university predictive technology model. , http://ptm.asu.edu.
-
-
-
-
2
-
-
84880285921
-
-
Cacti 6.0: A tool to understand large caches.
-
Cacti 6.0: A tool to understand large caches. http://www.cs.utah.edu/ rajeev/cacti6/.
-
-
-
-
3
-
-
84880301447
-
-
Nvidia cuda sdk 4.2. developer.nvidia.com/cuda/cudadownloads
-
Nvidia cuda sdk 4.2. developer.nvidia.com/cuda/cudadownloads.
-
-
-
-
4
-
-
84880302548
-
-
fermi white paper v1.1.
-
Nvidia, fermi white paper v1.1. http://www.nvidia.com/content/PDF/fermi white papers/NVIDIA Fermi Compute Architecture Whitepaper.pdf.
-
-
-
-
5
-
-
84880297167
-
-
Parboil benchmark suite.
-
Parboil benchmark suite. http://impact.crhc.illinois.edu/parboil.php.
-
-
-
-
6
-
-
85016664946
-
Iatac: A smart predictor to turn-off l2 cache lines
-
J. Abella, A. Gonzalez, X. Vera, and M. F. P. O'Boyle. Iatac: A smart predictor to turn-off l2 cache lines. ACM Transactions on Architecture and Code Optimiization, 2(1):55-77, 2005.
-
(2005)
ACM Transactions on Architecture and Code Optimiization
, vol.2
, Issue.1
, pp. 55-77
-
-
Abella, J.1
Gonzalez, A.2
Vera, X.3
O'boyle, M.F.P.4
-
7
-
-
3543076272
-
Poweraware compilation for register file energy reduction
-
Dec
-
J. L. Ayala, A. Veidenbaum, and M. Lopez-Vallejo. Poweraware compilation for register file energy reduction. International Journal of Parallel Programming, 31(6):451-467, Dec. 2003.
-
(2003)
International Journal of Parallel Programming
, vol.31
, Issue.6
, pp. 451-467
-
-
Ayala, J.L.1
Veidenbaum, A.2
Lopez-Vallejo, M.3
-
8
-
-
70349169075
-
Analyzing cuda workloads using a detailed gpu simulator
-
April
-
A. Bakhoda, G. Yuan, W. Fung, H. Wong, and T. Aamodt. Analyzing cuda workloads using a detailed gpu simulator. In IEEE International Symposium on Performance Analysis of Systems and Software, pages 163-174, April 2009.
-
(2009)
IEEE International Symposium on Performance Analysis of Systems and Software
, pp. 163-174
-
-
Bakhoda, A.1
Yuan, G.2
Fung, W.3
Wong, H.4
Aamodt, T.5
-
11
-
-
0033716803
-
Multiple-banked register file architectures
-
J.-L. Cruz, A. Gonzalez, M. Valero, and N. P. Topham. Multiple-banked register file architectures. In Proceedings of the 27th annual international symposium on Computer architecture, pages 316-325, 2000.
-
(2000)
Proceedings of the 27th Annual International Symposium on Computer Architecture
, pp. 316-325
-
-
Cruz, J.-L.1
Gonzalez, A.2
Valero, M.3
Topham, N.P.4
-
12
-
-
0036294454
-
Drowsy caches: Simple techniques for reducing leakage power
-
K. Flautner, N. S. Kim, S. Martin, D. Blaauw, and T. Mudge. Drowsy caches: simple techniques for reducing leakage power. In Proceedings of the 29th Annual International Symposium on Computer Architecture, pages 148-157, 2002.
-
(2002)
Proceedings of the 29th Annual International Symposium on Computer Architecture
, pp. 148-157
-
-
Flautner, K.1
Kim, N.S.2
Martin, S.3
Blaauw, D.4
Mudge, T.5
-
13
-
-
80052533471
-
Energy-efficient mechanisms for managing thread context in throughput processors
-
M. Gebhart, D. R. Johnson, D. Tarjan, S. W. Keckler, W. J. Dally, E. Lindholm, and K. Skadron. Energy-efficient mechanisms for managing thread context in throughput processors. In Proceedings of the 38th annual international symposium on Computer architecture, pages 235-246, 2011.
-
(2011)
Proceedings of the 38th Annual International Symposium on Computer Architecture
, pp. 235-246
-
-
Gebhart, M.1
Johnson, D.R.2
Tarjan, D.3
Keckler, S.W.4
Dally, W.J.5
Lindholm, E.6
Skadron, K.7
-
14
-
-
77953597369
-
Register file partitioning and recompilation for register file power reduction
-
May
-
X. Guan and Y. Fei. Register file partitioning and recompilation for register file power reduction. ACM Transactions on Design Automation of Electronic Systems, 15(3):24:1-24:30, May 2010.
-
(2010)
ACM Transactions on Design Automation of Electronic Systems
, vol.15
, Issue.3
, pp. 241-2430
-
-
Guan, X.1
Fei, Y.2
-
15
-
-
12344265531
-
-
PhD Dissertation, Department of Electrical Engineering, Stanford University, August
-
R. Ho. On-chip wires: Scaling and efficiency. PhD Dissertation, Department of Electrical Engineering, Stanford University, August 2003.
-
(2003)
On-chip Wires: Scaling and Efficiency
-
-
Ho, R.1
-
16
-
-
26044463497
-
A lower-power register file based on complementary pass-transistor adiabatic logic
-
July
-
J. Hu, T. Xu, and H. Li. A lower-power register file based on complementary pass-transistor adiabatic logic. IEICE-Transactions on Information and Systems, E88-D(7):1479-1485, July 2005.
-
(2005)
IEICE-Transactions on Information and Systems
, vol.E88-D
, Issue.7
, pp. 1479-1485
-
-
Hu, J.1
Xu, T.2
Li, H.3
-
17
-
-
0029288557
-
Trends in lowpower ram circuit technologies
-
April
-
K. Itoh, K. Sasaki, and Y. Nakagome. Trends in lowpower ram circuit technologies. Proceedings of the IEEE, 83(4):524-543, April 1995.
-
(1995)
Proceedings of the IEEE
, vol.83
, Issue.4
, pp. 524-543
-
-
Itoh, K.1
Sasaki, K.2
Nakagome, Y.3
-
18
-
-
84948956783
-
Drowsy instruction caches: Leakage power reduction using dynamic voltage scaling and cache sub-bank prediction
-
N. S. Kim, K. Flautner, D. Blaauw, and T. Mudge. Drowsy instruction caches: Leakage power reduction using dynamic voltage scaling and cache sub-bank prediction. In Proceedings of 35th Annual IEEE/ACM International Symposium on Microarchitecture, pages 219-230, 2002.
-
(2002)
Proceedings of 35th Annual IEEE/ACM International Symposium on Microarchitecture
, pp. 219-230
-
-
Kim, N.S.1
Flautner, K.2
Blaauw, D.3
Mudge, T.4
-
21
-
-
84861416065
-
Parallelization of particle filter algorithms
-
M., A., M. Goodrum, J., A. Trotter, S. Aksel, T., K. Acton, and Skadron. Parallelization of particle filter algorithms. In 3rd Workshop on Emerging Applications and Many-core Architecture (EAMA), 2010.
-
(2010)
3rd Workshop on Emerging Applications and Many-core Architecture (EAMA)
-
-
Goodrum, M.A.M.1
Trotter, J.A.2
Aksel, S.3
Acton, T.K.4
Skadron5
-
23
-
-
84863342255
-
Improving gpu performance via large warps and two-level warp scheduling
-
ACM
-
V. Narasiman, M. Shebanow, C. J. Lee, R. Miftakhutdinov, O. Mutlu, and Y. N. Patt. Improving gpu performance via large warps and two-level warp scheduling. In Proceedings of the 44th Annual IEEE/ACM International Symposium on Microarchitecture, pages 308-317. ACM, 2011.
-
(2011)
Proceedings of the 44th Annual IEEE/ACM International Symposium on Microarchitecture
, pp. 308-317
-
-
Narasiman, V.1
Shebanow, M.2
Lee, C.J.3
Miftakhutdinov, R.4
Mutlu, O.5
Patt, Y.N.6
-
25
-
-
33748999034
-
-
June
-
S. Park, A. Shrivastava, N. Dutt, A. Nicolau, Y. Paek, and E. Earlie. Bypass aware instruction scheduling for register file power reduction. volume 41, pages 173-181, June 2006.
-
(2006)
Bypass Aware Instruction Scheduling for Register File Power Reduction
, vol.41
, pp. 173-181
-
-
Park, S.1
Shrivastava, A.2
Dutt, N.3
Nicolau, A.4
Paek, Y.5
Earlie, E.6
-
26
-
-
44149102218
-
Register file power reduction using bypass sensitive compiler
-
June
-
S. Park, A. Shrivastava, N. Dutt, A. Nicolau, Y. Paek, and E. Earlie. Register file power reduction using bypass sensitive compiler. IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, 27(6):1155-1159, June 2008.
-
(2008)
IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems
, vol.27
, Issue.6
, pp. 1155-1159
-
-
Park, S.1
Shrivastava, A.2
Dutt, N.3
Nicolau, A.4
Paek, Y.5
Earlie, E.6
-
27
-
-
80455130016
-
Power gating strategies on gpus
-
P.-H. Wang, C.-L. Yang, Y.-M. Chen, and Y.-J. Cheng. Power gating strategies on gpus. ACM Transactions on Architecture and Code Optimimization, 8(3):13:1-13:25, 2011.
-
(2011)
ACM Transactions on Architecture and Code Optimimization
, vol.8
, Issue.3
, pp. 131-1325
-
-
Wang, P.-H.1
Yang, C.-L.2
Chen, Y.-M.3
Cheng, Y.-J.4
-
28
-
-
0020830611
-
A divided word-line structure in the static ram and its application to a 64k full cmos ram
-
October
-
M. Yoshimoto, K. Anami, H. Shinohara, T. Yoshihara, H. Takagi, S. Nagao, S. Kayano, and T. Nakano. A divided word-line structure in the static ram and its application to a 64k full cmos ram. IEEE Journal of Solid-State Circuits, 18(5):479-485, October. 1983.
-
(1983)
IEEE Journal of Solid-State Circuits
, vol.18
, Issue.5
, pp. 479-485
-
-
Yoshimoto, M.1
Anami, K.2
Shinohara, H.3
Yoshihara, T.4
Takagi, H.5
Nagao, S.6
Kayano, S.7
Nakano, T.8
-
29
-
-
80052539481
-
Sram-dram hybrid memory with applications to efficient register files in fine-grained multi-threading
-
W.-k. S. Yu, R. Huang, S. Q. Xu, S.-E. Wang, E. Kan, and G. E. Suh. Sram-dram hybrid memory with applications to efficient register files in fine-grained multi-threading. In Proceedings of the 38th annual International Symposium on Computer Architecture, pages 247-258, 2011.
-
(2011)
Proceedings of the 38th Annual International Symposium on Computer Architecture
, pp. 247-258
-
-
Yu, W.-K.S.1
Huang, R.2
Xu, S.Q.3
Wang, S.-E.4
Kan, E.5
Suh, G.E.6
|