-
1
-
-
35348872507
-
Transparent control independence (TCI)
-
June
-
A. S. Al-Zawawi, V. K. R eddy, E. Rotenberg, and H. H. Akkary. Transparent control independence (TCI). SIGAR CH Comput. Archit. News, 35:448-459, June 2007.
-
(2007)
SIGAR CH Comput. Archit. News
, vol.35
, pp. 448-459
-
-
Al-Zawawi, A.S.1
Eddy, V.K.R.2
Rotenberg, E.3
Akkary, H.H.4
-
2
-
-
70649092154
-
Rodinia: A benchmark suite for heterogeneous computing.
-
S. Che, M. Boyer, J. Meng, D. Tarjan, J. W. Sheaffer, S.-H. Lee, and K. Skadron. Rodinia: A benchmark suite for heterogeneous computing. IEEE Workload Characterization Symposium, 0:44-54, 2009.
-
(2009)
IEEE Workload Characterization Symposium
, pp. 44-54
-
-
Che, S.1
Boyer, M.2
Meng, J.3
Tarjan, D.4
Sheaffer, J.W.5
Lee, S.-H.6
Skadron, K.7
-
3
-
-
78049512154
-
Barra: A parallel functional simulator for GPGPU
-
S. Collange, M. Daumas, D. Defour, and D. Parello. Barra: a parallel functional simulator for GPGPU. In IEEE International Symposium on Modeling, Analysis and Simulation of Computer and Telecommunication Systems (MASCOTS), pages 351-360, 2010.
-
(2010)
IEEE International Symposium on Modeling, Analysis and Simulation of Computer and Telecommunication Systems (MASCOTS)
, pp. 351-360
-
-
Collange, S.1
Daumas, M.2
Defour, D.3
Parello, D.4
-
8
-
-
84856515692
-
PEPSC: A power-efficient processor for scientific computing
-
G. Dasika, A. Sethia, T. Mudge, and S. Mahlke. PEPSC: A power-efficient processor for scientific computing. In PACT, 2011.
-
(2011)
PACT
-
-
Dasika, G.1
Sethia, A.2
Mudge, T.3
Mahlke, S.4
-
10
-
-
84863351470
-
SIMD re-convergence at thread frontiers
-
December
-
G. Diamos, A. Kerr, H. Wu, S. Yalamanchili, B. Ashbaugh, and S. Maiyuran. SIMD re-convergence at thread frontiers. In MICRO 44: Proceedings of the 44th annual IEEE/ACM International Symposium on Microarchitecture, December 2011.
-
(2011)
MICRO 44: Proceedings of the 44th Annual IEEE/ACM International Symposium on Microarchitecture
-
-
Diamos, G.1
Kerr, A.2
Wu, H.3
Yalamanchili, S.4
Ashbaugh, B.5
Maiyuran, S.6
-
11
-
-
70449647744
-
CASH: Revisiting hardware sharing in single-chip parallel processor
-
R. Dolbeau and A. Seznec. CASH: Revisiting hardware sharing in single-chip parallel processor. Journal of Instruction-Level Parallelism, 6:1-16, 2004.
-
(2004)
Journal of Instruction-Level Parallelism
, vol.6
, pp. 1-16
-
-
Dolbeau, R.1
Seznec, A.2
-
15
-
-
68549096107
-
Dynamic warp formation: Efficient MIMD control flow on SIMD graphics hardware
-
July
-
W. W. L. Fung, I. Sham, G. Yuan, and T. M. Aamodt. Dynamic warp formation: Efficient MIMD control flow on SIMD graphics hardware. ACM Trans. Archit. Code Optim., 6:7:1-7:37, July 2009.
-
(2009)
ACM Trans. Archit. Code Optim.
, vol.6
, pp. 71-737
-
-
Fung, W.W.L.1
Sham, I.2
Yuan, G.3
Aamodt, T.M.4
-
16
-
-
80052533471
-
Energyefficient mechanisms for managing thread context in throughput processors
-
M. Gebhart, D. R. Johnson, D. Tarjan, S. W. Keckler, W. J. Dally, E. Lindholm, and K. Skadron. Energyefficient mechanisms for managing thread context in throughput processors. In Proceeding of the 38th annual international symposium on Computer architecture, pages 235-246, 2011.
-
(2011)
Proceeding of the 38th Annual International Symposium on Computer Architecture
, pp. 235-246
-
-
Gebhart, M.1
Johnson, D.R.2
Tarjan, D.3
Keckler, S.W.4
Dally, W.J.5
Lindholm, E.6
Skadron, K.7
-
18
-
-
15044343841
-
The Vector- Thread architecture
-
R. Krashinsky, C. Batten, M. Hampton, S. Gerding, B. Pharris, J. Casper, and K. Asanovic. The Vector- Thread architecture. IEEE MICRO, 24(6):84-90, 2004.
-
(2004)
IEEE MICRO
, vol.24
, Issue.6
, pp. 84-90
-
-
Krashinsky, R.1
Batten, C.2
Hampton, M.3
Gerding, S.4
Pharris, B.5
Casper, J.6
Asanovic, K.7
-
21
-
-
77954995885
-
Debunking the 100X GPU vs. CPU myth: An evaluation of throughput computing on CPU and GPU
-
V. W. Lee, C. Kim, J. Chhugani, M. Deisher, D. Kim, A. D. Nguyen, N. Satish, M. Smelyanskiy, S. Chennupaty, P. Hammarlund, R. Singhal, and P. Dubey. Debunking the 100X GPU vs. CPU myth: an evaluation of throughput computing on CPU and GPU. In ISCA '10: Proceedings of the 37th annual international symposium on Computer architecture, pages 451-460, 2010.
-
(2010)
ISCA ' 10: Proceedings of the 37th Annual International Symposium on Computer Architecture
, pp. 451-460
-
-
Lee, V.W.1
Kim, C.2
Chhugani, J.3
Deisher, M.4
Kim, D.5
Nguyen, A.D.6
Satish, N.7
Smelyanskiy, M.8
Chennupaty, S.9
Hammarlund, P.10
Singhal, R.11
Dubey, P.12
-
22
-
-
44849137198
-
NVIDIA Tesla: A unified graphics and computing architecture
-
J. E. Lindholm, J. Nickolls, S. Oberman, and J. Montrym. NVIDIA Tesla: A unified graphics and computing architecture. IEEE Micro, 28(2):39-55, 2008.
-
(2008)
IEEE Micro
, vol.28
, Issue.2
, pp. 39-55
-
-
Lindholm, J.E.1
Nickolls, J.2
Oberman, S.3
Montrym, J.4
-
23
-
-
79951689916
-
Minimal multi-threading: Finding and removing redundant instructions in multithreaded processors
-
G. Long, D. Franklin, S. Biswas, P. Ortiz, J. Oberg, D. Fan, and F. T. Chong. Minimal multi-threading: Finding and removing redundant instructions in multithreaded processors. In Proceedings of the 2010 43rd Annual IEEE/ACM International Symposium on Microarchitecture, MICRO '43, pages 337-348, 2010.
-
(2010)
Proceedings of the 2010 43rd Annual IEEE/ACM International Symposium on Microarchitecture, MICRO '43
, pp. 337-348
-
-
Long, G.1
Franklin, D.2
Biswas, S.3
Ortiz, P.4
Oberg, J.5
Fan, D.6
Chong, F.T.7
-
24
-
-
77954976292
-
Dynamic warp subdivision for int rated branch and memory divergence tolerance
-
J. Meng, D. Tarjan, and K. Skadron. Dynamic warp subdivision for int rated branch and memory divergence tolerance. SIGARCH Comput. Archit. News, 38(3):235-246, 2010.
-
(2010)
SIGARCH Comput. Archit. News
, vol.38
, Issue.3
, pp. 235-246
-
-
Meng, J.1
Tarjan, D.2
Skadron, K.3
-
25
-
-
84864829539
-
Scheduler in multi-threaded processor prioritizing instructions passing qualification rule
-
US Patent 7949855, May
-
P. C. Mills, J. E. Lindholm, B. W. Coon, G. M. Tarolli, and J. M. Burgess. Scheduler in multi-threaded processor prioritizing instructions passing qualification rule. US Patent 7949855, May 2011.
-
(2011)
-
-
Mills, P.C.1
Lindholm, J.E.2
Coon, B.W.3
Tarolli, G.M.4
Burgess, J.M.5
-
26
-
-
84863342255
-
Improving GPU performance via large warps and two-level warp scheduling
-
December
-
V. Narasiman, C. J. Lee, M. Shebanow, R. Miftakhutdinov, O. Mutlu, and Y. N. Patt. Improving GPU performance via large warps and two-level warp scheduling. In MICRO 44: Proceedings of the 44th annual IEEE/ACM International Symposium on Microarchitecture, December 2011.
-
(2011)
MICRO 44: Proceedings of the 44th Annual IEEE/ACM International Symposium on Microarchitecture
-
-
Narasiman, V.1
Lee, C.J.2
Shebanow, M.3
Miftakhutdinov, R.4
Mutlu, O.5
Patt, Y.N.6
-
27
-
-
77951154340
-
The GPU computing era
-
March
-
J. Nickolls and W. J. Dally. The GPU computing era. IEEE Micro, 30:56-69, March 2010.
-
(2010)
IEEE Micro
, vol.30
, pp. 56-69
-
-
Nickolls, J.1
Dally, W.J.2
-
28
-
-
85184640695
-
-
NVIDIA CUDA SDK, 2010. http://www. nvidia.com/cuda/.
-
(2010)
-
-
-
29
-
-
33644661238
-
Contentaddressable memory (CAM) circuits and architectures: A tutorial and survey
-
march
-
K. Pagiamtzis and A. Sheikholeslami. Contentaddressable memory (CAM) circuits and architectures: a tutorial and survey. IEEE Journal of Solid-State Circuits, 41(3):712-727, march 2006.
-
(2006)
IEEE Journal of Solid-State Circuits
, vol.41
, Issue.3
, pp. 712-727
-
-
Pagiamtzis, K.1
Sheikholeslami, A.2
-
30
-
-
34547456450
-
Vector lane threading
-
S. Rivoire, R. Schultz, T. Okuda, and C. Kozyrakis. Vector lane threading. In Proceedings of the 2006 International Conference on Parallel Processing, ICPP '06, pages 55-64, 2006.
-
(2006)
Proceedings of the 2006 International Conference on Parallel Processing, ICPP '06
, pp. 55-64
-
-
Rivoire, S.1
Schultz, R.2
Okuda, T.3
Kozyrakis, C.4
-
31
-
-
0029183524
-
Simultaneous multithreading: Maximizing on-chip parallelism
-
May
-
D. M. Tullsen, S. J. Eggers, and H. M. Levy. Simultaneous multithreading: maximizing on-chip parallelism. SIGARCH Comput. Archit. News, 23:392-403, May 1995.
-
(1995)
SIGARCH Comput. Archit. News
, vol.23
, pp. 392-403
-
-
Tullsen, D.M.1
Eggers, S.J.2
Levy, H.M.3
|