-
2
-
-
78049338046
-
The single-chip cloud computer
-
April
-
M. Baron. The single-chip cloud computer. Microprocessor Report, April 2010.
-
(2010)
Microprocessor Report
-
-
Baron, M.1
-
3
-
-
70649115688
-
Understanding PARSEC performance on contemporary CMPs
-
Austin, TX
-
M. Bhadauria, V. M.Weaver, and S. A. McKee. Understanding PARSEC performance on contemporary CMPs. In IISWC, 2009, pages 98-107, Austin, TX.
-
(2009)
IISWC
, pp. 98-107
-
-
Bhadauria, M.1
Weaver, V.M.2
McKee, S.A.3
-
4
-
-
63549095070
-
The parsec benchmark suite: Characterization and architectural implications
-
C. Bienia, S. Kumar, J. P. Singh, and K. Li. The parsec benchmark suite: Characterization and architectural implications. In PACT '08.
-
PACT '08
-
-
Bienia, C.1
Kumar, S.2
Singh, J.P.3
Li, K.4
-
5
-
-
3242815471
-
Scaling to the end of silicon with EDGE architectures
-
and the TRIPS Team. July
-
D. Burger, S. W. Keckler, K. S. McKinley, M. Dahlin, L. K. John, C. Lin, C. R. Moore, J. Burrill, R. G. McDonald, W. Yoder, and the TRIPS Team. Scaling to the end of silicon with EDGE architectures. IEEE Computer, 37(7):44-55, July 2004.
-
(2004)
IEEE Computer
, vol.37
, Issue.7
, pp. 44-55
-
-
Burger, D.1
Keckler, S.W.2
McKinley, K.S.3
Dahlin, M.4
John, L.K.5
Lin, C.6
Moore, C.R.7
Burrill, J.8
McDonald, R.G.9
Yoder, W.10
-
6
-
-
84944416023
-
The reconfigurable streaming vector processor (rsvptm)
-
S. Ciricescu, R. Essick, B. Lucas, P. May, K. Moat, J. Norris, M. Schuette, and A. Saidi. The reconfigurable streaming vector processor (rsvptm). In MICRO 36, page 141, 2003.
-
(2003)
MICRO 36
, pp. 141
-
-
Ciricescu, S.1
Essick, R.2
Lucas, B.3
May, P.4
Moat, K.5
Norris, J.6
Schuette, M.7
Saidi, A.8
-
7
-
-
27544482359
-
An architecture framework for transparent instruction set customization in embedded processors
-
N. Clark, J. Blome, M. Chu, S. Mahlke, S. Biles, and K. Flautner. An architecture framework for transparent instruction set customization in embedded processors. In ISCA '05, pages 272-283, 2005.
-
(2005)
ISCA '05
, pp. 272-283
-
-
Clark, N.1
Blome, J.2
Chu, M.3
Mahlke, S.4
Biles, S.5
Flautner, K.6
-
8
-
-
52649095061
-
Veal: Virtualized execution accelerator for loops
-
N. Clark, A. Hormati, and S. Mahlke. Veal: Virtualized execution accelerator for loops. In ISCA '08, pages 389-400, 2008.
-
(2008)
ISCA '08
, pp. 389-400
-
-
Clark, N.1
Hormati, A.2
Mahlke, S.3
-
9
-
-
21644435314
-
Application-specific processing on a general-purpose core via transparent instruction set customization
-
N. Clark, M. Kudlur, H. Park, S. Mahlke, and K. Flautner. Application-specific processing on a general-purpose core via transparent instruction set customization. In MICRO 37, pages 30-40, 2004.
-
(2004)
MICRO 37
, pp. 30-40
-
-
Clark, N.1
Kudlur, M.2
Park, H.3
Mahlke, S.4
Flautner, K.5
-
10
-
-
0026243790
-
Efficiently computing static single assignment form and the control dependence graph
-
Oct
-
R. Cytron, J. Ferrante, B. K. Rosen, M. N. Wegman, and F. K. Zadeck. Efficiently computing static single assignment form and the control dependence graph. ACMTOPLAS, 13(4):451-490, Oct 1991.
-
(1991)
ACMTOPLAS
, vol.13
, Issue.4
, pp. 451-490
-
-
Cytron, R.1
Ferrante, J.2
Rosen, B.K.3
Wegman, M.N.4
Zadeck, F.K.5
-
11
-
-
64949125875
-
Toward a Multicore Architecture for Real-time Ray-tracing
-
V. Govindaraju, P. Djeu, K. Sankaralingam, M. Vernon, and W. R. Mark. Toward a Multicore Architecture for Real-time Ray-tracing. In MICRO '08, pages 176-187.
-
MICRO '08
, pp. 176-187
-
-
Govindaraju, V.1
Djeu, P.2
Sankaralingam, K.3
Vernon, M.4
Mark, W.R.5
-
14
-
-
70350708988
-
Ambric'S New Parallel Processor - Globally Asynchronous Architecture Eases Parallel Programming
-
October
-
T. R. Halfhill. Ambric'S New Parallel Processor - Globally Asynchronous Architecture Eases Parallel Programming. Microprocessor Report, October 2006.
-
(2006)
Microprocessor Report
-
-
Halfhill, T.R.1
-
15
-
-
79955922294
-
MathStar Challenges FPGAs
-
July
-
T. R. Halfill. MathStar Challenges FPGAs. Microprocessor Report, 20(7):29-35, July 2006.
-
(2006)
Microprocessor Report
, vol.20
, Issue.7
, pp. 29-35
-
-
Halfill, T.R.1
-
16
-
-
77954995378
-
Understanding sources of inefficiency in general-purpose chips
-
R. Hameed, W. Qadeer, M. Wachs, O. Azizi, A. Solomatnikov, B. C. Lee, S. Richardson, C. Kozyrakis, and M. Horowitz. Understanding sources of inefficiency in general-purpose chips. In ISCA, pages 37-47, 2010.
-
(2010)
ISCA
, pp. 37-47
-
-
Hameed, R.1
Qadeer, W.2
Wachs, M.3
Azizi, O.4
Solomatnikov, A.5
Lee, B.C.6
Richardson, S.7
Kozyrakis, C.8
Horowitz, M.9
-
18
-
-
79953126016
-
Navigo: An early-stage model to study power-constrained architectures and specialization
-
M. Hempstead, G.-Y. Wei, and D. Brooks. Navigo: An early-stage model to study power-constrained architectures and specialization. In Workshop on Modeling, Benchmarking, and Simulation, 2009.
-
Workshop on Modeling, Benchmarking, and Simulation, 2009
-
-
Hempstead, M.1
Wei, G.-Y.2
Brooks, D.3
-
19
-
-
40349103955
-
Memory prefetching using adaptive stream detection
-
I. Hur and C. Lin. Memory prefetching using adaptive stream detection. In MICRO 39, pages 397-408.
-
MICRO 39
, pp. 397-408
-
-
Hur, I.1
Lin, C.2
-
20
-
-
58049200369
-
Comparative architectural characterization of SPEC CPU2000 and CPU2006 benchmarks on the intel core2 duo processor
-
A. Kejariwal, A. V. Veidenbaum, A. Nicolau, X. Tian, M. Girkar, H. Saito, and U. Banerjee. Comparative architectural characterization of SPEC CPU2000 and CPU2006 benchmarks on the intel core2 duo processor. In International Conference on Embedded Computer Systems: Architectures, Modeling, and Simulation, pages 132-141.
-
International Conference on Embedded Computer Systems: Architectures, Modeling, and Simulation
, pp. 132-141
-
-
Kejariwal, A.1
Veidenbaum, A.V.2
Nicolau, A.3
Tian, X.4
Girkar, M.5
Saito, H.6
Banerjee, U.7
-
21
-
-
0036949290
-
Design and evaluation of compiler algorithms for pre-execution
-
DOI 10.1145/635508.605415
-
D. Kim and D. Yeung. Design and evaluation of compiler algorithms for pre-execution. SIGPLAN Not., 37(10):159-170, 2002. (Pubitemid 44892231)
-
(2002)
Operating Systems Review (ACM)
, vol.36
, Issue.5
, pp. 159-170
-
-
Kim, D.1
Yeung, D.2
-
22
-
-
0020126847
-
The burroughs scientific processor (bsp)
-
May
-
D. J. Kuck and R. A. Stokes. The burroughs scientific processor (bsp). IEEE Trans. Comput., 31:363-376, May 1982.
-
(1982)
IEEE Trans. Comput.
, vol.31
, pp. 363-376
-
-
Kuck, D.J.1
Stokes, R.A.2
-
23
-
-
3042658703
-
LLVM: A compilation framework for life-long program analysis & transformation
-
C. Lattner and V. Adve. LLVM: A compilation framework for life-long program analysis & transformation. In CGO '04, pages 75-88.
-
CGO '04
, pp. 75-88
-
-
Lattner, C.1
Adve, V.2
-
24
-
-
35348861326
-
Comparing memory systems for chip multiprocessors
-
J. Leverich, H. Arakida, A. Solomatnikov, A. Firoozshahian, M. Horowitz, and C. Kozyrakis. Comparing memory systems for chip multiprocessors. In ISCA '07, pages 358-368.
-
ISCA '07
, pp. 358-368
-
-
Leverich, J.1
Arakida, H.2
Solomatnikov, A.3
Firoozshahian, A.4
Horowitz, M.5
Kozyrakis, C.6
-
25
-
-
0026980852
-
Effective compiler support for predicated execution using the hyperblock
-
S. Mahlke, D. Lin, W. Chen, R. Hank, and R. Bringmann. Effective compiler support for predicated execution using the hyperblock. In ISCA '92, pages 45-54.
-
ISCA '92
, pp. 45-54
-
-
Mahlke, S.1
Lin, D.2
Chen, W.3
Hank, R.4
Bringmann, R.5
-
26
-
-
0031617466
-
An auto-backgate-controlled MT-CMOS circuit
-
H. Makino, Y. Tujihashi, K. Nii, C. Morishima, and Y. Hayakawa. An auto-backgate-controlled MT-CMOS circuit. In Proceedings Symposium on VLSI Circuits, pages 42-43, 1998.
-
(1998)
Proceedings Symposium on VLSI Circuits
, pp. 42-43
-
-
Makino, H.1
Tujihashi, Y.2
Nii, K.3
Morishima, C.4
Hayakawa, Y.5
-
27
-
-
33748870886
-
Multifacet's General Execution-driven Multiprocessor Simulator (GEMS) Toolset
-
M. M. Martin, D. J. Sorin, B. M. Beckmann, M. R. Marty, M. Xu, A. R. Alameldeen, K. E. Moore, M. D. Hill, , and D. A.Wood. Multifacet's General Execution-driven Multiprocessor Simulator (GEMS) Toolset. Computer Architecture News (CAN), 2005.
-
(2005)
Computer Architecture News (CAN)
-
-
Martin, M.M.1
Sorin, D.J.2
Beckmann, B.M.3
Marty, M.R.4
Xu, M.5
Alameldeen, A.R.6
Moore, K.E.7
Hill, M.D.8
Wood, D.A.9
-
28
-
-
16244399540
-
A loop accelerator for low power embedded vliw processors
-
B. Mathew and A. Davis. A loop accelerator for low power embedded vliw processors. In CODES+ISSS '04, pages 6-11.
-
CODES+ISSS '04
, pp. 6-11
-
-
Mathew, B.1
Davis, A.2
-
29
-
-
34547456544
-
Tartan: Evaluating spatial computation for whole program execution
-
M. Mishra, T. J. Callahan, T. Chelcea, G. Venkataramani, S. C. Goldstein, and M. Budiu. Tartan: evaluating spatial computation for whole program execution. In ASPLOS-XII, pages 163-174.
-
ASPLOS-XII
, pp. 163-174
-
-
Mishra, M.1
Callahan, T.J.2
Chelcea, T.3
Venkataramani, G.4
Goldstein, S.C.5
Budiu, M.6
-
31
-
-
49249086142
-
Larrabee: A many-core x86 architecture for visual computing
-
L. Seiler, D. Carmean, E. Sprangle, T. Forsyth, M. Abrash, P. Dubey, S. Junkins, A. Lake, J. Sugerman, R. Cavin, R. Espasa, E. Grochowski, T. Juan, and P. Hanrahan. Larrabee: a many-core x86 architecture for visual computing. In ACM SIGGRAPH 2008, pages 18:1-18:15.
-
ACM SIGGRAPH 2008
-
-
Seiler, L.1
Carmean, D.2
Sprangle, E.3
Forsyth, T.4
Abrash, M.5
Dubey, P.6
Junkins, S.7
Lake, A.8
Sugerman, J.9
Cavin, R.10
Espasa, R.11
Grochowski, E.12
Juan, T.13
Hanrahan, P.14
-
32
-
-
85046463637
-
Decoupled access/execute computer architectures
-
J. E. Smith. Decoupled access/execute computer architectures. In ISCA '82, pages 112-119, 1982.
-
(1982)
ISCA '82
, pp. 112-119
-
-
Smith, J.E.1
-
33
-
-
79955919759
-
-
SPEC CPU2006, Standard Performance Evaluation Corporation
-
SPEC CPU2006, Standard Performance Evaluation Corporation.
-
-
-
-
35
-
-
40349106434
-
Fire-and-forget: Load/store scheduling with no store queue at all
-
S. Subramaniam and G. H. Loh. Fire-and-forget: Load/store scheduling with no store queue at all. In MICRO 39, pages 273-284, 2006.
-
(2006)
MICRO 39
, pp. 273-284
-
-
Subramaniam, S.1
Loh, G.H.2
-
37
-
-
0036505033
-
The RAWMicroprocessor: A Computational Fabric for Software Circuits and General-Purpose Programs
-
March
-
M. B. Taylor et al. The RAWMicroprocessor: A Computational Fabric for Software Circuits and General-Purpose Programs, IEEE Micro, 22(2):25-35, March, 2002.
-
(2002)
IEEE Micro
, vol.22
, Issue.2
, pp. 25-35
-
-
Taylor, M.B.1
-
39
-
-
0033703884
-
Chimaera: A high-performance architecture with a tightly-coupled reconfigurable functional unit
-
Z. A. Ye, A. Moshovos, S. Hauck, and P. Banerjee. Chimaera: a high-performance architecture with a tightly-coupled reconfigurable functional unit. In ISCA '00, pages 225-235.
-
ISCA '00
, pp. 225-235
-
-
Ye, Z.A.1
Moshovos, A.2
Hauck, S.3
Banerjee, P.4
-
40
-
-
0033707298
-
Understanding the backward slices of performance degrading instructions
-
C. B. Zilles and G. S. Sohi. Understanding the backward slices of performance degrading instructions. In ISCA '00, pages 172-181.
-
ISCA '00
, pp. 172-181
-
-
Zilles, C.B.1
Sohi, G.S.2
|