-
1
-
-
0029180378
-
The MIT Alewife machine: Architecture and performance
-
A. Agarwal, R. Bianchini, D. Chaiken, K. L. Johnson, D. Kranz, J. Kubiatowicz, B.-H. Lim, K. Mackenzie, and D. Yeung. The MIT Alewife machine: architecture and performance. In Proc. of ISCA-22, pages 2-13, 1995.
-
(1995)
Proc. of ISCA-22
, pp. 2-13
-
-
Agarwal, A.1
Bianchini, R.2
Chaiken, D.3
Johnson, K.L.4
Kranz, D.5
Kubiatowicz, J.6
Lim, B.-H.7
Mackenzie, K.8
Yeung, D.9
-
2
-
-
0025431380
-
April: A processor architecture for multiprocessing
-
A. Agarwal, B.-H. Lim, D. Kranz, and J. Kubiatowicz. April: a processor architecture for multiprocessing. SIGARCH Comput. Archit. News, 18(3a):104-114, 1990.
-
(1990)
SIGARCH Comput. Archit. News
, vol.18
, Issue.3 A
, pp. 104-114
-
-
Agarwal, A.1
Lim, B.-H.2
Kranz, D.3
Kubiatowicz, J.4
-
3
-
-
2342460917
-
Perceptron-based branch confidence estimation
-
H. Akkary, S. T. Srinivasan, R. Koltur, Y. Patil, and W. Refaai. Perceptron-based branch confidence estimation. In Proc. of HPCA-10, page 265, 2004.
-
(2004)
Proc. of HPCA-10
, pp. 265
-
-
Akkary, H.1
Srinivasan, S.T.2
Koltur, R.3
Patil, Y.4
Refaai, W.5
-
4
-
-
27544432558
-
The impact of performance asymmetry in emerging multicore architectures
-
S. Balakrishnan, R. Rajwar, M. Upton, and K. Lai. The impact of performance asymmetry in emerging multicore architectures. In Proc. of ISCA-32, pages 506-517, 2005.
-
(2005)
Proc. of ISCA-32
, pp. 506-517
-
-
Balakrishnan, S.1
Rajwar, R.2
Upton, M.3
Lai, K.4
-
5
-
-
33749413981
-
-
C. Bazeghi, F. J. Mesa-Martinez, and J. Renau. uComplexity: Estimating processor design effort. In Proc. of MICRO-38, pages 209-218, 2005.
-
-
-
-
6
-
-
0034312472
-
A multithreaded PowerPC processor for commercial servers
-
J. M. Borkenhagen, R. J. Eickemeyer, R. N. Kalla, and S. R. Kunkel. A multithreaded PowerPC processor for commercial servers. IBM Journal of Research and Development, 44(6):885-898, 2000.
-
(2000)
IBM Journal of Research and Development
, vol.44
, Issue.6
, pp. 885-898
-
-
Borkenhagen, J.M.1
Eickemeyer, R.J.2
Kalla, R.N.3
Kunkel, S.R.4
-
8
-
-
21644443801
-
Dynamically controlled resource allocation in SMT processors
-
F. J. Cazorla, A. Ramirez, M. Valero, and E. Fernandez. Dynamically controlled resource allocation in SMT processors. In Proc. of MICRO-37, pages 171-182, 2004.
-
(2004)
Proc. of MICRO-37
, pp. 171-182
-
-
Cazorla, F.J.1
Ramirez, A.2
Valero, M.3
Fernandez, E.4
-
9
-
-
0036949391
-
A stateless, content-directed data prefetching mechanism
-
R. Cooksey, S. Jourdan, and D. Grunwald. A stateless, content-directed data prefetching mechanism. SIGOPS Oper. Syst. Rev., 36(5):279-290, 2002.
-
(2002)
SIGOPS Oper. Syst. Rev
, vol.36
, Issue.5
, pp. 279-290
-
-
Cooksey, R.1
Jourdan, S.2
Grunwald, D.3
-
11
-
-
33746683732
-
Maximizing CMP throughput with mediocre cores
-
J. D. Davis, J. Laudon, and K. Olukotun. Maximizing CMP throughput with mediocre cores. In Proc. of PACT-14, pages 51-62, 2005.
-
(2005)
Proc. of PACT-14
, pp. 51-62
-
-
Davis, J.D.1
Laudon, J.2
Olukotun, K.3
-
12
-
-
0042352254
-
Borrowed-virtual-time (BVT) scheduling: Supporting latency-sensitive threads in a general-purpose scheduler
-
K. J. Duda and D. R. Cheriton. Borrowed-virtual-time (BVT) scheduling: supporting latency-sensitive threads in a general-purpose scheduler. In Proc. of the 7th ACM symposium on Operating systems principles, pages 261-276, 1999.
-
(1999)
Proc. of the 7th ACM symposium on Operating systems principles
, pp. 261-276
-
-
Duda, K.J.1
Cheriton, D.R.2
-
13
-
-
77950618712
-
Evaluation of multithreaded processors and thread-switch policies
-
R. J. Eickemeyer, R. E. Johnson, S. R. Kunkel, B.-H. Lim, M. S. Squillante, and C.-F. E. Wu. Evaluation of multithreaded processors and thread-switch policies. In Proc. of the International Symposium on High Performance Computing (ISHPC '97), pages 75-90, 1997.
-
(1997)
Proc. of the International Symposium on High Performance Computing (ISHPC '97)
, pp. 75-90
-
-
Eickemeyer, R.J.1
Johnson, R.E.2
Kunkel, S.R.3
Lim, B.-H.4
Squillante, M.S.5
Wu, C.-F.E.6
-
14
-
-
40349112792
-
Prophet/critic hybrid branch prediction
-
A. Falcon, J. Stark, A. Ramirez, K. Lai, and M. Valero. Prophet/critic hybrid branch prediction. SIGARCH Comput. Archit News, 32(2):250, 2004.
-
(2004)
SIGARCH Comput. Archit News
, vol.32
, Issue.2
, pp. 250
-
-
Falcon, A.1
Stark, J.2
Ramirez, A.3
Lai, K.4
Valero, M.5
-
15
-
-
0026152056
-
Strategies for achieving improved processor throughput
-
M. K. Farrens and A. R. Pleszkun. Strategies for achieving improved processor throughput. In Proc. of ISCA-18, pages 362-369, 1991.
-
(1991)
Proc. of ISCA-18
, pp. 362-369
-
-
Farrens, M.K.1
Pleszkun, A.R.2
-
16
-
-
34249828455
-
Introduction to Intel Core Duo processor architecture
-
May
-
S. Gochman, A. Mendelson, A. Naveh, and E. Rotem. Introduction to Intel Core Duo processor architecture. Intel Technology Journal, May 2006.
-
(2006)
Intel Technology Journal
-
-
Gochman, S.1
Mendelson, A.2
Naveh, A.3
Rotem, E.4
-
18
-
-
0026158290
-
Comparative evaluation of latency reducing and tolerating techniques
-
A. Gupta, J. Hennessy, K. Gharachorloo, T. Mowry, and W.-D. Weber. Comparative evaluation of latency reducing and tolerating techniques. SIGARCH Comput. Archit. News, 19(3):254-263, 1991.
-
(1991)
SIGARCH Comput. Archit News
, vol.19
, Issue.3
, pp. 254-263
-
-
Gupta, A.1
Hennessy, J.2
Gharachorloo, K.3
Mowry, T.4
Weber, W.-D.5
-
19
-
-
0002284699
-
Intel's P6 uses decoupled superscalar design
-
9(2), February 1995
-
L. Gwennap. Intel's P6 uses decoupled superscalar design. Microprocessor Report, 9(2), February 1995.
-
Microprocessor Report
-
-
Gwennap, L.1
-
20
-
-
0035000415
-
Inexpensive throughput enhancement in small-scale embedded microprocessors with block multithreading: Extensions, characterization, and tradeoffs
-
J. W. Haskins and J. Skadron. Inexpensive throughput enhancement in small-scale embedded microprocessors with block multithreading: Extensions, characterization, and tradeoffs. In Proc. of the 20th IEEE International Performance, Computing, and Communications Conference, pages 319-328, 2001.
-
(2001)
Proc. of the 20th IEEE International Performance, Computing, and Communications Conference
, pp. 319-328
-
-
Haskins, J.W.1
Skadron, J.2
-
21
-
-
0345813853
-
CPU reservations and time constraints: Efficient, predictable scheduling of independent activities
-
M. B. Jones, D. Roşu, and M.-C. Roşu. CPU reservations and time constraints: efficient, predictable scheduling of independent activities. ACM SIGOPS Operating Systems Review, 31(5):198-211, 1997.
-
(1997)
ACM SIGOPS Operating Systems Review
, vol.31
, Issue.5
, pp. 198-211
-
-
Jones, M.B.1
Roşu, D.2
Roşu, M.-C.3
-
22
-
-
33749393519
-
The Cell processor architecture
-
J. Kahle. The Cell processor architecture. In Proc. of MICRO-38, page 3, 2005.
-
(2005)
Proc. of MICRO-38
, pp. 3
-
-
Kahle, J.1
-
23
-
-
3042669130
-
IBM Power5 chip: A dual-core multithreaded processor
-
R. Kalla, B. Sinharoy, and J. Tendler. IBM Power5 chip: A dual-core multithreaded processor. In Proc. of MICRO-37, pages 40-47, 2004.
-
(2004)
Proc. of MICRO-37
, pp. 40-47
-
-
Kalla, R.1
Sinharoy, B.2
Tendler, J.3
-
24
-
-
10444238444
-
Fair cache sharing and partitioning in a chip multiprocessor architecture
-
S. Kim, D. Chandra, and Y. Solihin. Fair cache sharing and partitioning in a chip multiprocessor architecture. In Proc. of PACT-13, pages 111-122, 2004.
-
(2004)
Proc. of PACT-13
, pp. 111-122
-
-
Kim, S.1
Chandra, D.2
Solihin, Y.3
-
25
-
-
20344374162
-
Niagara: A 32-way multithreaded Sparc processor
-
P. Kongetira, K. Aingaran, and K. Olukotun. Niagara: A 32-way multithreaded Sparc processor. IEEE Micro, 25(2):21-29, 2005.
-
(2005)
IEEE Micro
, vol.25
, Issue.2
, pp. 21-29
-
-
Kongetira, P.1
Aingaran, K.2
Olukotun, K.3
-
27
-
-
35348869633
-
Intel looks to core for success
-
March
-
K. Krewell. Intel looks to core for success. Microprocessor Report, March 2006.
-
(2006)
Microprocessor Report
-
-
Krewell, K.1
-
28
-
-
28244437702
-
Heterogeneous chip multiprocessors
-
R. Kumar, D. M. Tullsen, N. P. Jouppi, and P. Ranganathan. Heterogeneous chip multiprocessors. Computer, 38(11):32-38, 2005.
-
(2005)
Computer
, vol.38
, Issue.11
, pp. 32-38
-
-
Kumar, R.1
Tullsen, D.M.2
Jouppi, N.P.3
Ranganathan, P.4
-
30
-
-
0001087280
-
Hyper-threading technology architecture and microarchitecture
-
February
-
D. Marr, F. Binns, D. Hill, G. Hinton, D. Koufaty, J. Miller, and M. Upton. Hyper-threading technology architecture and microarchitecture. Intel Technology Journal, 6, February 2002.
-
(2002)
Intel Technology Journal
, vol.6
-
-
Marr, D.1
Binns, F.2
Hill, D.3
Hinton, G.4
Koufaty, D.5
Miller, J.6
Upton, M.7
-
31
-
-
20344403770
-
Montecito: A dual-core, dual-thread Itanium processor
-
C. McNairy and R. Bhatia. Montecito: A dual-core, dual-thread Itanium processor. IEEE Micro, 25(2):10-20, 2005.
-
(2005)
IEEE Micro
, vol.25
, Issue.2
, pp. 10-20
-
-
McNairy, C.1
Bhatia, R.2
-
33
-
-
84955506994
-
Runahead execution: An alternative to very large instruction windows for out-of-order processors
-
O. Mutlu, J. Stark, C. Wilkerson, and Y. N. Patt. Runahead execution: An alternative to very large instruction windows for out-of-order processors. In Proc. of HPCA-9, page 129, 2003.
-
(2003)
Proc. of HPCA-9
, pp. 129
-
-
Mutlu, O.1
Stark, J.2
Wilkerson, C.3
Patt, Y.N.4
-
34
-
-
40349108377
-
-
S. E. Raasch and S. K. Reinhardt. Applications of thread prioritization in SMT processors. In Proc. of the Workshop on Multithreaded Execution and Compilation, 1999.
-
-
-
-
35
-
-
84968718478
-
The impact of resource partitioning on SMT processors
-
S. E. Raasch and S. K. Reinhardt. The impact of resource partitioning on SMT processors. In Proc. of PACT-12, page 15, 2003.
-
(2003)
Proc. of PACT-12
, pp. 15
-
-
Raasch, S.E.1
Reinhardt, S.K.2
-
36
-
-
33746696007
-
Multi-core to the masses
-
J. Rattner. Multi-core to the masses. In Proc. of PACT-14, page 3, 2005.
-
(2005)
Proc. of PACT-14
, pp. 3
-
-
Rattner, J.1
-
37
-
-
0030171894
-
Efficient fair queueing using deficit round-robin
-
M. Shreedhar and G. Varghese. Efficient fair queueing using deficit round-robin. IEEE/ACM Trans. Netw., 4(3):375-385, 1996.
-
(1996)
IEEE/ACM Trans. Netw
, vol.4
, Issue.3
, pp. 375-385
-
-
Shreedhar, M.1
Varghese, G.2
-
38
-
-
28744440545
-
Performance analysis and validation of the Intel Pentium4 processor on 90nm technology
-
February
-
R. Singhal, K. Venkatraman, E. Cohn, J. Holm, D. Koufaty, M.-J. Lin, M. Madhav, M. Mattwandel, N. Nidhi, J. Pearce, and M. Seshadri. Performance analysis and validation of the Intel Pentium4 processor on 90nm technology. Intel Technology Journal, 8, February 2004.
-
(2004)
Intel Technology Journal
, vol.8
-
-
Singhal, R.1
Venkatraman, K.2
Cohn, E.3
Holm, J.4
Koufaty, D.5
Lin, M.-J.6
Madhav, M.7
Mattwandel, M.8
Nidhi, N.9
Pearce, J.10
Seshadri, M.11
-
40
-
-
28444468099
-
Chip multithreading: Opportunities and challenges
-
L. Spracklen and S. G. Abraham. Chip multithreading: opportunities and challenges. In Proc. of HPCA-11, pages 248-252, 2005.
-
(2005)
Proc. of HPCA-11
, pp. 248-252
-
-
Spracklen, L.1
Abraham, S.G.2
-
42
-
-
0036298603
-
-
J. M. Tendler, J. S. Dodson, J. S. Fields, H. L. Jr., and B. Sinharoy. POWER4 system microarchitecture, IBM Journal of Research and Development, 46(1):5-26, 2002.
-
-
-
-
43
-
-
84976769480
-
The effectiveness of multiple hardware contexts
-
R. Thekkath and S. J. Eggers. The effectiveness of multiple hardware contexts. SIGPLAN Not., 29(11):328-337, 1994.
-
(1994)
SIGPLAN Not
, vol.29
, Issue.11
, pp. 328-337
-
-
Thekkath, R.1
Eggers, S.J.2
-
44
-
-
0035696665
-
Handling long-latency loads in a simultaneous multithreading processor
-
D. M. Tullsen and J. Brown. Handling long-latency loads in a simultaneous multithreading processor. In Proc. of MICRO-34, pages 318-327, 2001.
-
(2001)
Proc. of MICRO-34
, pp. 318-327
-
-
Tullsen, D.M.1
Brown, J.2
-
45
-
-
0029666641
-
Exploiting choice: Instruction fetch and issue on an implementable simultaneous multithreading processor
-
D. M. Tullsen, S. Eggers, J. Emer, H. Levy, J. Lo, and R. Stamm. Exploiting choice: Instruction fetch and issue on an implementable simultaneous multithreading processor. In Proc. of ISCA-23, pages 191-202, 1996.
-
(1996)
Proc. of ISCA-23
, pp. 191-202
-
-
Tullsen, D.M.1
Eggers, S.2
Emer, J.3
Levy, H.4
Lo, J.5
Stamm, R.6
-
46
-
-
40349091187
-
-
D. M. Tullsen, S. Eggers, and H. Levy. Simultaneous multithreading: maximizing on-chip parallelism, In ISCA '98: 25 years of the international symposia on Computer architecture (selected papers), pages 533-544, 1998
-
-
-
-
47
-
-
21644481490
-
Balanced multithreading: Increasing throughput via a low cost multithreading hierarchy
-
E. Tune, R. Kumar, D. M. Tullsen, and B. Calder. Balanced multithreading: Increasing throughput via a low cost multithreading hierarchy. In Proc. of MICRO-37, pages 183-194, 2004.
-
(2004)
Proc. of MICRO-37
, pp. 183-194
-
-
Tune, E.1
Kumar, R.2
Tullsen, D.M.3
Calder, B.4
-
48
-
-
2042458649
-
A survey of processors with explicit multithreading
-
T. Ungerer, B. Robic, and J. Silc. A survey of processors with explicit multithreading. ACM Comput. Surv., 35(1):29-63, 2003.
-
(2003)
ACM Comput. Surv
, vol.35
, Issue.1
, pp. 29-63
-
-
Ungerer, T.1
Robic, B.2
Silc, J.3