-
1
-
-
44149085697
-
Reducing the interconnection network cost of chip multiprocessors
-
P. Abad, V. Puente, and J. Gregorio, "Reducing the interconnection network cost of chip multiprocessors," in NOCS, 2008.
-
(2008)
NOCS
-
-
Abad, P.1
Puente, V.2
Gregorio, J.3
-
2
-
-
35348819913
-
Rotary router: An efficient architecture for cmp interconnection networks
-
P. Abad, V. Puente, J. Gregorio, and P. Prieto, "Rotary router: an efficient architecture for cmp interconnection networks," in ISCA, 2007.
-
(2007)
ISCA
-
-
Abad, P.1
Puente, V.2
Gregorio, J.3
Prieto, P.4
-
3
-
-
66749188683
-
Power-Aware DRAM Speculation
-
N. Aggarwal, J. Cantin, M. Lipasti, and J. E. Smith, "Power-Aware DRAM Speculation," in HPCA-12, 2008.
-
(2008)
HPCA-12
-
-
Aggarwal, N.1
Cantin, J.2
Lipasti, M.3
Smith, J.E.4
-
4
-
-
27544481926
-
Variability in architectural simulations of multi-threaded workloads
-
A. R. Alameldeen and D. A. Wood, "Variability in architectural simulations of multi-threaded workloads," in Proceedings of HPCA-9, 2003.
-
(2003)
Proceedings of HPCA-9
-
-
Alameldeen, A.R.1
Wood, D.A.2
-
6
-
-
0027309859
-
The performance of cache-coherent ringbased multiprocessors
-
L. A. Barroso and M. Dubois, "The performance of cache-coherent ringbased multiprocessors," in ISCA-20, 1993.
-
(1993)
ISCA-20
-
-
Barroso, L.A.1
Dubois, M.2
-
7
-
-
0032647513
-
Multicast snooping: A new coherence method using a multicast address network
-
May
-
E. Bilir, R. Dickson, Y. Hu, M. Plakal, D. Sorin, M. Hill, and D. Wood, "Multicast snooping: A new coherence method using a multicast address network," in Proc. of ISCA, May 1999.
-
(1999)
Proc. of ISCA
-
-
Bilir, E.1
Dickson, R.2
Hu, Y.3
Plakal, M.4
Sorin, D.5
Hill, M.6
Wood, D.7
-
8
-
-
0005369980
-
Precise and accurate processor simulation
-
H. Cain, K. Lepak, B. Schwarz, and M. H. Lipasti, "Precise and accurate processor simulation," in Workshop On Computer Architecture Evaluation using Commercial Workloads, 2002.
-
(2002)
Workshop On Computer Architecture Evaluation using Commercial Workloads
-
-
Cain, H.1
Lepak, K.2
Schwarz, B.3
Lipasti, M.H.4
-
9
-
-
27544506862
-
Improving multiprocessor performance with coarse-grain coherence tracking
-
J. F. Cantin, M. H. Lipasti, and J. E. Smith, "Improving multiprocessor performance with coarse-grain coherence tracking," in ISCA-32, 2005.
-
(2005)
ISCA-32
-
-
Cantin, J.F.1
Lipasti, M.H.2
Smith, J.E.3
-
11
-
-
0033099692
-
An efficient tree cache coherence protocol for distributed shared memory multiprocessors
-
Y. Chang and L. N. Bhuyan, "An efficient tree cache coherence protocol for distributed shared memory multiprocessors," IEEE Transactions on Computers, vol. 48, no. 3, 1998.
-
(1998)
IEEE Transactions on Computers
, vol.48
, Issue.3
-
-
Chang, Y.1
Bhuyan, L.N.2
-
12
-
-
0025433355
-
Virtual-channel flow control
-
W J. Dally, "Virtual-channel flow control," in ISCA, 1990.
-
(1990)
ISCA
-
-
Dally, W.J.1
-
14
-
-
52649171528
-
Virtual circuit tree multicasting: A case for on-chip hardware multicast support
-
N. Enright Jerger, L.-S. Peh, and M. H. Lipasti, "Virtual circuit tree multicasting: A case for on-chip hardware multicast support," in Proceedings of ISCA-35, 2008.
-
(2008)
Proceedings of ISCA-35
-
-
Enright Jerger, N.1
Peh, L.-S.2
Lipasti, M.H.3
-
15
-
-
47349085587
-
An evaluation of server consolidation workloads for multi-core designs
-
N. Enright Jerger, D. Vanatrease, and M. Lipasti, "An evaluation of server consolidation workloads for multi-core designs," in IISWC, 2007.
-
(2007)
IISWC
-
-
Enright Jerger, N.1
Vanatrease, D.2
Lipasti, M.3
-
16
-
-
0039335269
-
Architecture and design of AlphaServer GS320
-
K. Gharachorloo, M. Sharma, S. Steely, and S. V Doren, "Architecture and design of AlphaServer GS320," in Architectural Support for Programming Languages and Operating Systems, 2000.
-
(2000)
Architectural Support for Programming Languages and Operating Systems
-
-
Gharachorloo, K.1
Sharma, M.2
Steely, S.3
Doren, S.V.4
-
17
-
-
0030685588
-
The SGI Origin: A ccNUMA highly scalable server
-
J. Laudon and D. Lenoski, "The SGI Origin: a ccNUMA highly scalable server," in ISCA-24, 1997.
-
(1997)
ISCA-24
-
-
Laudon, J.1
Lenoski, D.2
-
18
-
-
84968853465
-
Redeeming IPC as a performance metric for multithreaded programs
-
K. M. Lepak, H. W Cain, and M. H. Lipasti, "Redeeming IPC as a performance metric for multithreaded programs," in Proceeding of 12th PACT, 2003, pp. 232-243.
-
(2003)
Proceeding of 12th PACT
, pp. 232-243
-
-
Lepak, K.M.1
Cain, H.W.2
Lipasti, M.H.3
-
19
-
-
0038684776
-
Using destination-set prediction to improve the latency/bandwidth tradeoff in shared-memory multiprocessors
-
June
-
M. M. K. Martin, P. J. Harper, D. J. Sorin, M. D. Hill, and D. A. Wood, "Using destination-set prediction to improve the latency/bandwidth tradeoff in shared-memory multiprocessors," in Proceedings of the 30th ISCA, June 2003.
-
(2003)
Proceedings of the 30th ISCA
-
-
Martin, M.M.K.1
Harper, P.J.2
Sorin, D.J.3
Hill, M.D.4
Wood, D.A.5
-
20
-
-
0038346234
-
Token coherence: Decoupling performance and correctness
-
M. M. K. Martin, M. D. Hill, and D. A. Wood, "Token coherence: Decoupling performance and correctness," in ISCA-30, 2003.
-
(2003)
ISCA-30
-
-
Martin, M.M.K.1
Hill, M.D.2
Wood, D.A.3
-
21
-
-
33745130597
-
Bandwidth adaptive snooping
-
M. M. K. Martin, D. J. Sorin, M. D. Hill, and D. A. Wood, "Bandwidth adaptive snooping," in HPCA-8, 2002.
-
(2002)
HPCA-8
-
-
Martin, M.M.K.1
Sorin, D.J.2
Hill, M.D.3
Wood, D.A.4
-
23
-
-
28444472751
-
Improving multiple-cmp systems using token coherenece
-
February
-
M. R. Marty, J. D. Bingham, M. D. Hill, A. J. Hu, M. M. K. Martin, and D. A. Wood, "Improving multiple-cmp systems using token coherenece," in HPCA, February 2005.
-
(2005)
HPCA
-
-
Marty, M.R.1
Bingham, J.D.2
Hill, M.D.3
Hu, A.J.4
Martin, M.M.K.5
Wood, D.A.6
-
24
-
-
40349100696
-
Coherence ordering for ring-based chip multiprocessors
-
December
-
M. R. Marty and M. D. Hill, "Coherence ordering for ring-based chip multiprocessors," in MICRO-39, December 2006.
-
(2006)
MICRO-39
-
-
Marty, M.R.1
Hill, M.D.2
-
25
-
-
35348900723
-
Virtual hierarchies to support server consolidation
-
M. R. Marty, "Virtual hierarchies to support server consolidation," in ISCA-34, 2007.
-
(2007)
ISCA-34
-
-
Marty, M.R.1
-
26
-
-
27544455733
-
Regionscout: Exploiting coarse grain sharing in snoop-based coherence
-
A. Moshovos, "Regionscout: Exploiting coarse grain sharing in snoop-based coherence." in ISCA-32, 2005.
-
(2005)
ISCA-32
-
-
Moshovos, A.1
-
27
-
-
0036167929
-
The Alpha 21364 network architecture
-
S. S. Mukherjee, P. Bannon, S. Lang, A. Spink, and D. Webb, "The Alpha 21364 network architecture," IEEE Micro, vol. 22, no. 1, pp. 26-35, 2002.
-
(2002)
IEEE Micro
, vol.22
, Issue.1
, pp. 26-35
-
-
Mukherjee, S.S.1
Bannon, P.2
Lang, S.3
Spink, A.4
Webb, D.5
-
28
-
-
0002979865
-
The scalable tree protocol - a cache coherence approach for large-scale multiprocessors
-
H. Nilsson and P. Stenstrom, "The scalable tree protocol - a cache coherence approach for large-scale multiprocessors," in IPDPS, 1992.
-
(1992)
IPDPS
-
-
Nilsson, H.1
Stenstrom, P.2
-
29
-
-
25844437046
-
Power5 system microarchitecture
-
B. Sinharoy, R. Kalla, J. Tendler, R. Eickemeyer, and J. Joyner, "Power5 system microarchitecture," IBM Journal of Research and Development, vol. 49, no. 4, 2005.
-
(2005)
IBM Journal of Research and Development
, vol.49
, Issue.4
-
-
Sinharoy, B.1
Kalla, R.2
Tendler, J.3
Eickemeyer, R.4
Joyner, J.5
-
30
-
-
24644502365
-
-
SPEC
-
SPEC, "SPEC benchmarks," http://www.spec.org.
-
SPEC benchmarks
-
-
-
32
-
-
47349125701
-
Uncorq: Unconstrained snoop request delivery in embedded-ring multiprocessors
-
K. Strauss, "Uncorq: Unconstrained snoop request delivery in embedded-ring multiprocessors," in MICRO-40, 2007.
-
(2007)
MICRO-40
-
-
Strauss, K.1
-
33
-
-
84871283702
-
-
TPC
-
TPC, "TPC benchmarks," http://www.tpc.org.
-
TPC benchmarks
-
-
-
34
-
-
34548858682
-
An 80-tile 1.28 tflops network-on-chip in 65nm cmos
-
S. Vangal, J. Howard, G. Ruhl, S. Dighe, H. Wilson, J. Tschanz, D. Finan, P. Iyer, A. Singh, T. Jacob, S. Jain, S. Venkataraman, Y. Hoskote, and N. Borkar, "An 80-tile 1.28 tflops network-on-chip in 65nm cmos," in IEEE International Solid State Circuit Conference, 2007.
-
(2007)
IEEE International Solid State Circuit Conference
-
-
Vangal, S.1
Howard, J.2
Ruhl, G.3
Dighe, S.4
Wilson, H.5
Tschanz, J.6
Finan, D.7
Iyer, P.8
Singh, A.9
Jacob, T.10
Jain, S.11
Venkataraman, S.12
Hoskote, Y.13
Borkar, N.14
-
35
-
-
36849030305
-
-
D. Wentzlaff, P. Griffin, H. Hoffman, L. Bao, B. Edwards, C. Ramey, M. Mattina, C.-C. Miao, J. B. III, and A. Agarwal, On-chip interconnection architecture of the tile processor, IEEE Micro, pp. 15-31, 2007.
-
D. Wentzlaff, P. Griffin, H. Hoffman, L. Bao, B. Edwards, C. Ramey, M. Mattina, C.-C. Miao, J. B. III, and A. Agarwal, "On-chip interconnection architecture of the tile processor," IEEE Micro, pp. 15-31, 2007.
-
-
-
-
36
-
-
0029194459
-
The SPLASH-2 programs: Characterization and methodological considerations
-
June
-
S. Woo, M. Ohara, E. Torrie, J. Singh, and A. Gupta, "The SPLASH-2 programs: Characterization and methodological considerations," in ISCA-22, June 1995.
-
(1995)
ISCA-22
-
-
Woo, S.1
Ohara, M.2
Torrie, E.3
Singh, J.4
Gupta, A.5
-
37
-
-
47349115313
-
A framework for coarse-grain optimizations in the on-chip memory hierarchy
-
J. Zebchuk, E. Safi, and A. Moshovos, "A framework for coarse-grain optimizations in the on-chip memory hierarchy," in MICRO-40, 2007.
-
(2007)
MICRO-40
-
-
Zebchuk, J.1
Safi, E.2
Moshovos, A.3
|