-
1
-
-
0029254155
-
Myrinet - A Gigabit-per-Second Local-Area Network
-
N. J. Boden, D. Cohen, R. E. Felderman, A. E. Kulawik, C. L. Seitz, J. N. Seizovic, and W. king Su. Myrinet - A Gigabit-per-Second Local-Area Network. IEEE Micro, 1995.
-
(1995)
IEEE Micro
-
-
Boden, N.J.1
Cohen, D.2
Felderman, R.E.3
Kulawik, A.E.4
Seitz, C.L.5
Seizovic, J.N.6
king Su, W.7
-
3
-
-
36348965353
-
The Power of Priority: NoC Based Distributed Cache Coherency
-
E. Bolotin, Z. Guz, I. Cidon, R. Ginosar, and A. Kolodny. The Power of Priority: NoC Based Distributed Cache Coherency. In NOCS'07, 2007.
-
(2007)
NOCS'07
-
-
Bolotin, E.1
Guz, Z.2
Cidon, I.3
Ginosar, R.4
Kolodny, A.5
-
4
-
-
0029666638
-
-
A. A. Chien and J. H. Kim. Rotating Combined Queueing (RCQ): Bandwidth and Latency Guarantees in Low-Cost, High-Performance Networks. ISCA-23, 1996.
-
A. A. Chien and J. H. Kim. Rotating Combined Queueing (RCQ): Bandwidth and Latency Guarantees in Low-Cost, High-Performance Networks. ISCA-23, 1996.
-
-
-
-
6
-
-
0024889726
-
Analysis and simulation of a fair queueing algorithm
-
A. Demers, S. Keshav, and S. Shenker. Analysis and simulation of a fair queueing algorithm. In SIGCOMM, 1989.
-
(1989)
SIGCOMM
-
-
Demers, A.1
Keshav, S.2
Shenker, S.3
-
7
-
-
0030662863
-
-
J. Dundas and T. Mudge. Improving data cache performance by pre-executing instructions under a cache miss. In ICS-11, 1997.
-
J. Dundas and T. Mudge. Improving data cache performance by pre-executing instructions under a cache miss. In ICS-11, 1997.
-
-
-
-
8
-
-
47249094055
-
System-level performance metrics for multiprogram workloads
-
May-June
-
S. Eyerman and L. Eeckhout. System-level performance metrics for multiprogram workloads. IEEE Micro, May-June 2008.
-
(2008)
IEEE Micro
-
-
Eyerman, S.1
Eeckhout, L.2
-
9
-
-
0034844926
-
Focusing processor policies via critical-path prediction
-
B. Fields, S. Rubin, and R. Bodík. Focusing processor policies via critical-path prediction. In ISCA-25, 2001.
-
(2001)
ISCA-25
-
-
Fields, B.1
Rubin, S.2
Bodík, R.3
-
11
-
-
4644285853
-
MLP Yes! ILP No! Memory Level Parallelism, or, Why I No Longer Worry About IPC
-
A. Glew. MLP Yes! ILP No! Memory Level Parallelism, or, Why I No Longer Worry About IPC. In ASPLOS Wild and Crazy Ideas Session, 1998.
-
(1998)
ASPLOS Wild and Crazy Ideas Session
-
-
Glew, A.1
-
13
-
-
47349129525
-
Flattened butterfly topology for on-chip networks
-
J. Kim, J. Balfour, and W. Dally. Flattened butterfly topology for on-chip networks. MICRO-40, 2007.
-
(2007)
MICRO-40
-
-
Kim, J.1
Balfour, J.2
Dally, W.3
-
14
-
-
35348908288
-
A novel dimensionally-decomposed router for on-chip communication in 3D architectures
-
J. Kim, C. Nicopoulos, D. Park, R. Das, Y. Xie, V. Narayanan, M. S. Yousif, and C. R. Das. A novel dimensionally-decomposed router for on-chip communication in 3D architectures. ISCA-34, 2007.
-
(2007)
ISCA-34
-
-
Kim, J.1
Nicopoulos, C.2
Park, D.3
Das, R.4
Xie, Y.5
Narayanan, V.6
Yousif, M.S.7
Das, C.R.8
-
15
-
-
84904279959
-
Lockup-free instruction fetch/prefetch cache organization
-
D. Kroft. Lockup-free instruction fetch/prefetch cache organization. In ISCA-8, 1981.
-
(1981)
ISCA-8
-
-
Kroft, D.1
-
16
-
-
35348858651
-
Express virtual channels: Towards the ideal interconnection fabric
-
A. Kumar, L.-S. Peh, P. Kundu, and N. K. Jha. Express virtual channels: Towards the ideal interconnection fabric. In ISCA-34, 2007.
-
(2007)
ISCA-34
-
-
Kumar, A.1
Peh, L.-S.2
Kundu, P.3
Jha, N.K.4
-
17
-
-
52649094492
-
Globally-Synchronized Frames for Guaranteed Quality-of-Service in On-Chip Networks
-
J. W. Lee, M. C. Ng, and K. Asanovic. Globally-Synchronized Frames for Guaranteed Quality-of-Service in On-Chip Networks. In ISCA-35, 2008.
-
(2008)
ISCA-35
-
-
Lee, J.W.1
Ng, M.C.2
Asanovic, K.3
-
18
-
-
33644903196
-
Efficient runahead execution: Power-efficient memory latency tolerance
-
O. Mutlu, H. Kim, and Y. N. Patt. Efficient runahead execution: Power-efficient memory latency tolerance. IEEE Micro, 2006.
-
(2006)
IEEE Micro
-
-
Mutlu, O.1
Kim, H.2
Patt, Y.N.3
-
19
-
-
47349122373
-
Stall-Time Fair Memory Access Scheduling for Chip Multiprocessors
-
O. Mutlu and T. Moscibroda. Stall-Time Fair Memory Access Scheduling for Chip Multiprocessors. In MICRO-41, 2007.
-
(2007)
MICRO-41
-
-
Mutlu, O.1
Moscibroda, T.2
-
20
-
-
52649119398
-
Parallelism-Aware Batch Scheduling: Enhancing both Performance and Fairness of Shared DRAM Systems
-
O. Mutlu and T. Moscibroda. Parallelism-Aware Batch Scheduling: Enhancing both Performance and Fairness of Shared DRAM Systems. In ISCA-35, 2008.
-
(2008)
ISCA-35
-
-
Mutlu, O.1
Moscibroda, T.2
-
21
-
-
84955506994
-
Runahead execution: An alternative to very large instruction windows for out-of-order processors
-
O. Mutlu, J. Stark, C. Wilkerson, and Y. Patt. Runahead execution: an alternative to very large instruction windows for out-of-order processors. In HPCA-9, 2003.
-
(2003)
HPCA-9
-
-
Mutlu, O.1
Stark, J.2
Wilkerson, C.3
Patt, Y.4
-
22
-
-
40349107206
-
Vichar: A dynamic virtual channel regulator for network-on-chip routers
-
C. A. Nicopoulos, D. Park, J. Kim, N. Vijaykrishnan, M. S. Yousif, and C. R. Das. Vichar: A dynamic virtual channel regulator for network-on-chip routers. In MICRO-39, 2006.
-
(2006)
MICRO-39
-
-
Nicopoulos, C.A.1
Park, D.2
Kim, J.3
Vijaykrishnan, N.4
Yousif, M.S.5
Das, C.R.6
-
24
-
-
76749103698
-
Pinpointing representative portions of large intel ocombining double acute accentitaniumocombining double acute accent programs with dynamic instrumentation
-
H. Patil, R. Cohn, M. Charney, R. Kapoor, A. Sun, and A. Karunanidhi. Pinpointing representative portions of large intel ocombining double acute accentitaniumocombining double acute accent programs with dynamic instrumentation. In MICRO-37, 2004.
-
(2004)
MICRO-37
-
-
Patil, H.1
Cohn, R.2
Charney, M.3
Kapoor, R.4
Sun, A.5
Karunanidhi, A.6
-
26
-
-
84893753441
-
Trade-offs in the design of a router with both guaranteed and best-effort services for networks on chip
-
E. Rijpkema, K. Goossens, A. Radulescu, J. Dielissen, J. van Meerbergen, P. Wielage, and E. Waterlander. Trade-offs in the design of a router with both guaranteed and best-effort services for networks on chip. DATE, 2003.
-
(2003)
DATE
-
-
Rijpkema, E.1
Goossens, K.2
Radulescu, A.3
Dielissen, J.4
van Meerbergen, J.5
Wielage, P.6
Waterlander, E.7
-
27
-
-
0034443570
-
Symbiotic jobscheduling for a simultaneous multithreading processor
-
A. Snavely and D. Tullsen. Symbiotic jobscheduling for a simultaneous multithreading processor. In ASPLOS-8, 2000.
-
(2000)
ASPLOS-8
-
-
Snavely, A.1
Tullsen, D.2
-
28
-
-
27544447688
-
Load latency tolerance in dynamically scheduled processors
-
S. T. Srinivasan and A. R. Lebeck. Load latency tolerance in dynamically scheduled processors. In MICRO-31, 1998.
-
(1998)
MICRO-31
-
-
Srinivasan, S.T.1
Lebeck, A.R.2
-
29
-
-
64949119281
-
Criticality-based optimizations for efficient load processing
-
S. Subramaniam, A. Bracy, H. Wang, and G. Loh. Criticality-based optimizations for efficient load processing. In HPCA-15, 2009.
-
(2009)
HPCA-15
-
-
Subramaniam, S.1
Bracy, A.2
Wang, H.3
Loh, G.4
-
32
-
-
0034850359
-
QoS provisioning in clusters: An investigation of router and NIC design
-
K. H. Yum, E. J. Kim, and C. Das. QoS provisioning in clusters: an investigation of router and NIC design. In ISCA-28, 2001.
-
(2001)
ISCA-28
-
-
Yum, K.H.1
Kim, E.J.2
Das, C.3
-
33
-
-
85030153179
-
Virtual clock: A new traffic control algorithm for packet switching networks
-
L. Zhang. Virtual clock: a new traffic control algorithm for packet switching networks. SIGCOMM, 1990.
-
(1990)
SIGCOMM
-
-
Zhang, L.1
|