-
1
-
-
0037331006
-
Simulating a $2M commercial server on a $2K PC
-
Feb.
-
A. R. Alameldeen, M. M. K. Martin, C. J. Mauer, K. E. Moore, M. Xu, D. J. Sorin, M. D. Hill, and D. A. Wood. Simulating a $2M commercial server on a $2K PC. IEEE Computer, 36(2):50-57, Feb. 2003.
-
(2003)
IEEE Computer
, vol.36
, Issue.2
, pp. 50-57
-
-
Alameldeen, A.R.1
Martin, M.M.K.2
Mauer, C.J.3
Moore, K.E.4
Xu, M.5
Sorin, D.J.6
Hill, M.D.7
Wood, D.A.8
-
2
-
-
85007632138
-
A cache coherence approach for large multiprocessor systems
-
J. K. Archibald. A cache coherence approach for large multiprocessor systems. In the 2nd ICS, pages 337-345, 1988.
-
(1988)
The 2nd ICS
, pp. 337-345
-
-
Archibald, J.K.1
-
3
-
-
84900342836
-
SPECOMP: A new benchmark suite for measuring parallel computer performance
-
V. Aslot, M. J. Domeika, R. Eigenmann, G. Gaertner, W. B. Jones, and B. Parady. SPECOMP: A new benchmark suite for measuring parallel computer performance. In the International Workshop on OpenMP Applications and Tools, pages 1-10, 2001.
-
(2001)
The International Workshop on OpenMP Applications and Tools
, pp. 1-10
-
-
Aslot, V.1
Domeika, M.J.2
Eigenmann, R.3
Gaertner, G.4
Jones, W.B.5
Parady, B.6
-
4
-
-
0023672138
-
On the inclusion properties for multi-level cache hierarchies
-
J.-L. Baer and W.-H. Wang. On the inclusion properties for multi-level cache hierarchies. In the 15th ISCA, pages 73-80, 1988.
-
(1988)
The 15th ISCA
, pp. 73-80
-
-
Baer, J.-L.1
Wang, W.-H.2
-
5
-
-
0033722744
-
Piranha: A scalable architecture based on single-chip multiprocessing
-
June
-
L. A. Barroso, K. Gharachorloo, R. McNamara, A. Nowatzyk, S. Qadeer, B. Sano, S. Smith, R. Stets, and B. Verghese. Piranha: A scalable architecture based on single-chip multiprocessing. In the 27th ISCA, pages 282-293, June 2000.
-
(2000)
The 27th ISCA
, pp. 282-293
-
-
Barroso, L.A.1
Gharachorloo, K.2
McNamara, R.3
Nowatzyk, A.4
Qadeer, S.5
Sano, B.6
Smith, S.7
Stets, R.8
Verghese, B.9
-
6
-
-
21644472427
-
Managing wire delay in large chip-multiprocessor caches
-
Dec.
-
B. M. Beckmann and D. A. Wood. Managing wire delay in large chip-multiprocessor caches. In the 37th MICRO, pages 319-330, Dec. 2004.
-
(2004)
The 37th MICRO
, pp. 319-330
-
-
Beckmann, B.M.1
Wood, D.A.2
-
7
-
-
84944411840
-
Distance associativity for high-performance energy-efficient non-uniform cache architectures
-
Dec
-
Z. Chishti, M. D. Powell, and T. N. Vijaykumar. Distance associativity for high-performance energy-efficient non-uniform cache architectures. In the 36th MICRO, pages 55-66, Dec 2003.
-
(2003)
The 36th MICRO
, pp. 55-66
-
-
Chishti, Z.1
Powell, M.D.2
Vijaykumar, T.N.3
-
8
-
-
27544432313
-
Optimizing replication, communication and capacity allocation in CMPs
-
June
-
Z. Chishti, M. D. Powell, and T. N. Vijaykumar. Optimizing replication, communication and capacity allocation in CMPs. In the 32th ISCA, pages 357-368, June 2005.
-
(2005)
The 32th ISCA
, pp. 357-368
-
-
Chishti, Z.1
Powell, M.D.2
Vijaykumar, T.N.3
-
9
-
-
85020058991
-
Cooperative caching: Using remote client memory to improve file system performance
-
Nov
-
M. Dahlin, R. Wang, T. E. Anderson, and D. A. Patterson. Cooperative caching: Using remote client memory to improve file system performance. In the 1st OSDI, pages 267-280, Nov 1994.
-
(1994)
The 1st OSDI
, pp. 267-280
-
-
Dahlin, M.1
Wang, R.2
Anderson, T.E.3
Patterson, D.A.4
-
10
-
-
0034206002
-
Summary cache: A scalable wide-area web cache sharing protocol
-
L. Fan, P. Cao, J. Almeida, and A. Z. Broder. Summary cache: a scalable wide-area web cache sharing protocol. IEEE Transactions on Networking, 8(3):281-293, 2000.
-
(2000)
IEEE Transactions on Networking
, vol.8
, Issue.3
, pp. 281-293
-
-
Fan, L.1
Cao, P.2
Almeida, J.3
Broder, A.Z.4
-
11
-
-
84883539106
-
Implementing global memory management in a workstation cluster
-
Dec
-
M. J. Feeley, W. E. Morgan, E. P. Pighin, A. R. Karlin, H. M. Levy, and C. A. Thekkath. Implementing global memory management in a workstation cluster. In the 15th SOSP, pages 201-212, Dec 1995.
-
(1995)
The 15th SOSP
, pp. 201-212
-
-
Feeley, M.J.1
Morgan, W.E.2
Pighin, E.P.3
Karlin, A.R.4
Levy, H.M.5
Thekkath, C.A.6
-
12
-
-
84945711902
-
DDM: A cache-only memory architecture
-
E. Hagersten, A. Landin, and S. Haridi. DDM: A cache-only memory architecture. IEEE Computer, 25(9):44-54, 1992.
-
(1992)
IEEE Computer
, vol.25
, Issue.9
, pp. 44-54
-
-
Hagersten, E.1
Landin, A.2
Haridi, S.3
-
15
-
-
32844471317
-
A NUCA substrate for flexible CMP cache sharing
-
June
-
J. Huh, C. Kim, H. Shafi, L. Zhang, D. Burger, and S. W. Keckler. A NUCA substrate for flexible CMP cache sharing. In the 19th ICS, pages 31-40, June 2005.
-
(2005)
The 19th ICS
, pp. 31-40
-
-
Huh, J.1
Kim, C.2
Shafi, H.3
Zhang, L.4
Burger, D.5
Keckler, S.W.6
-
16
-
-
8344246922
-
CQoS: A framework for enabling QoS in shared caches of CMP platforms
-
June
-
R. Iyer. CQoS: a framework for enabling QoS in shared caches of CMP platforms. In the 18th ICS, pages 257-266, June 2004.
-
(2004)
The 18th ICS
, pp. 257-266
-
-
Iyer, R.1
-
17
-
-
0036949388
-
An adaptive, non-uniform cache structure for wire-delay dominated on-chip caches
-
Oct
-
C. Kim, D. Burger, and S. W. Keckler. An adaptive, non-uniform cache structure for wire-delay dominated on-chip caches. In ASPLOS-X, pages 211-222, Oct, 2002.
-
(2002)
ASPLOS-X
, pp. 211-222
-
-
Kim, C.1
Burger, D.2
Keckler, S.W.3
-
19
-
-
20344374162
-
Niagara: A 32-way multithreaded SPARC processor
-
P. Kongetira, K. Aingaran, and K. Olukotun. Niagara: A 32-way multithreaded SPARC processor. IEEE Micro, 25(2):21-29, 2005.
-
(2005)
IEEE Micro
, vol.25
, Issue.2
, pp. 21-29
-
-
Kongetira, P.1
Aingaran, K.2
Olukotun, K.3
-
20
-
-
2342468635
-
Organizing the last line of defense before hitting the memory wall for CMPs
-
Feb.
-
C. Liu, A. Sivasubramaniam, and M. Kandemir. Organizing the last line of defense before hitting the memory wall for CMPs. In HPCA-10, pages 176-185, Feb. 2004.
-
(2004)
HPCA-10
, pp. 176-185
-
-
Liu, C.1
Sivasubramaniam, A.2
Kandemir, M.3
-
21
-
-
0036469676
-
Simics: A full system simulation platform
-
Feb
-
P. Magnusson, M. Christensson, J. Eskilson, D. Forsgren, G. Hållberg, J. Högberg, F. Larsson, A. Moestedt, and B. Werner. Simics: A full system simulation platform. IEEE Computer, 35(2):50-58, Feb 2002.
-
(2002)
IEEE Computer
, vol.35
, Issue.2
, pp. 50-58
-
-
Magnusson, P.1
Christensson, M.2
Eskilson, J.3
Forsgren, D.4
Hållberg, G.5
Högberg, J.6
Larsson, F.7
Moestedt, A.8
Werner, B.9
-
22
-
-
33748870886
-
Multifacet's general execution-driven multiprocessor simulator (GEMS) toolset
-
M. M. Martin, D. J. Sorin, B. M. Beckmann, M. R. Marty, M. Xu, A. R. Alameldeen, K. E. Moore, M. D. Hill, and D. A. Wood. Multifacet's general execution-driven multiprocessor simulator (GEMS) toolset. Computer Architecture News, 2005.
-
(2005)
Computer Architecture News
-
-
Martin, M.M.1
Sorin, D.J.2
Beckmann, B.M.3
Marty, M.R.4
Xu, M.5
Alameldeen, A.R.6
Moore, K.E.7
Hill, M.D.8
Wood, D.A.9
-
23
-
-
0038346234
-
Token coherence: Decoupling performance and correctness
-
June
-
M. M. K. Martin, M. D. Hill, and D. A. Wood. Token coherence: Decoupling performance and correctness. In the 30th ISCA, pages 182-193, June 2003.
-
(2003)
The 30th ISCA
, pp. 182-193
-
-
Martin, M.M.K.1
Hill, M.D.2
Wood, D.A.3
-
24
-
-
28444472751
-
Improving multiple-CMP systems using token coherence
-
Feb
-
M. R. Marty, J. D. Bingham, M. D. Hill, A. J. Hu, M. M. K. Martin, and D. A. Wood. Improving multiple-CMP systems using token coherence. In HPCA-11, pages 328-339, Feb 2005.
-
(2005)
HPCA-11
, pp. 328-339
-
-
Marty, M.R.1
Bingham, J.D.2
Hill, M.D.3
Hu, A.J.4
Martin, M.M.K.5
Wood, D.A.6
-
25
-
-
0035265873
-
High-throughput coherence control and hardware messaging m Everest
-
A. K. Nanda, A.-T. Nguyen, M. M. Michael, and D. J. Joseph. High-throughput coherence control and hardware messaging m Everest. IBM Journal of Research and Development, 45(2), 2001.
-
(2001)
IBM Journal of Research and Development
, vol.45
, Issue.2
-
-
Nanda, A.K.1
Nguyen, A.-T.2
Michael, M.M.3
Joseph, D.J.4
-
26
-
-
0029666647
-
Evaluation of design alternatives for a multiprocessor microprocessor
-
May
-
B. A. Nayfeh, L. Hammond, and K. Olukotun. Evaluation of design alternatives for a multiprocessor microprocessor. In the 23rd ISCA, pages 67-77, May 1996.
-
(1996)
The 23rd ISCA
, pp. 67-77
-
-
Nayfeh, B.A.1
Hammond, L.2
Olukotun, K.3
-
27
-
-
85008031236
-
MinneSPEC: A new spec benchmark workload for simulation-based computer architecture research
-
June
-
A. K. Osowski and D. J. Lilja. MinneSPEC: A new spec benchmark workload for simulation-based computer architecture research. Computer Architecture Letters, June 2002.
-
(2002)
Computer Architecture Letters
-
-
Osowski, A.K.1
Lilja, D.J.2
-
28
-
-
0021160872
-
A low-overhead coherence solution for multiprocessors with private cache memories
-
M. S. Papamarcos and J. H. Patel. A low-overhead coherence solution for multiprocessors with private cache memories. In the 11th ISCA, pages 348-354, 1984.
-
(1984)
The 11th ISCA
, pp. 348-354
-
-
Papamarcos, M.S.1
Patel, J.H.2
-
29
-
-
27644555246
-
The V-way cache: Demand based associativity via global replacement
-
June
-
M. K. Qureshi, D. Thompson, and Y. N. Patt. The V-way cache: Demand based associativity via global replacement. In the 32nd ISCA, pages 544-555, June 2005.
-
(2005)
The 32nd ISCA
, pp. 544-555
-
-
Qureshi, M.K.1
Thompson, D.2
Patt, Y.N.3
-
30
-
-
0002267369
-
An argument for simple COMA
-
Jan
-
A. Saulsbury, T. Wilkinson, J. Carter, and A. Landin. An argument for simple COMA. In HPCA 1, pages 276-285, Jan, 1995.
-
(1995)
HPCA 1
, pp. 276-285
-
-
Saulsbury, A.1
Wilkinson, T.2
Carter, J.3
Landin, A.4
-
31
-
-
0029700352
-
A shared-bus control mechanism and a cache coherence protocol for a high-performance on-chip multiprocessor
-
Feb
-
M. Takahashi, H. Takano, E. Kaneko, and S. Suzuki. A shared-bus control mechanism and a cache coherence protocol for a high-performance on-chip multiprocessor. In HPCA 2, pages 314-322, Feb 1996.
-
(1996)
HPCA 2
, pp. 314-322
-
-
Takahashi, M.1
Takano, H.2
Kaneko, E.3
Suzuki, S.4
-
32
-
-
0036298603
-
IBM Power4 system microarchitecture
-
J. M. Tendler, J. S. Dodson, J. S. F. Jr., H. Le, and B. Sinharoy. IBM Power4 system microarchitecture. IBM Journal of Research and Development, 46(1):5-26, 2002.
-
(2002)
IBM Journal of Research and Development
, vol.46
, Issue.1
, pp. 5-26
-
-
Tendler, J.M.1
Dodson, J.S.2
F. Jr., J.S.3
Le, H.4
Sinharoy, B.5
-
33
-
-
0031612547
-
Performance isolation: Sharing and isolation in shared-memory multiprocessors
-
Oct
-
B. Verghese, A. Gupta, and M. Rosenblum. Performance isolation: Sharing and isolation in shared-memory multiprocessors. In ASPLOS-VIII, pages 181-192, Oct, 1998.
-
(1998)
ASPLOS-VIII
, pp. 181-192
-
-
Verghese, B.1
Gupta, A.2
Rosenblum, M.3
-
34
-
-
29144463717
-
Fast and fair: Data-stream quality of service
-
Sep
-
T. Y. Yeh and G. Reinman. Fast and fair: data-stream quality of service. In CASES'05, pages 237-248, Sep 2005.
-
(2005)
CASES'05
, pp. 237-248
-
-
Yeh, T.Y.1
Reinman, G.2
-
35
-
-
27544495466
-
Victim replication: Maximizing capacity while hiding wire delay in tiled CMPs
-
June
-
M. Zhang and K. Asanovic. Victim replication: Maximizing capacity while hiding wire delay in tiled CMPs. In the 32th ISCA, pages 336-345, June 2005.
-
(2005)
The 32th ISCA
, pp. 336-345
-
-
Zhang, M.1
Asanovic, K.2
|