-
1
-
-
0030783438
-
An evaluation of fine-grain producer-initiated communication in cache-coherent multiprocessors
-
San Antonio, TX
-
H. Abdel-Shafi, J. Hall, S. V. Adve, and V. S. Adve. An evaluation of fine-grain producer-initiated communication in cache-coherent multiprocessors. In 3rd IEEE Symposium on High-Performance Computer Architecture (HPCA '97), pages 204-215, San Antonio, TX, 1997.
-
(1997)
3rd IEEE Symposium on High-Performance Computer Architecture (HPCA '97)
, pp. 204-215
-
-
Abdel-Shafi, H.1
Hall, J.2
Adve, S.V.3
Adve, V.S.4
-
2
-
-
27544478808
-
Mambo - a full system simulator for the PowerPC architecture
-
Mar
-
P. Bohrer, M. Elnozahy, A. Gheith, C. Lefurgy, T. Nakra, J. Peterson, R. Rajamony, R. Rockhold, H. Shafi, R. Simpson, E. Speight, K. Sudeep, E. V. Hensbergen, and L. Zhang. Mambo - a full system simulator for the PowerPC architecture. ACM SIGMETRICS Performance Evaluation Review, 31(4):8-12, Mar. 2004.
-
(2004)
ACM SIGMETRICS Performance Evaluation Review
, vol.31
, Issue.4
, pp. 8-12
-
-
Bohrer, P.1
Elnozahy, M.2
Gheith, A.3
Lefurgy, C.4
Nakra, T.5
Peterson, J.6
Rajamony, R.7
Rockhold, R.8
Shafi, H.9
Simpson, R.10
Speight, E.11
Sudeep, K.12
Hensbergen, E.V.13
Zhang, L.14
-
3
-
-
46349112166
-
Method and apparatus for accelerating Input/Output processing using cache injections,
-
Mar, US PatentNo. US 6,711,650 B1
-
P. Bohrer, R. Rajamony, and H. Shafi. Method and apparatus for accelerating Input/Output processing using cache injections, Mar. 2004. US PatentNo. US 6,711,650 B1.
-
(2004)
-
-
Bohrer, P.1
Rajamony, R.2
Shafi, H.3
-
4
-
-
0033708935
-
Semicoarsening multigrid on distributed memory machines
-
P. N. Brown, R. D. Falgout, and J. E. Jones. Semicoarsening multigrid on distributed memory machines. SIAM Journal on Scientific Computing, 21(5):1823-1834, 2000.
-
(2000)
SIAM Journal on Scientific Computing
, vol.21
, Issue.5
, pp. 1823-1834
-
-
Brown, P.N.1
Falgout, R.D.2
Jones, J.E.3
-
5
-
-
0033097556
-
Producer-consumer communication in distributed shared memory multiprocessors
-
G. T. Byrd and M. J. Flynn. Producer-consumer communication in distributed shared memory multiprocessors. Proceedings of the IEEE, 87(3):456-466, 1999.
-
(1999)
Proceedings of the IEEE
, vol.87
, Issue.3
, pp. 456-466
-
-
Byrd, G.T.1
Flynn, M.J.2
-
6
-
-
46449091801
-
-
M. P. I. Forum. MPI: A message-passing interface standard. Technical Report UT-CS-94-230, Knoxville, TN, 1994.
-
M. P. I. Forum. MPI: A message-passing interface standard. Technical Report UT-CS-94-230, Knoxville, TN, 1994.
-
-
-
-
8
-
-
46449121306
-
Advanced POWER Virtualization on IBM eServer p5 Servers: Architecture and Performance Considerations
-
second edition
-
B. Gibbs, B. Atyam, F. Berres, B. Blanchard, L. Castillo, P. Coelho, N. Guerin, L. Liu, C. D. Maciel, C. Sosa, and R. Thirumalai. Advanced POWER Virtualization on IBM eServer p5 Servers: Architecture and Performance Considerations. IBM Redbooks, second edition, 2005.
-
(2005)
IBM Redbooks
-
-
Gibbs, B.1
Atyam, B.2
Berres, F.3
Blanchard, B.4
Castillo, L.5
Coelho, P.6
Guerin, N.7
Liu, L.8
Maciel, C.D.9
Sosa, C.10
Thirumalai, R.11
-
9
-
-
27544482360
-
Direct cache access for high bandwidth network I/O
-
Madison, WI, June
-
R. Huggahalli, R. Iyer, and S. Tetrick. Direct cache access for high bandwidth network I/O. In 32nd Annual International Symposium on Computer Architecture (ISCA'05), pages 50-59, Madison, WI, June 2005.
-
(2005)
32nd Annual International Symposium on Computer Architecture (ISCA'05)
, pp. 50-59
-
-
Huggahalli, R.1
Iyer, R.2
Tetrick, S.3
-
10
-
-
0036949388
-
An adaptive, nonuniform cache structure for wire-delay dominated on-chip caches
-
San Jose, CA, Oct
-
C. Kim, D. Burger, and S. W. Keckler. An adaptive, nonuniform cache structure for wire-delay dominated on-chip caches. In 10th International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS X), pages 211-222, San Jose, CA, Oct. 2002.
-
(2002)
10th International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS X)
, pp. 211-222
-
-
Kim, C.1
Burger, D.2
Keckler, S.W.3
-
11
-
-
0026839484
-
The Stanford Dash multiprocessor
-
D. Lenoski, J. Laudon, K. Gharachorloo, W.-D. Weber, A. Gupta, J. Hennessy, M. Horowitz, and M. S. Lam. The Stanford Dash multiprocessor. Computer, 25(3):63-79, 1992.
-
(1992)
Computer
, vol.25
, Issue.3
, pp. 63-79
-
-
Lenoski, D.1
Laudon, J.2
Gharachorloo, K.3
Weber, W.-D.4
Gupta, A.5
Hennessy, J.6
Horowitz, M.7
Lam, M.S.8
-
14
-
-
4143107088
-
Increasing memory bandwidth for vector computations
-
Zurich, Switzerland, Mar
-
S. A. McKee, S. A. Moyer, and W. A. Wulf. Increasing memory bandwidth for vector computations. In International Conference on Programming Languages and System Architectures, pages 87-104, Zurich, Switzerland, Mar. 1994.
-
(1994)
International Conference on Programming Languages and System Architectures
, pp. 87-104
-
-
McKee, S.A.1
Moyer, S.A.2
Wulf, W.A.3
-
16
-
-
0002031606
-
Tolerating latency through software-controlled prefetching in shared-memory multiprocessors
-
T. Mowry and A. Gupta. Tolerating latency through software-controlled prefetching in shared-memory multiprocessors. Journal of Parallel and Distributed Computing, 12(2):87-106, 1991.
-
(1991)
Journal of Parallel and Distributed Computing
, vol.12
, Issue.2
, pp. 87-106
-
-
Mowry, T.1
Gupta, A.2
-
18
-
-
77954460854
-
Data prefetching and data forwarding in shared memory multiprocessors
-
North Carolina State University, NC
-
D. K. Poulsen and P.-C. Yew. Data prefetching and data forwarding in shared memory multiprocessors. In International Conference on Parallel Processing (ICPP'94), pages 276-280, North Carolina State University, NC, 1994.
-
(1994)
International Conference on Parallel Processing (ICPP'94)
, pp. 276-280
-
-
Poulsen, D.K.1
Yew, P.-C.2
-
19
-
-
25844437046
-
POWER5 system microarchitecture
-
B. Sinharoy, R. N. Kalla, J. M. Tendler, R. J. Eickemeyer, and J. B. Joyner. POWER5 system microarchitecture. IBM Journal of Research and Development, 49(4/5), 2005.
-
(2005)
IBM Journal of Research and Development
, vol.49
, Issue.4-5
-
-
Sinharoy, B.1
Kalla, R.N.2
Tendler, J.M.3
Eickemeyer, R.J.4
Joyner, J.B.5
-
20
-
-
85117198273
-
An empirical performance evaluation of scalable scientific applications
-
Baltimore, Maryland
-
J. S. Vetter and A. Yoo. An empirical performance evaluation of scalable scientific applications. In 2002 ACM/IEEE Conference on Supercomputing (SC'02), pages 1-18, Baltimore, Maryland, 2002.
-
(2002)
2002 ACM/IEEE Conference on Supercomputing (SC'02)
, pp. 1-18
-
-
Vetter, J.S.1
Yoo, A.2
-
21
-
-
0003158656
-
Hitting the memory wall: Implications of the obvious
-
Mar
-
W. A. Wulf and S. A. McKee. Hitting the memory wall: Implications of the obvious. ACM SIGARCH Computer Architecture News, 3(1):20-24, Mar. 1995.
-
(1995)
ACM SIGARCH Computer Architecture News
, vol.3
, Issue.1
, pp. 20-24
-
-
Wulf, W.A.1
McKee, S.A.2
|