-
1
-
-
0031998929
-
Characterizing distributed shared memory performance: Case study of the Convex SPP1000
-
Abandah, G., and Davidson, E., 1998. Characterizing distributed shared memory performance: Case study of the Convex SPP1000. IEEE Transactions on Parallel and Distributed Systems 9(2): 206-216.
-
(1998)
IEEE Transactions on Parallel and Distributed Systems
, vol.9
, Issue.2
, pp. 206-216
-
-
Abandah, G.1
Davidson, E.2
-
2
-
-
0029322264
-
Unfavorable strides in cache memory systems
-
Bailey, D.H., 1995. Unfavorable strides in cache memory systems. Scientific Programming 4:53-58.
-
(1995)
Scientific Programming
, vol.4
, pp. 53-58
-
-
Bailey, D.H.1
-
3
-
-
2942626621
-
-
IBM Redbook
-
Behling, S., Bell, R., Farrell, P., Holthoff, H., O'Connell, F., and Weir, W., 2002. The POWER4 Processor Introduction and Tuning Guide, IBM Redbook.
-
(2002)
The POWER4 Processor Introduction and Tuning Guide
-
-
Behling, S.1
Bell, R.2
Farrell, P.3
Holthoff, H.4
O'Connell, F.5
Weir, W.6
-
5
-
-
0029308368
-
Effective hardware-based data prefetching for high-performance processors
-
Chen, T.F., and Baer, J.L., 1995. Effective hardware-based data prefetching for high-performance processors. IEEE Transactions on Computers 44(5):609-623.
-
(1995)
IEEE Transactions on Computers
, vol.44
, Issue.5
, pp. 609-623
-
-
Chen, T.F.1
Baer, J.L.2
-
6
-
-
0003662159
-
-
Morgan Kaufmann, San Mateo, CA
-
Culler, D., Singh, J.P., and Gupta, A., 1998. Parallel Computer Architecture: A Hardware/Software Approach, Morgan Kaufmann, San Mateo, CA.
-
(1998)
Parallel Computer Architecture: A Hardware/Software Approach
-
-
Culler, D.1
Singh, J.P.2
Gupta, A.3
-
7
-
-
2942626622
-
How useful are non-blocking loads, stream buffers and speculative execution in multiple issue processors?
-
Raleigh, NC. pp 78-89. IEEE Computer Society
-
Farkas, K., Jouppi, N.P., and Chow, P., 1995. How useful are non-blocking loads, stream buffers and speculative execution in multiple issue processors? In Proceedings of the 1st IEEE Symposium on High Performance Computer Architecture (HPCA 1995), January 22-25 1995. Raleigh, NC. pp 78-89. IEEE Computer Society 1995.
-
(1995)
Proceedings of the 1st IEEE Symposium on High Performance Computer Architecture (HPCA 1995), January 22-25 1995
-
-
Farkas, K.1
Jouppi, N.P.2
Chow, P.3
-
8
-
-
0034272461
-
Introducing the IA-64 architecture
-
Huck, J., Morris, D., Ross, J., Knies, A., Mulder, H., and Zahir, R., 2000. Introducing the IA-64 architecture. IEEE Micro 20(5): 12-23.
-
(2000)
IEEE Micro
, vol.20
, Issue.5
, pp. 12-23
-
-
Huck, J.1
Morris, D.2
Ross, J.3
Knies, A.4
Mulder, H.5
Zahir, R.6
-
9
-
-
0032669612
-
Comparing the memory system performance of the HP V-Class and SGI Origin 2000 multiprocessors using micro-benchmarks and scientific applications
-
Rhodes, Greece. ACM. pp 339-347
-
Iyer, R., Amato, N.M., Rauchwerger, L., and Bhuyan, L., 1999. Comparing the memory system performance of the HP V-Class and SGI Origin 2000 multiprocessors using micro-benchmarks and scientific applications. Proceedings of the 1999 International Conference on Supercomputing. June 20-25 1999. Rhodes, Greece. ACM. pp 339-347.
-
(1999)
Proceedings of the 1999 International Conference on Supercomputing. June 20-25 1999
-
-
Iyer, R.1
Amato, N.M.2
Rauchwerger, L.3
Bhuyan, L.4
-
10
-
-
2942533399
-
-
McCalpin, J.D., 1995. Stream benchmark, http://www.cs.virginia.edu/stream/.
-
(1995)
-
-
McCalpin, J.D.1
-
12
-
-
0022138619
-
On the effective bandwidth of interleaved memories in vector systems
-
Oed, W., and Lange, O., 1985. On the effective bandwidth of interleaved memories in vector systems. IEEE Transactions on Computers C34(10):949-957.
-
(1985)
IEEE Transactions on Computers
, vol.C34
, Issue.10
, pp. 949-957
-
-
Oed, W.1
Lange, O.2
-
13
-
-
2942529792
-
Comparing and combining read miss clustering and software prefetching
-
Pai, V.S., and Adve, S.V., 1995. Comparing and combining read miss clustering and software prefetching. UIUC Technical Report.
-
(1995)
UIUC Technical Report
-
-
Pai, V.S.1
Adve, S.V.2
-
14
-
-
0000718681
-
Measuring cache and TLB performance and their effect on benchmark run times
-
Saavedra, R.H., and Smith, A.J., 1995. Measuring cache and TLB performance and their effect on benchmark run times. IEEE Transactions on Computers 44(10):1223-1235.
-
(1995)
IEEE Transactions on Computers
, vol.44
, Issue.10
, pp. 1223-1235
-
-
Saavedra, R.H.1
Smith, A.J.2
-
15
-
-
0036298603
-
Power4 system microarchitecture
-
Tendler, J.M., Dodson, J.S., Fields, J.S., Le, H., and Sinharoy, B., 2002. Power4 system microarchitecture. IBM Journal of Research and Development 46(1).
-
(2002)
IBM Journal of Research and Development
, vol.46
, Issue.1
-
-
Tendler, J.M.1
Dodson, J.S.2
Fields, J.S.3
Le, H.4
Sinharoy, B.5