-
5
-
-
0026267802
-
An effective on-chip preloading scheme to reduce data access penalty
-
BAER, J.-L. AND CHEN, T.-F. 1991. An effective on-chip preloading scheme to reduce data access penalty. In Proceedings of Supercomputing '91.
-
(1991)
Proceedings of Supercomputing '91
-
-
Baer, J.-L.1
Chen, T.-F.2
-
7
-
-
0026138044
-
Software prefetching
-
April. ACM, New York
-
CALLAHAN, D., KENNEDY, K., AND PORTERFIELD, A. 1991. Software prefetching. In Proceedings of the 4th International Conference on Architectural Support for Programming Languages and Operating Systems (April). ACM, New York, 40-52.
-
(1991)
Proceedings of the 4th International Conference on Architectural Support for Programming Languages and Operating Systems
, pp. 40-52
-
-
Callahan, D.1
Kennedy, K.2
Porterfield, A.3
-
8
-
-
0023531324
-
A vliw architecture for a trace scheduling compiler
-
Oct. ACM, New York
-
COLWELL, R. P., NIX, R. P., O'DONNELL, J. J., PAPWORTH, D. B., AND RODMAN, P. K. 1987. A vliw architecture for a trace scheduling compiler. In Proceedings of the 2nd International Conference on Architectural Support for Programming Languages and Operating Systems (Oct.). ACM, New York, 180-192.
-
(1987)
Proceedings of the 2nd International Conference on Architectural Support for Programming Languages and Operating Systems
, pp. 180-192
-
-
Colwell, R.P.1
Nix, R.P.2
O'Donnell, J.J.3
Papworth, D.B.4
Rodman, P.K.5
-
9
-
-
0004269807
-
-
CONVEX COMPUTER Convex Computer Corp.
-
CONVEX COMPUTER. 1994. Convex Exemplar Architecture. Convex Computer Corp.
-
(1994)
Convex Exemplar Architecture
-
-
-
10
-
-
0027574855
-
A methodology for procedure cloning
-
April
-
COOPER, K., HALL, M., AND KENNEDY, K. 1993. A methodology for procedure cloning. Comput. Lang. 19, 2 (April).
-
(1993)
Comput. Lang.
, vol.19
, Issue.2
-
-
Cooper, K.1
Hall, M.2
Kennedy, K.3
-
11
-
-
0024868691
-
Overlapped loop support in the cydra 5
-
April. ACM, New York
-
DEHNERT, J. C., Hsu, P. Y.-T., AND BRATT, J. P. 1989. Overlapped loop support in the cydra 5. In Proceedings of the 3rd International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS III) (April). ACM, New York, 26-38.
-
(1989)
Proceedings of the 3rd International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS III)
, pp. 26-38
-
-
Dehnert, J.C.1
Hsu, P.Y.-T.2
Bratt, J.P.3
-
12
-
-
0023963509
-
Synchronization, coherence, and event ordering in multiprocessors
-
Feb.
-
DUBOIS, M., SCHEURICH, C., AND BRIGGS, F. A. 1988. Synchronization, coherence, and event ordering in multiprocessors. Computer 21, 2 (Feb.), 9-21.
-
(1988)
Computer
, vol.21
, Issue.2
, pp. 9-21
-
-
Dubois, M.1
Scheurich, C.2
Briggs, F.A.3
-
14
-
-
0347662803
-
The impact of hierarchical memory systems on linear algebra algorithm design
-
Univ. of Illinois, Urbana-Champaign, Ill.
-
GALLIVAN, K., JALBY, W., MEIER, U., AND SAMEH, A. 1987. The impact of hierarchical memory systems on linear algebra algorithm design. Tech. Rep. UIUCSRD 625, Univ. of Illinois, Urbana-Champaign, Ill.
-
(1987)
Tech. Rep. UIUCSRD 625
-
-
Gallivan, K.1
Jalby, W.2
Meier, U.3
Sameh, A.4
-
15
-
-
0347662804
-
The influence of memory hierarchy on algorithm organization: Programming FFTs on a vector multiprocessor
-
MIT Press, Cambridge, Mass.
-
GANNON, D. AND JALBY, W. 1987. The influence of memory hierarchy on algorithm organization: Programming FFTs on a vector multiprocessor. In The Characteristics of Parallel Algorithms. MIT Press, Cambridge, Mass.
-
(1987)
The Characteristics of Parallel Algorithms
-
-
Gannon, D.1
Jalby, W.2
-
16
-
-
0026137114
-
Performance evaluation of memory consistency models for shared-memory multiprocessors
-
April. ACM, New York
-
GHARACHORLOO, K., GUPTA, A., AND HENNESSY, J. 1991. Performance evaluation of memory consistency models for shared-memory multiprocessors. In Proceedings of the 4th International Conference on Architectural Support for Programming Languages and Operating Systems (April). ACM, New York, 245-257.
-
(1991)
Proceedings of the 4th International Conference on Architectural Support for Programming Languages and Operating Systems
, pp. 245-257
-
-
Gharachorloo, K.1
Gupta, A.2
Hennessy, J.3
-
17
-
-
0025433762
-
Memory consistency and event ordering in scalable shared-memory multiprocessors
-
May
-
GHARACHORLOO, K., LENOSKI, D., LAUDON, J., GIBBONS, P., GUPTA, A., AND HENNESSY, J. 1990. Memory consistency and event ordering in scalable shared-memory multiprocessors. In Proceedings of the 17th Annual International Symposium on Computer Architecture (May). 15-26.
-
(1990)
Proceedings of the 17th Annual International Symposium on Computer Architecture
, pp. 15-26
-
-
Gharachorloo, K.1
Lenoski, D.2
Laudon, J.3
Gibbons, P.4
Gupta, A.5
Hennessy, J.6
-
18
-
-
0011603313
-
Tango introduction and tutorial
-
Stanford Univ., Palo Alto, Calif.
-
GOLDSCHMIDT, S. R. AND DAVIS, H. 1990. Tango introduction and tutorial. Tech. Rep. CSL-TR-90-410, Stanford Univ., Palo Alto, Calif.
-
(1990)
Tech. Rep. CSL-TR-90-410
-
-
Goldschmidt, S.R.1
Davis, H.2
-
22
-
-
0026158290
-
Comparative evaluation of latency reducing and tolerating techniques
-
May
-
GUPTA, A., HENNESSY, J., GHARACHORLOO, K., MOWRY, T., AND WEBER, W.-D. 1991. Comparative evaluation of latency reducing and tolerating techniques. In Proceedings of the 18th Annual International Symposium on Computer Architecture (May). 254-263.
-
(1991)
Proceedings of the 18th Annual International Symposium on Computer Architecture
, pp. 254-263
-
-
Gupta, A.1
Hennessy, J.2
Gharachorloo, K.3
Mowry, T.4
Weber, W.-D.5
-
24
-
-
0028732616
-
Cache performance in vector supercomputers
-
KONTOTHANASSIS, L., SUGUMAR, R., FAANES, G., SMITH, J., AND SCOTT, M. 1994. Cache performance in vector supercomputers. In Proceedings of Supercomputing '94. 255-264.
-
(1994)
Proceedings of Supercomputing '94
, pp. 255-264
-
-
Kontothanassis, L.1
Sugumar, R.2
Faanes, G.3
Smith, J.4
Scott, M.5
-
26
-
-
0018518477
-
How to make a multiprocessor computer that correctly executes multiprocess programs
-
Sept.
-
LAMPORT, L. 1979. How to make a multiprocessor computer that correctly executes multiprocess programs. IEEE Trans. Comput. C-28, 9 (Sept.), 241-248.
-
(1979)
IEEE Trans. Comput.
, vol.C-28
, Issue.9
, pp. 241-248
-
-
Lamport, L.1
-
28
-
-
0347031952
-
-
Ph.D. thesis, Dept. of Computer Science, Univ. of Illinois, Urbana-Champaign, Ill.
-
LEE, R. L. 1987. The effectiveness of caches and data prefetch buffers in large-scale shared memory multiprocessors. Ph.D. thesis, Dept. of Computer Science, Univ. of Illinois, Urbana-Champaign, Ill.
-
(1987)
The Effectiveness of Caches and Data Prefetch Buffers in Large-scale Shared Memory Multiprocessors
-
-
Lee, R.L.1
-
29
-
-
0026839484
-
The Stanford DASH multiprocessor
-
Mar.
-
LENOSKI, D., GHARACHORLOO, K., LAUDON, J., GUPTA, A., HENNESSY, J., HOROWITZ, M., AND LAM, M. 1992. The Stanford DASH multiprocessor. IEEE Comput. 25, 3 (Mar.), 63-79.
-
(1992)
IEEE Comput.
, vol.25
, Issue.3
, pp. 63-79
-
-
Lenoski, D.1
Gharachorloo, K.2
Laudon, J.3
Gupta, A.4
Hennessy, J.5
Horowitz, M.6
Lam, M.7
-
31
-
-
0003979521
-
-
Holt, Rinehart and Winston, Inc.
-
LUSK, E., OVERBEEK, R., ET AL. 1987. Portable Programs for Parallel Processors. Holt, Rinehart and Winston, Inc.
-
(1987)
Portable Programs for Parallel Processors
-
-
Lusk, E.1
Overbeek, R.2
-
32
-
-
84945709131
-
The organization of matrices and matrix operations in a paged multiprogramming environment
-
MCKELLER, A. C. AND COFFMAN, E. G. 1969. The organization of matrices and matrix operations in a paged multiprogramming environment. Commun. ACM 12, 3, 153-165.
-
(1969)
Commun. ACM
, vol.12
, Issue.3
, pp. 153-165
-
-
Mckeller, A.C.1
Coffman, E.G.2
-
33
-
-
0002031606
-
Tolerating latency through software-controlled prefetching in shared-memory multiprocessors
-
MOWRY, T. AND GUPTA, A. 1991. Tolerating latency through software-controlled prefetching in shared-memory multiprocessors. J. Parallel Distrib. Comput. 12, 2, 87-106.
-
(1991)
J. Parallel Distrib. Comput.
, vol.12
, Issue.2
, pp. 87-106
-
-
Mowry, T.1
Gupta, A.2
-
35
-
-
0026918402
-
Design and evaluation of a compiler algorithm for prefetching
-
Oct.
-
MOWRY, T. C., LAM, M. S., AND GUPTA, A. 1992. Design and evaluation of a compiler algorithm for prefetching. In Proceedings of the 5th International Conference on Architectural Support for Programming Languages and Operating Systems. Vol. 27 (Oct.). 62-73.
-
(1992)
Proceedings of the 5th International Conference on Architectural Support for Programming Languages and Operating Systems
, vol.27
, pp. 62-73
-
-
Mowry, T.C.1
Lam, M.S.2
Gupta, A.3
-
36
-
-
0003690936
-
-
Ph.D. thesis, Dept. of Computer Science, Rice Univ., Houston, Tex.
-
PORTERFIELD, A. K. 1989. Software methods for improvement of cache performance on supercomputer applications. Ph.D. thesis, Dept. of Computer Science, Rice Univ., Houston, Tex.
-
(1989)
Software Methods for Improvement of Cache Performance on Supercomputer Applications
-
-
Porterfield, A.K.1
-
37
-
-
0003897840
-
Splash: Stanford parallel applications for shared memory
-
Stanford Univ., Palo Alto, Calif.
-
SINGH, J. P., WEBER, W.-D., AND GUPTA, A. 1991. Splash: Stanford parallel applications for shared memory. Tech. Rep. CSL-TR-91-469, Stanford Univ., Palo Alto, Calif.
-
(1991)
Tech. Rep. CSL-TR-91-469
-
-
Singh, J.P.1
Weber, W.-D.2
Gupta, A.3
-
38
-
-
0025440459
-
A survey of cache coherence schemes for multiprocessors
-
June
-
STENSTROM, P. 1990. A survey of cache coherence schemes for multiprocessors. IEEE Comput. 23, 6 (June), 12-24.
-
(1990)
IEEE Comput.
, vol.23
, Issue.6
, pp. 12-24
-
-
Stenstrom, P.1
-
40
-
-
0003333239
-
Shared data placement optimizations to reduce multiprocessor cache miss rates
-
Aug.
-
TORRELLAS, J., LAM, M. S., AND HENNESSY, J. L. 1990. Shared data placement optimizations to reduce multiprocessor cache miss rates. In Proceedings of the 1990 International Conference on Parallel Processing. Vol. 2 (Aug.). 266-270.
-
(1990)
Proceedings of the 1990 International Conference on Parallel Processing
, vol.2
, pp. 266-270
-
-
Torrellas, J.1
Lam, M.S.2
Hennessy, J.L.3
|