-
1
-
-
0029180378
-
The MIT Alewife machine: Architecture and performance
-
Agarwal A., Bianchini R., Chaiken D., Johnson K. L., Kranz D., Kubiatowicz J., Lim B., Mackenzie K., Yeung D. The MIT Alewife machine: architecture and performance. Proc. 22nd International Symposium on Computer Architecture. 1995.
-
(1995)
Proc. 22nd International Symposium on Computer Architecture
-
-
Agarwal, A.1
Bianchini, R.2
Chaiken, D.3
Johnson, K.L.4
Kranz, D.5
Kubiatowicz, J.6
Lim, B.7
Mackenzie, K.8
Yeung, D.9
-
4
-
-
85067798127
-
An interactive environment for data partitioning and distribution
-
Charleston, SC, April
-
V. Balasundaram, G. Fox, K. Kennedy, and, U. Kremer, An interactive environment for data partitioning and distribution, in, 5th Distributed Memory Computing Conference, Charleston, SC, April 1990.
-
(1990)
In, 5th Distributed Memory Computing Conference
-
-
Balasundaram, V.1
Fox, G.2
Kennedy, K.3
Kremer, U.4
-
5
-
-
0029394470
-
The PARADIGM compiler for distributed-memory multicomputers
-
Banerjee P., Chandy J. A., Gupta M., Hodges E. W. IV, Holm J. G., Lain A., Palermo D. J., Ramaswamy S., Su E. The PARADIGM compiler for distributed-memory multicomputers. IEEE Comput. 28:October 1995;37-47.
-
(1995)
IEEE Comput.
, vol.28
, pp. 37-47
-
-
Banerjee, P.1
Chandy, J.A.2
Gupta, M.3
Hodges E.W. IV4
Holm, J.G.5
Lain, A.6
Palermo, D.J.7
Ramaswamy, S.8
Su, E.9
-
7
-
-
0030382364
-
Parallel programming with Polaris
-
Blume W., Doallo R., Eigenmann R., Grout J., Hoeflinger J., Lawrence T., Lee J., Padua D., Paek Y., Pottenger B., Rauchwerger L., Tu P. Parallel programming with Polaris. IEEE Comput. 29:December 1996;78-82.
-
(1996)
IEEE Comput.
, vol.29
, pp. 78-82
-
-
Blume, W.1
Doallo, R.2
Eigenmann, R.3
Grout, J.4
Hoeflinger, J.5
Lawrence, T.6
Lee, J.7
Padua, D.8
Paek, Y.9
Pottenger, B.10
Rauchwerger, L.11
Tu, P.12
-
8
-
-
84900310806
-
Techniques for compiling and executing HPF programs on shared-memory and distributed-memory parallel systems
-
Bangalore, India, December
-
Z. Bozkus, L. Meadows, D. Miles, S. Nakamoto, V. Schuster, and, M. Young, Techniques for compiling and executing HPF programs on shared-memory and distributed-memory parallel systems, in, Proc. 1st International Workshop on Parallel Processing, Bangalore, India, December 1994.
-
(1994)
In, Proc. 1st International Workshop on Parallel Processing
-
-
Bozkus, Z.1
Meadows, L.2
Miles, D.3
Nakamoto, S.4
Schuster, V.5
Young, M.6
-
10
-
-
3142734175
-
Blocking linear algebra codes for memory hierarchies
-
Chicago, IL, December
-
S. Carr, and, K. Kennedy, Blocking linear algebra codes for memory hierarchies, in, Proc. 4th SIAM Conference on Parallel Processing for Scientific Computing, Chicago, IL, December 1989.
-
(1989)
In, Proc. 4th SIAM Conference on Parallel Processing for Scientific Computing
-
-
Carr, S.1
Kennedy, K.2
-
11
-
-
0030651789
-
Data-distribution support on distributed-shared memory multiprocessors
-
Las Vegas, NV
-
R. Chandra, D. Chen, R. Cox, D. Maydan, N. Nedeljkovic, and, J. M. Anderson, Data-distribution support on distributed-shared memory multiprocessors, in, Proc. Programming Language Design and Implementation (PLDI), Las Vegas, NV, 1997.
-
(1997)
In, Proc. Programming Language Design and Implementation (PLDI)
-
-
Chandra, R.1
Chen, D.2
Cox, R.3
Maydan, D.4
Nedeljkovic, N.5
Anderson, J.M.6
-
12
-
-
84976859799
-
Unifying data and control transformations for distributed shared memory machines
-
La Jolla, CA, June
-
M. Cierniak, and, W. Li, Unifying data and control transformations for distributed shared memory machines, in, Proc. SIGPLAN'95 Conference on Programming Language Design and Implementation, La Jolla, CA, June 1995.
-
(1995)
In, Proc. SIGPLAN'95 Conference on Programming Language Design and Implementation
-
-
Cierniak, M.1
Li, W.2
-
13
-
-
84976745804
-
Tile size selection using cache organization and data layout
-
La Jolla, CA, June
-
S. Coleman, and, K. McKinley, Tile size selection using cache organization and data layout, in, Proc. SIGPLAN'95 Conference on Programming Language Design and Implementation, La Jolla, CA, June 1995.
-
(1995)
In, Proc. SIGPLAN'95 Conference on Programming Language Design and Implementation
-
-
Coleman, S.1
McKinley, K.2
-
15
-
-
0026821098
-
New CPU benchmark suites from SPEC
-
San Francisco, CA, February
-
K. M. Dixit, New CPU benchmark suites from SPEC, in, Proc. COMPCON'92 - 37th IEEE Computer Society International Conference, San Francisco, CA, February 1992.
-
(1992)
In, Proc. COMPCON'92 - 37th IEEE Computer Society International Conference
-
-
Dixit, K.M.1
-
17
-
-
0001366267
-
Strategies for cache and local memory management by global program transformations
-
Gannon D., Jalby W., Gallivan K. Strategies for cache and local memory management by global program transformations. J. Parallel Distrib. Comput. 1988;587-616.
-
(1988)
J. Parallel Distrib. Comput.
, pp. 587-616
-
-
Gannon, D.1
Jalby, W.2
Gallivan, K.3
-
18
-
-
0029430244
-
A novel approach towards automatic data distribution
-
San Diego, December
-
J. Garcia, E. Ayguade, and, J. Labarta, A novel approach towards automatic data distribution, in, Proc. Supercomputing'95, San Diego, December 1995.
-
(1995)
In, Proc. Supercomputing'95
-
-
Garcia, J.1
Ayguade, E.2
Labarta, J.3
-
20
-
-
84990709846
-
Updating distributed variables in local computations
-
Gerndt M. Updating distributed variables in local computations. Concurrency Practice Experience. 2:September 1990;171-193.
-
(1990)
Concurrency Practice Experience
, vol.2
, pp. 171-193
-
-
Gerndt, M.1
-
21
-
-
0026823950
-
Demonstration of automatic data partitioning techniques for parallelizing compilers on multicomputers
-
Gupta M., Banerjee P. Demonstration of automatic data partitioning techniques for parallelizing compilers on multicomputers. IEEE Trans. Parallel Distrib. Systems. 3:March 1992;179-193.
-
(1992)
IEEE Trans. Parallel Distrib. Systems
, vol.3
, pp. 179-193
-
-
Gupta, M.1
Banerjee, P.2
-
25
-
-
0002065001
-
Reduction of cache coherence overhead by compiler data layout and loop transformations
-
Santa Clara, CA, August
-
Y.-J. Ju, and, H. Dietz, Reduction of cache coherence overhead by compiler data layout and loop transformations, in, Proc. 4th Workshop on Languages and Compilers for Parallel Computing, Santa Clara, CA, August 1991.
-
(1991)
In, Proc. 4th Workshop on Languages and Compilers for Parallel Computing
-
-
Ju, Y.-J.1
Dietz, H.2
-
26
-
-
0003328017
-
Automatic data layout for High Performance Fortran
-
San Diego, CA, December
-
K. Kennedy, and, U. Kremer, Automatic data layout for High Performance Fortran, in, Proceedings of Supercomputing'95, San Diego, CA, December 1995.
-
(1995)
In, Proceedings of Supercomputing'95
-
-
Kennedy, K.1
Kremer, U.2
-
27
-
-
85027602455
-
Optimizing for parallelism and data locality
-
Washington, D.C, July
-
K. Kennedy, and, K. S. McKinley, Optimizing for parallelism and data locality, in, Proc. 1992 ACM International Conference on Supercomputing (ICS'92), Washington, D.C, July 1992.
-
(1992)
In, Proc. 1992 ACM International Conference on Supercomputing (ICS'92)
-
-
Kennedy, K.1
McKinley, K.S.2
-
29
-
-
0026137116
-
The cache performance and optimizations of blocked algorithms
-
April
-
M. S. Lam, E. Rothberg, and, M. E. Wolf, The cache performance and optimizations of blocked algorithms, in, Proc. 4th International Conference on Architectural Support for Programming Languages and Operating Systems, April, 1991.
-
(1991)
In, Proc. 4th International Conference on Architectural Support for Programming Languages and Operating Systems
-
-
Lam, M.S.1
Rothberg, E.2
Wolf, M.E.3
-
31
-
-
0026865505
-
The DASH prototype: Implementation and performance
-
Gold Coast, Australia, May
-
D. Lenoski, J. Laudon, T. Joe, D. Nakahira, L. Stevens, A. Gupta, and J. Hennessy, The DASH prototype: implementation and performance, in Proc. 19th International Symposium on Computer Architecture, Gold Coast, Australia, May 1992, pp. 92-103.
-
(1992)
In Proc. 19th International Symposium on Computer Architecture
, pp. 92-103
-
-
Lenoski, D.1
Laudon, J.2
Joe, T.3
Nakahira, D.4
Stevens, L.5
Gupta, A.6
Hennessy, J.7
-
35
-
-
0002848657
-
Non-singular data transformations: Definition, validity, applications
-
Aachen, Germany
-
M. O'Boyle and P. Knijnenburg, Non-singular data transformations: definition, validity, applications, in Proc. 6th Workshop on Compilers for Parallel Computers, Aachen, Germany, 1996, pp. 287-297.
-
(1996)
In Proc. 6th Workshop on Compilers for Parallel Computers
, pp. 287-297
-
-
O'Boyle, M.1
Knijnenburg, P.2
-
36
-
-
0001787125
-
Automatic selection of dynamic data partitioning schemes for distributed-memory multicomputers
-
Columbus, OH
-
D. Palermo and P. Banerjee, Automatic selection of dynamic data partitioning schemes for distributed-memory multicomputers, in Proc. 8th Workshop on Languages and Compilers for Parallel Computing, Columbus, OH, 1995, pp. 392-406.
-
(1995)
In Proc. 8th Workshop on Languages and Compilers for Parallel Computing
, pp. 392-406
-
-
Palermo, D.1
Banerjee, P.2
-
37
-
-
0024874341
-
Parafrase-2: An environment for parallelizing, partitioning, synchronizing, and scheduling programs on multiprocessors
-
St. Charles, IL, August
-
C. Polychronopoulos, M. B. Girkar, M. R. Haghighat, C. L. Lee, B. P. Leung, and D. A. Schouten, Parafrase-2: an environment for parallelizing, partitioning, synchronizing, and scheduling programs on multiprocessors, in Proc. the International Conference on Parallel Processing, St. Charles, IL, August 1989, pp. 39-48.
-
(1989)
In Proc. the International Conference on Parallel Processing
, pp. 39-48
-
-
Polychronopoulos, C.1
Girkar, M.B.2
Haghighat, M.R.3
Lee, C.L.4
Leung, B.P.5
Schouten, D.A.6
-
38
-
-
85031707006
-
Non-unimodular transformations of nested loops
-
Minneapolis, MN, November
-
J. Ramanujam, Non-unimodular transformations of nested loops, in Proc. Supercomputing 92, Minneapolis, MN, November 1992, pp. 214-223.
-
(1992)
In Proc. Supercomputing 92
, pp. 214-223
-
-
Ramanujam, J.1
-
40
-
-
0043279904
-
Automatic data mapping and program transformations
-
Houston, TX, April
-
J. Ramanujam, and, A. Narayan, Automatic data mapping and program transformations, in, Proc. Workshop on Automatic Data Layout and Performance Prediction, Houston, TX, April 1995.
-
(1995)
In, Proc. Workshop on Automatic Data Layout and Performance Prediction
-
-
Ramanujam, J.1
Narayan, A.2
-
41
-
-
84958797356
-
Locality analysis for distributed shared-memory multiprocessors
-
Santa Clara, CA, August
-
V. Sarkar, G. R. Gao, and, S. Han, Locality analysis for distributed shared-memory multiprocessors, in, Proc. the Ninth International Workshop on Languages and Compilers for Parallel Computing, Santa Clara, CA, August 1996.
-
(1996)
In, Proc. the Ninth International Workshop on Languages and Compilers for Parallel Computing
-
-
Sarkar, V.1
Gao, G.R.2
Han, S.3
-
43
-
-
0029205969
-
Aligning parallel arrays to reduce communication
-
McLean, VA, February
-
T. J. Sheffler, R. Schreiber, J. R. Gilbert, and S. Chatterjee, Aligning parallel arrays to reduce communication, in Frontiers '95: The 5th Symposium on the Frontiers of Massively Parallel Computing, McLean, VA, February 1995, pp. 324-331.
-
(1995)
In Frontiers '95: The 5th Symposium on the Frontiers of Massively Parallel Computing
, pp. 324-331
-
-
Sheffler, T.J.1
Schreiber, R.2
Gilbert, J.R.3
Chatterjee, S.4
-
44
-
-
0029194311
-
Unified compilation techniques for shared and distributed address space machines
-
Barcelona, Spain, July
-
C.-W. Tseng, J. Anderson, S. Amarasinghe, and, M. Lam, Unified compilation techniques for shared and distributed address space machines, in, Proc. 1995 International Conference on Supercomputing (ICS'95), Barcelona, Spain, July 1995.
-
(1995)
In, Proc. 1995 International Conference on Supercomputing (ICS'95)
-
-
Tseng, C.-W.1
Anderson, J.2
Amarasinghe, S.3
Lam, M.4
-
45
-
-
0030671818
-
An evaluation of a commercial CC-NUMA architecture: The CONVEX Exemplar SPP-1200
-
Geneva, Switzerland, April
-
R. Thekkath, A. P. Singh, J. P. Singh, S. John, and, J. Hennessey, An evaluation of a commercial CC-NUMA architecture: the CONVEX Exemplar SPP-1200, in, Proc. 11th International Parallel Processing Symposium, Geneva, Switzerland, April 1997.
-
(1997)
In, Proc. 11th International Parallel Processing Symposium
-
-
Thekkath, R.1
Singh, A.P.2
Singh, J.P.3
John, S.4
Hennessey, J.5
-
46
-
-
0029194338
-
Evaluating the impact of advanced memory systems on compiler-parallelized codes
-
Limassol, Cyprus, June
-
E. Torrie, C.-W. Tseng, M. Martonosi, and, M. W. Hall, Evaluating the impact of advanced memory systems on compiler-parallelized codes, in, Proc. International Conference on Parallel Architectures and Compilations Techniques (PACT), Limassol, Cyprus, June 1995.
-
(1995)
In, Proc. International Conference on Parallel Architectures and Compilations Techniques (PACT)
-
-
Torrie, E.1
Tseng, C.-W.2
Martonosi, M.3
Hall, M.W.4
-
47
-
-
84976692695
-
SUIF: An infrastructure for research on parallelizing and optimizing compilers
-
Wilson R. P., French R. S., Wilson C. S., Amarasinghe S. P., Anderson J. M., Tjiang S. W. K., Liao S.-W., Tseng C.-W., Hall M. W., Lam M. S., Hennessy J. L. SUIF: An infrastructure for research on parallelizing and optimizing compilers. ACM SIGPLAN Notices. 29:December 1994;31-37.
-
(1994)
ACM SIGPLAN Notices
, vol.29
, pp. 31-37
-
-
Wilson, R.P.1
French, R.S.2
Wilson, C.S.3
Amarasinghe, S.P.4
Anderson, J.M.5
Tjiang, S.W.K.6
Liao, S.-W.7
Tseng, C.-W.8
Hall, M.W.9
Lam, M.S.10
Hennessy, J.L.11
-
48
-
-
0026232450
-
A loop transformation theory and an algorithm to maximize parallelism
-
Wolf M., Lam M. A loop transformation theory and an algorithm to maximize parallelism. IEEE Trans. Parallel Distrib. Systems. 2:October 1991;452-471.
-
(1991)
IEEE Trans. Parallel Distrib. Systems
, vol.2
, pp. 452-471
-
-
Wolf, M.1
Lam, M.2
-
51
-
-
0027543560
-
Compiling for distributed-memory systems
-
Zima H., Chapman B. Compiling for distributed-memory systems. Proc. IEEE. 81:1993;264-287.
-
(1993)
Proc. IEEE
, vol.81
, pp. 264-287
-
-
Zima, H.1
Chapman, B.2
|