-
1
-
-
0029199162
-
Empirical evaluation of the CRAY-T3D: A compiler perspective
-
ACM SIGARCH
-
R. Arpaci, D. Culler, A. Krishnamurthy, S. Steinberg, K. Yelick, 1995, Empirical evaluation of the CRAY-T3D: A compiler perspective, International Symposium on Computer Architecture, ACM SIGARCH.
-
(1995)
International Symposium on Computer Architecture
-
-
Arpaci, R.1
Culler, D.2
Krishnamurthy, A.3
Steinberg, S.4
Yelick, K.5
-
2
-
-
0010921883
-
Programming abstractions for dynamically partitioning and coordinating localized scientific calculations running on multiprocessors
-
Baden S. B. Programming abstractions for dynamically partitioning and coordinating localized scientific calculations running on multiprocessors. SIAM J. Sci. Statist. Comput. 12:1991;145-157.
-
(1991)
SIAM J. Sci. Statist. Comput.
, vol.12
, pp. 145-157
-
-
Baden, S.B.1
-
4
-
-
0028583189
-
Parallel performance of a symmetric eigensolver based on the invariant subspace decomposition approach
-
May Knoxville, TN, 39, IEEE Press, New York
-
C. Bischof, S. Huss-Lederman, X. Sun, A. Tsao, T. Turnbull, May 1994, Parallel performance of a symmetric eigensolver based on the invariant subspace decomposition approach, Scalable High Performance Computing Conference, Knoxville, TN, 32, 39, IEEE Press, New York.
-
(1994)
Scalable High Performance Computing Conference
, pp. 32
-
-
Bischof, C.1
Huss-Lederman, S.2
Sun, X.3
Tsao, A.4
Turnbull, T.5
-
5
-
-
0003229750
-
Scalapack: A portable linear algebra library for distributed memory computers - design issues and performance
-
L. S. Blackford, J. Choi, A. Cleary, J. Demmel, I. Dhillon, J. J. Dongarra, S. Hammarling, G. Henry, A. Petitet, K. Stanley, D. W. Walker, R. C. Whaley, Scalapack: A portable linear algebra library for distributed memory computers - Design issues and performance, Proceedings of Supercomputing '96.
-
Proceedings of Supercomputing '96
-
-
Blackford, L.S.1
Choi, J.2
Cleary, A.3
Demmel, J.4
Dhillon, I.5
Dongarra, J.J.6
Hammarling, S.7
Henry, G.8
Petitet, A.9
Stanley, K.10
Walker, D.W.11
Whaley, R.C.12
-
6
-
-
43949161602
-
Implementation of a portable nested data-parallel language
-
Blelloch G. E., Chatterjee S., Hardwick J. C., Sipelstein J., Zagha M. Implementation of a portable nested data-parallel language. J. Parallel Distrib. Comput. 21:1994;4-14.
-
(1994)
J. Parallel Distrib. Comput.
, vol.21
, pp. 4-14
-
-
Blelloch, G.E.1
Chatterjee, S.2
Hardwick, J.C.3
Sipelstein, J.4
Zagha, M.5
-
10
-
-
0003978709
-
A proposal for a set of parallel basic linear algebra subprograms
-
CS-95-292:May
-
Choi J., Dongarra J., Ostrouchov S., Petitet A., Walker D., Whaley R. C. A proposal for a set of parallel basic linear algebra subprograms. Computer Science Dept. Technical Report. CS-95-292:May 1995.
-
(1995)
Computer Science Dept. Technical Report
-
-
Choi, J.1
Dongarra, J.2
Ostrouchov, S.3
Petitet, A.4
Walker, D.5
Whaley, R.C.6
-
11
-
-
0000659575
-
A divide and conquer method for the symmetric tridiagonal eigenproblem
-
Cuppen J. A divide and conquer method for the symmetric tridiagonal eigenproblem. Numer. Math. 36:1981;177-195.
-
(1981)
Numer. Math.
, vol.36
, pp. 177-195
-
-
Cuppen, J.1
-
14
-
-
0003517895
-
A proposal for a user-level message passing interface in a distributed memory environment
-
ORNL/TM-12231:February
-
Dongarra J., Hempel R., Hay A., Walker D. A proposal for a user-level message passing interface in a distributed memory environment. Technical Report. ORNL/TM-12231:February 1993.
-
(1993)
Technical Report
-
-
Dongarra, J.1
Hempel, R.2
Hay, A.3
Walker, D.4
-
15
-
-
0026991394
-
A look at scalable dense linear algebra libraries
-
April IEEE Comput. Soc. Los Alamitos, CA
-
J. Dongarra, R. van de Geijn, D. Walker, April 1992, A look at scalable dense linear algebra libraries, Scalable High-Performance Computing Conference, IEEE Comput. Soc. Los Alamitos, CA.
-
(1992)
Scalable High-Performance Computing Conference
-
-
Dongarra, J.1
Van De Geijn, R.2
Walker, D.3
-
16
-
-
0010921884
-
-
Fortran 90
-
Fortran 90.
-
-
-
-
17
-
-
0003487728
-
High performance Fortran language specification, version 1.0
-
CRPC-TR92225:May
-
Forum H. P. F. High performance Fortran language specification, version 1.0. Technical Report. CRPC-TR92225:May 1993.
-
(1993)
Technical Report
-
-
Forum, H.P.F.1
-
18
-
-
0028599384
-
A compilation system that integrates high performance Fortran and Fortran M
-
300, IEEE, New York
-
I. Foster, M. Xu, B. Avalani, A. Chowdhary, 1994, A compilation system that integrates high performance Fortran and Fortran M, Scalable High Performance Computing Conference, 293, 300, IEEE, New York.
-
(1994)
Scalable High Performance Computing Conference
, pp. 293
-
-
Foster, I.1
Xu, M.2
Avalani, B.3
Chowdhary, A.4
-
19
-
-
0003287437
-
Bounds for multiprocessor scheduling with resource constraints
-
Garey M. R., Graham R. L. Bounds for multiprocessor scheduling with resource constraints. SIAM J. Comput. 4:1975;187-200.
-
(1975)
SIAM J. Comput.
, vol.4
, pp. 187-200
-
-
Garey, M.R.1
Graham, R.L.2
-
21
-
-
0027606922
-
On the granularity and clustering of directed acyclic task graphs
-
Gerasoulis A., Yang T. On the granularity and clustering of directed acyclic task graphs. IEEE Trans. Parallel Distrib. Syst. 4:1993;686-701.
-
(1993)
IEEE Trans. Parallel Distrib. Syst.
, vol.4
, pp. 686-701
-
-
Gerasoulis, A.1
Yang, T.2
-
22
-
-
0010865720
-
The analysis of a nested dissection algorithm
-
Gilbert J., Tarjan R. The analysis of a nested dissection algorithm. Numer. Math. 50:1987;377-404.
-
(1987)
Numer. Math.
, vol.50
, pp. 377-404
-
-
Gilbert, J.1
Tarjan, R.2
-
23
-
-
0014477093
-
Bounds on multiprocessor timing anomalies
-
Graham R. L. Bounds on multiprocessor timing anomalies. SIAM J. Appl. Math. 17:1969;416-429.
-
(1969)
SIAM J. Appl. Math.
, vol.17
, pp. 416-429
-
-
Graham, R.L.1
-
24
-
-
0003487728
-
High performance Fortran language specification version 1.0
-
Draft, Jan.
-
High Performance Fortran Forum, High performance Fortran language specification version 1.0, Draft, Jan. 1993.
-
(1993)
High Performance Fortran Forum
-
-
-
27
-
-
0023328834
-
Communication efficient basic linear algebra computations on hypercube architectures
-
Johnsson S. L. Communication efficient basic linear algebra computations on hypercube architectures. J. Parallel Distrib. Comput. 4:1987.
-
(1987)
J. Parallel Distrib. Comput.
, vol.4
-
-
Johnsson, S.L.1
-
28
-
-
0010807779
-
On the concurrency of C++
-
May 219, Ontario, Canada
-
X. Li, H. Huang, May 1993, On the concurrency of C++, Proceedings ICCI'93. Fifth International Conference on Computing and Information, 215, 219, Ontario, Canada.
-
(1993)
Proceedings ICCI'93. Fifth International Conference on Computing and Information
, pp. 215
-
-
Li, X.1
Huang, H.2
-
29
-
-
0026840122
-
The multifrontal method for sparse matrix solution: Theory and practice
-
Liu J. W. H. The multifrontal method for sparse matrix solution: Theory and practice. SIAM Rev. 34:1992;82-109.
-
(1992)
SIAM Rev.
, vol.34
, pp. 82-109
-
-
Liu, J.W.H.1
-
30
-
-
0028195126
-
Scheduling malleable and nonmalleable parallel tasks
-
176, ACM-SIAM, New York
-
W. Ludwig, P. Tiwari, 1994, Scheduling malleable and nonmalleable parallel tasks, Symposium on Discrete Algorithms (SODA), 167, 176, ACM-SIAM, New York.
-
(1994)
Symposium on Discrete Algorithms (SODA)
, pp. 167
-
-
Ludwig, W.1
Tiwari, P.2
-
31
-
-
0011612024
-
Implementing an efficient portable global memory layer on distributed memory multiprocessors
-
UCB/CSD-94-810:May
-
Luna S. Implementing an efficient portable global memory layer on distributed memory multiprocessors. Technical Report. UCB/CSD-94-810:May 1994.
-
(1994)
Technical Report
-
-
Luna, S.1
-
32
-
-
0027868988
-
Parallelization and distribution of a coupled atmosphere-ocean general circulation model
-
Mechoso C. R., Ma C.-C., Farrara J., Spahr J. A., Moore R. W. Parallelization and distribution of a coupled atmosphere-ocean general circulation model. Monthly Weather Rev. 121:1993;2062-2076.
-
(1993)
Monthly Weather Rev.
, vol.121
, pp. 2062-2076
-
-
Mechoso, C.R.1
Ma, C.-C.2
Farrara, J.3
Spahr, J.A.4
Moore, R.W.5
-
33
-
-
0025418536
-
Towards an architecture-independent analysis of parallel algorithms
-
Papadimitriou C. H., Yannakakis M. Towards an architecture-independent analysis of parallel algorithms. SIAM J. Comput. 19:1990;322-328.
-
(1990)
SIAM J. Comput.
, vol.19
, pp. 322-328
-
-
Papadimitriou, C.H.1
Yannakakis, M.2
-
34
-
-
84904357426
-
A convex programming approach for exploiting data and functional parallelism on distributed memory multiprocessors
-
IEEE, New York
-
S. Ramaswamy, S. Sapatnekar, P. Banerjee, 1994, A convex programming approach for exploiting data and functional parallelism on distributed memory multiprocessors, International Conference on Parallel Processing (ICPP), IEEE, New York.
-
(1994)
International Conference on Parallel Processing (ICPP)
-
-
Ramaswamy, S.1
Sapatnekar, S.2
Banerjee, P.3
-
37
-
-
0026213832
-
Automatic partitioning of a program dependence graph into parallel tasks
-
Sarkar V. Automatic partitioning of a program dependence graph into parallel tasks. IBM J. Res. Devel. 35:1991.
-
(1991)
IBM J. Res. Devel.
, vol.35
-
-
Sarkar, V.1
-
38
-
-
0010921885
-
Modeling the performance of linear systems solvers on distributed memory multiprocessors
-
Stanley K., Demmel J. Modeling the performance of linear systems solvers on distributed memory multiprocessors. Technical report. 1994.
-
(1994)
Technical Report
-
-
Stanley, K.1
Demmel, J.2
-
39
-
-
0027845715
-
Exploiting task and data parallelism on a multicomputer
-
New York: ACM-SIGPLAN. p. 13-22
-
Subhlok J., Stichnoth J., O'Hallaron D., Gross T. Exploiting task and data parallelism on a multicomputer. Principles and Practice of Parallel Programming (PPoPP), San Diego. May 1993;ACM-SIGPLAN, New York. p. 13-22.
-
(1993)
Principles and Practice of Parallel Programming (PPoPP), San Diego
-
-
Subhlok, J.1
Stichnoth, J.2
O'Hallaron, D.3
Gross, T.4
-
43
-
-
0027796598
-
Parallel timing simulation on a distributed memory multiprocessor
-
November Santa Clara, CA
-
C.-P. Wen, K. Yelick, November 1993, Parallel timing simulation on a distributed memory multiprocessor, International Conference on CAD, Santa Clara, CA.
-
(1993)
International Conference on CAD
-
-
Wen, C.-P.1
Yelick, K.2
-
44
-
-
0003687069
-
Basic linear algebra communication subprograms: Analysis and implementation across multiple parallel architectures
-
Whaley R. C. Basic linear algebra communication subprograms: Analysis and implementation across multiple parallel architectures. Technical report. [LAPACK Working Note 73]:June 1994.
-
(1994)
Technical Report
, vol.73
-
-
Whaley, R.C.1
-
45
-
-
0010921886
-
Basic linear algebra communication subroutines: Analysis and implementation across multiple parallel architectures
-
Whaley R. C. Basic linear algebra communication subroutines: Analysis and implementation across multiple parallel architectures. Technical report. [LAPACK Working Note 73]:June 1994.
-
(1994)
Technical Report
, vol.73
-
-
Whaley, R.C.1
|