-
1
-
-
85046166672
-
A 3D approach to parallel matrix multiplication
-
R. C. Agarwal, S. M. Balle, F. G. Gustavson, M. Joshi, and P.Palkar. A 3D approach to parallel matrix multiplication. IBM J. Res. Develop., 1995.
-
(1995)
IBM J. Res. Develop.
-
-
Agarwal, R.C.1
Balle, S.M.2
Gustavson, F.G.3
Joshi, M.4
Palkar, P.5
-
2
-
-
0028545949
-
A high-performance matrix multiplication algorithm on a distributed-memory parallel computer using overlapped communication
-
R. C. Agarwal, F. G. Gustavson, and M. Zubair. A high-performance matrix multiplication algorithm on a distributed-memory parallel computer using overlapped communication. IBM J. Res. Develop., 1994.
-
(1994)
IBM J. Res. Develop.
-
-
Agarwal, R.C.1
Gustavson, F.G.2
Zubair, M.3
-
4
-
-
0024883116
-
Communication efficient matrix multiplication on hypercubes
-
J. Berntsen. Communication efficient matrix multiplication on hypercubes. Parallel Computing, 12:335-342, 1989.
-
(1989)
Parallel Computing
, vol.12
, pp. 335-342
-
-
Berntsen, J.1
-
7
-
-
0037631791
-
Efficient mapping and implementation of matrix algorithms on a hypercube
-
V. Cherkassky and R. Smith. Efficient mapping and implementation of matrix algorithms on a hypercube. J. Supercomputing, 2:7-27, 1988.
-
(1988)
J. Supercomputing
, vol.2
, pp. 7-27
-
-
Cherkassky, V.1
Smith, R.2
-
8
-
-
0028530654
-
PUMMA: Parallel universal matrix multiplication algorithms on distributed-memory concurrent computers
-
October
-
J. Choi, J. Dongarra, and D. W. Walker. PUMMA: Parallel Universal Matrix Multiplication Algorithms on Distributed-Memory Concurrent Computers. Concurrency: Pract. & Exper., Vol. 6, October 1994.
-
(1994)
Concurrency: Pract. & Exper.
, vol.6
-
-
Choi, J.1
Dongarra, J.2
Walker, D.W.3
-
9
-
-
0030287932
-
LogP: A practical model of parallel computation
-
D. E. Culler, R. M. Karp, D. A. Patterson, A. Sahay, E. Santos, K. E. Schauser, R. Subramonian, and T. von Eicken. LogP: A practical model of parallel computation. Communications of the ACM, 37(11):78-85, 1996.
-
(1996)
Communications of the ACM
, vol.37
, Issue.11
, pp. 78-85
-
-
Culler, D.E.1
Karp, R.M.2
Patterson, D.A.3
Sahay, A.4
Santos, E.5
Schauser, K.E.6
Subramonian, R.7
Von Eicken, T.8
-
12
-
-
0037631801
-
Domain decomposition in distributed and shared memory environments
-
G. Fox. Domain decomposition in distributed and shared memory environments. In Proceedings of the Int. Conf. on Supercomputing, 1042-1073, 1987.
-
(1987)
Proceedings of the Int. Conf. on Supercomputing
, pp. 1042-1073
-
-
Fox, G.1
-
13
-
-
0003506603
-
-
Prentice-Hall
-
G. Fox, M. Johnson, G. Lyzenga, S. Otto, J. Salmon, and D. Walker. Solving Problems on Concurrent Processors, Vol. i, Prentice-Hall, 1988.
-
(1988)
Solving Problems on Concurrent Processors
, vol.1
-
-
Fox, G.1
Johnson, M.2
Lyzenga, G.3
Otto, S.4
Salmon, J.5
Walker, D.6
-
14
-
-
0023288009
-
Matrix algorithms on a hypercube i: Matrix multiplication
-
G. Fox, S. Otto, and A. Hey. Matrix algorithms on a hypercube i: Matrix multiplication. Parallel Computing, 4:17-31, 1987.
-
(1987)
Parallel Computing
, vol.4
, pp. 17-31
-
-
Fox, G.1
Otto, S.2
Hey, A.3
-
15
-
-
84990908392
-
Scalability of parallel algorithms for matrix multiplications
-
University of Minnesota
-
A. Gupta and V. Kumar. Scalability of parallel algorithms for matrix multiplications. Technical Report 91-54, University of Minnesota, 1991.
-
(1991)
Technical Report
, vol.91
, Issue.54
-
-
Gupta, A.1
Kumar, V.2
-
20
-
-
0031146653
-
A poly-algorithm for parallel dense matrix multiplication on 2D process grid topologies
-
G. Li, A. Skjellum, and R. D. Falgout. A poly-algorithm for parallel dense matrix multiplication on 2D process grid topologies. Concurrency: Pract. and Expr., 9(5):345-389, 1997.
-
(1997)
Concurrency: Pract. and Expr.
, vol.9
, Issue.5
, pp. 345-389
-
-
Li, G.1
Skjellum, A.2
Falgout, R.D.3
-
23
-
-
0034226293
-
Matrix multiplication and data routing using a partitioned optical passive stars network
-
S. Sahni. Matrix multiplication and data routing using a partitioned optical passive stars network. IEEE Trans. on Parallel and Distributed Systems, 11(7), 2000.
-
(2000)
IEEE Trans. on Parallel and Distributed Systems
, vol.11
, Issue.7
-
-
Sahni, S.1
-
24
-
-
0036106373
-
Optimal and efficient parallel algorithms for summing and prefix summing
-
E. E. Santos. Optimal and efficient parallel algorithms for summing and prefix summing. J. Parallel and Distributed Computing, 62(4), 517-543, 2002.
-
(2002)
J. Parallel and Distributed Computing
, vol.62
, Issue.4
, pp. 517-543
-
-
Santos, E.E.1
-
26
-
-
84870926569
-
Optimal parallel algorithms for solving tridiagonal linear systems
-
E. E. Santos. Optimal parallel algorithms for solving tridiagonal linear systems. In Springer-Verlag Lecture Notes in Compuer Science #1300, 1997.
-
Springer-Verlag Lecture Notes in Computer Science
, vol.1300
, pp. 1997
-
-
Santos, E.E.1
-
27
-
-
0031123769
-
SUMMA: Scalable universal matrix multiplication algorithm
-
April
-
R. van de Geijn and J. Watts. SUMMA: Scalable universal matrix multiplication algorithm. In Concurrency: Pract. & Exper., Vol. 9, April 1997.
-
(1997)
Concurrency: Pract. & Exper.
, vol.9
-
-
Van de Geijn, R.1
Watts, J.2
|