-
1
-
-
0029370767
-
A three-dimensional approach to parallel matrix multiplication
-
Agarwal, R.C., Balle, S.M., Gustavson, F.G., Joshi, M., Palkar, P.: A three-dimensional approach to parallel matrix multiplication. IBM J. Res. Dev. 39, 575-582 (1995)
-
(1995)
IBM J. Res. Dev.
, vol.39
, pp. 575-582
-
-
Agarwal, R.C.1
Balle, S.M.2
Gustavson, F.G.3
Joshi, M.4
Palkar, P.5
-
2
-
-
0025231126
-
Communication complexity of PRAMs
-
Aggarwal, A., Chandra, A.K., Snir, M.: Communication complexity of PRAMs. Theoretical Computer Science 71(1), 3-28 (1990)
-
(1990)
Theoretical Computer Science
, vol.71
, Issue.1
, pp. 3-28
-
-
Aggarwal, A.1
Chandra, A.K.2
Snir, M.3
-
4
-
-
0001314661
-
The fan-both family of column-based distributed Cholesky factorization algorithms
-
George, A., Gilbert, J.R., Liu, J.W.H. (eds.) Graph Theory and Sparse Matrix Computation. Springer, Heidelberg
-
Ashcraft, C.: The fan-both family of column-based distributed Cholesky factorization algorithms. In: George, A., Gilbert, J.R., Liu, J.W.H. (eds.) Graph Theory and Sparse Matrix Computation. IMA Volumes in Mathematics and its Applications, vol. 56, pp. 159-190. Springer, Heidelberg (1993)
-
(1993)
IMA Volumes in Mathematics and Its Applications
, vol.56
, pp. 159-190
-
-
Ashcraft, C.1
-
5
-
-
80052309144
-
Minimizing communication in numerical linear algebra
-
To appear in SIAM J. Matrix Anal. Appl.
-
Ballard, G., Demmel, J., Holtz, O., Schwartz, O.: Minimizing communication in numerical linear algebra. To appear in SIAM J. Matrix Anal. Appl., UCB Technical Report EECS-2009-62 (2010)
-
(2010)
UCB Technical Report EECS-2009-62
-
-
Ballard, G.1
Demmel, J.2
Holtz, O.3
Schwartz, O.4
-
6
-
-
0003615167
-
-
Society for Industrial and Applied Mathematics, Philadelphia, PA, USA
-
Blackford, L.S., Choi, J., Cleary, A., D'Azevedo, E., Demmel, J., Dhillon, I., Hammarling, S., Henry, G., Petitet, A., Stanley, K., Walker, D., Whaley, R.C.: ScaLAPACK User's Guide. Society for Industrial and Applied Mathematics, Philadelphia, PA, USA (1997)
-
(1997)
ScaLAPACK User's Guide
-
-
Blackford, L.S.1
Choi, J.2
Cleary, A.3
D'Azevedo, E.4
Demmel, J.5
Dhillon, I.6
Hammarling, S.7
Henry, G.8
Petitet, A.9
Stanley, K.10
Walker, D.11
Whaley, R.C.12
-
8
-
-
0000456144
-
Parallel matrix and graph algorithms
-
Dekel, E., Nassimi, D., Sahni, S.: Parallel matrix and graph algorithms. SIAM Journal on Computing 10(4), 657-675 (1981)
-
(1981)
SIAM Journal on Computing
, vol.10
, Issue.4
, pp. 657-675
-
-
Dekel, E.1
Nassimi, D.2
Sahni, S.3
-
10
-
-
35548978022
-
Fast linear algebra is stable
-
Demmel, J., Dumitriu, I., Holtz, O.: Fast linear algebra is stable. Numerische Mathematik 108, 59-91 (2007)
-
(2007)
Numerische Mathematik
, vol.108
, pp. 59-91
-
-
Demmel, J.1
Dumitriu, I.2
Holtz, O.3
-
11
-
-
77950942204
-
MPI collective communications on the Blue Gene/P supercomputer: Algorithms and optimizations
-
Faraj, A., Kumar, S., Smith, B., Mamidala, A., Gunnels, J.: MPI collective communications on the Blue Gene/P supercomputer: Algorithms and optimizations. In: 17th IEEE Symposium on High Performance Interconnects HOTI 2009, pp. 63-72 (2009)
-
(2009)
17th IEEE Symposium on High Performance Interconnects HOTI 2009
, pp. 63-72
-
-
Faraj, A.1
Kumar, S.2
Smith, B.3
Mamidala, A.4
Gunnels, J.5
-
12
-
-
70350784030
-
Communication avoiding Gaussian elimination
-
IEEE Press, Piscataway
-
Grigori, L., Demmel, J.W., Xiang, H.: Communication avoiding Gaussian elimination. In: Proceedings of the 2008 ACM/IEEE Conference on Supercomputing SC 2008, pp. 29:1-29:12. IEEE Press, Piscataway (2008)
-
(2008)
Proceedings of the 2008 ACM/IEEE Conference on Supercomputing SC 2008
-
-
Grigori, L.1
Demmel, J.W.2
Xiang, H.3
-
13
-
-
0003417929
-
-
MIT Press, Cambridge
-
Gropp, W., Lusk, E., Skjellum, A.: Using MPI: portable parallel programming with the message-passing interface. MIT Press, Cambridge (1994)
-
(1994)
Using MPI: Portable Parallel Programming with the Message-passing Interface
-
-
Gropp, W.1
Lusk, E.2
Skjellum, A.3
-
14
-
-
80052309746
-
Trading replication for communication in parallel distributed-memory dense solvers
-
Irony, D., Toledo, S.: Trading replication for communication in parallel distributed-memory dense solvers. Parallel Processing Letters 71, 3-28 (2002)
-
(2002)
Parallel Processing Letters
, vol.71
, pp. 3-28
-
-
Irony, D.1
Toledo, S.2
-
15
-
-
10844258198
-
Communication lower bounds for distributed-memory matrix multiplication
-
Irony, D., Toledo, S., Tiskin, A.: Communication lower bounds for distributed-memory matrix multiplication. Journal of Parallel and Distributed Computing 64(9), 1017-1026 (2004)
-
(2004)
Journal of Parallel and Distributed Computing
, vol.64
, Issue.9
, pp. 1017-1026
-
-
Irony, D.1
Toledo, S.2
Tiskin, A.3
-
16
-
-
0027702512
-
Minimizing the communication time for matrix multiplication on multiprocessors
-
Johnsson, S.L.: Minimizing the communication time for matrix multiplication on multiprocessors. Parallel Comput. 19, 1235-1257 (1993)
-
(1993)
Parallel Comput.
, vol.19
, pp. 1235-1257
-
-
Johnsson, S.L.1
-
17
-
-
57349161912
-
The deep computing messaging framework: Generalized scalable message passing on the Blue Gene/P supercomputer
-
ACM, New York
-
Kumar, S., Dozsa, G., Almasi, G., Heidelberger, P., Chen, D., Giampapa, M.E., Michael, B., Faraj, A., Parker, J., Ratterman, J., Smith, B., Archer, C.J.: The deep computing messaging framework: generalized scalable message passing on the Blue Gene/P supercomputer. In: Proceedings of the 22nd Annual International Conference on Supercomputing ICS 2008, pp. 94-103. ACM, New York (2008)
-
(2008)
Proceedings of the 22nd Annual International Conference on Supercomputing ICS 2008
, pp. 94-103
-
-
Kumar, S.1
Dozsa, G.2
Almasi, G.3
Heidelberger, P.4
Chen, D.5
Giampapa, M.E.6
Michael, B.7
Faraj, A.8
Parker, J.9
Ratterman, J.10
Smith, B.11
Archer, C.J.12
-
18
-
-
0000743020
-
Memory-efficient matrix multiplication in the BSP model
-
McColl, W.F., Tiskin, A.: Memory-efficient matrix multiplication in the BSP model. Algorithmica 24, 287-297 (1999)
-
(1999)
Algorithmica
, vol.24
, pp. 287-297
-
-
McColl, W.F.1
Tiskin, A.2
-
19
-
-
80052305141
-
-
Tech. Rep. UCB/EECS-2011-10, EECS Department, University of California, Berkeley (February 2011)
-
Solomonik, E., Demmel, J.: Communication-optimal parallel 2.5D matrix multiplication and LU factorization algorithms. Tech. Rep. UCB/EECS-2011-10, EECS Department, University of California, Berkeley (February 2011), http://www.eecs.berkeley.edu/Pubs/TechRpts/2011/EECS-2011-10.html
-
(2011)
Communication-optimal Parallel 2.5D Matrix Multiplication and LU Factorization Algorithms
-
-
Solomonik, E.1
Demmel, J.2