-
1
-
-
0029370767
-
A threedimensional approach to parallel matrix multiplication
-
Agarwal, R.C., Balle, S.M., Gustavson, F.G., Joshi, M., Palkar, P. A threedimensional approach to parallel matrix multiplication. IBM J. Res. Dev. 39, 5 (1995), 575-582.
-
(1995)
IBM J. Res. Dev.
, vol.39
, Issue.5
, pp. 575-582
-
-
Agarwal, R.C.1
Balle, S.M.2
Gustavson, F.G.3
Joshi, M.4
Palkar, P.5
-
2
-
-
42249105555
-
An elementary construction of constantdegree expanders
-
Alon, N., Schwartz, O., Shapira, A. An elementary construction of constantdegree expanders. Combinator. Probab. Comput. 17, 3 (2008), 319-327.
-
(2008)
Combinator. Probab. Comput.
, vol.17
, Issue.3
, pp. 319-327
-
-
Alon, N.1
Schwartz, O.2
Shapira, A.3
-
3
-
-
0003706460
-
-
Society for Industrial and Applied Mathematics, Philadelphia, PA, USA, Also available from
-
A nderson, E., Bai, Z., Bischof, C., Demmel, J., Dongarra, J., Croz, J.D., Greenbaum, A., Hammarling, S., McKenney, A., Ostrouchov, S., Sorensen, D. LAPACK's User's Guide, Society for Industrial and Applied Mathematics, Philadelphia, PA, USA, 1992. Also available from http://www. netlib.org/lapack/.
-
(1992)
LAPACK's User's Guide
-
-
Anderson, E.1
Bai, Z.2
Bischof, C.3
Demmel, J.4
Dongarra, J.5
Croz, J.D.6
Greenbaum, A.7
Hammarling, S.8
McKenney, A.9
Ostrouchov, S.10
Sorensen, D.11
-
4
-
-
84883515454
-
Communication optimal parallel multiplication of sparse random matrices
-
ACM, New York, NY, USA
-
Ballard, G., Buluç, A., Demmel, J., Grigori, L., Lipshitz, B., Schwartz, O., Toledo, S. Communication Optimal Parallel Multiplication of Sparse Random Matrices. In Proceedings of the 25th ACM Symposium on Parallelism in Algorithms and Architectures, (2013), ACM, New York, NY, USA.
-
(2013)
Proceedings of the 25th ACM Symposium on Parallelism in Algorithms and Architectures
-
-
Ballard, G.1
Buluç, A.2
Demmel, J.3
Grigori, L.4
Lipshitz, B.5
Schwartz, O.6
Toledo, S.7
-
5
-
-
84864146488
-
Brief announcement: Strong scaling of matrix multiplication algorithms and memory-independent communication lower bounds
-
ACM, New York, NY, USA
-
Ballard, G., Demmel, J., Holtz, O., Lipshitz, B., Schwartz, O. Brief announcement: Strong scaling of matrix multiplication algorithms and memory-independent communication lower bounds. In Proceedings of the 24th ACM Symposium on Parallelism in Algorithms and Architectures, (2012), ACM, New York, NY, USA, 77-79.
-
(2012)
Proceedings of the 24th ACM Symposium on Parallelism in Algorithms and Architectures
, pp. 77-79
-
-
Ballard, G.1
Demmel, J.2
Holtz, O.3
Lipshitz, B.4
Schwartz, O.5
-
6
-
-
84864147291
-
Communicationoptimal parallel algorithm for Strassen's matrix multiplication
-
SPAA '12 ACM, New York, NY, USA
-
Ballard, G., Demmel, J., Holtz, O., Lipshitz, B., Schwartz, O. Communicationoptimal parallel algorithm for Strassen's matrix multiplication. In Proceedings of the 24th ACM Symposium on Parallelism in Algorithms and Architectures, SPAA '12 (2012), ACM, New York, NY, USA, 193-204.
-
(2012)
Proceedings of the 24th ACM Symposium on Parallelism in Algorithms and Architectures
, pp. 193-204
-
-
Ballard, G.1
Demmel, J.2
Holtz, O.3
Lipshitz, B.4
Schwartz, O.5
-
7
-
-
85124668424
-
Graph expansion analysis for communication costs of fast rectangular matrix multiplication
-
G. Even and D. Rawitz, eds., Lecture Notes in Computer Science Springer, Berlin-Heidelberg
-
Ballard, G., Demmel, J., Holtz, O., Lipshitz, B., Schwartz, O. Graph expansion analysis for communication costs of fast rectangular matrix multiplication. In Design and Analysis of Algorithms. G. Even and D. Rawitz, eds., Volume 7659 of Lecture Notes in Computer Science (2012), Springer, Berlin-Heidelberg, 13-36.
-
(2012)
Design and Analysis of Algorithms
, vol.7659
, pp. 13-36
-
-
Ballard, G.1
Demmel, J.2
Holtz, O.3
Lipshitz, B.4
Schwartz, O.5
-
8
-
-
79959674766
-
Graph expansion and communication costs of fast matrix multiplication
-
ACM, New York, NY, USA
-
Ballard, G., Demmel, J., Holtz, O., Schwartz, O. Graph expansion and communication costs of fast matrix multiplication. In Proceedings of the 23rd Annual ACM Symposium on Parallel Algorithms and Architectures (2011), ACM, New York, NY, USA, 1-12.
-
(2011)
Proceedings of the 23rd Annual ACM Symposium on Parallel Algorithms and Architectures
, pp. 1-12
-
-
Ballard, G.1
Demmel, J.2
Holtz, O.3
Schwartz, O.4
-
9
-
-
80054034521
-
Minimizing communication in numerical linear algebra
-
Ballard, G., Demmel, J., Holtz, O., Schwartz, O. Minimizing communication in numerical linear algebra. SIAM J. Matrix Anal. Appl. 32, 3 (2011), 866-901.
-
(2011)
SIAM J. Matrix Anal. Appl.
, vol.32
, Issue.3
, pp. 866-901
-
-
Ballard, G.1
Demmel, J.2
Holtz, O.3
Schwartz, O.4
-
10
-
-
84872481259
-
Graph expansion and communication costs of fast matrix multiplication
-
Dec
-
Ballard, G., Demmel, J., Holtz, O., Schwartz, O. Graph expansion and communication costs of fast matrix multiplication. J. ACM (Dec. 2012) 59, 6, 32:1-32:23.
-
(2012)
J. ACM
, vol.59
, Issue.6
, pp. 321-3223
-
-
Ballard, G.1
Demmel, J.2
Holtz, O.3
Schwartz, O.4
-
12
-
-
84893306519
-
-
Christ, M., Demmel, J., Knight, N., Scanlon, T., Yelick, K. Communication lower bounds and optimal algorithms for programs that reference arrays - Part I. Manuscript, 2013.
-
(2013)
Communication Lower Bounds and Optimal Algorithms for Programs That Reference Arrays - Part I. Manuscript
-
-
Christ, M.1
Demmel, J.2
Knight, N.3
Scanlon, T.4
Yelick, K.5
-
13
-
-
35548978022
-
Fast linear algebra is stable
-
Demmel, J., Dumitriu, I., Holtz, O. Fast linear algebra is stable. Numer. Math. 108, 1 (2007), 59-91.
-
(2007)
Numer. Math.
, vol.108
, Issue.1
, pp. 59-91
-
-
Demmel, J.1
Dumitriu, I.2
Holtz, O.3
-
14
-
-
84884883916
-
Communication-optimal parallel recursive rectangular matrix multiplication
-
IEEE
-
Demmel, J., Eliahu, D., Fox, A., Kamil, S., Lipshitz, B., Schwartz, O., Spillinger, O. Communication-optimal parallel recursive rectangular matrix multiplication. In Proceedings of the 27th IEEE International Parallel & Distributed Processing Symposium (IPDPS) (2013), IEEE.
-
(2013)
Proceedings of the 27th IEEE International Parallel & Distributed Processing Symposium (IPDPS)
-
-
Demmel, J.1
Eliahu, D.2
Fox, A.3
Kamil, S.4
Lipshitz, B.5
Schwartz, O.6
Spillinger, O.7
-
15
-
-
84884891973
-
Perfect strong scaling using no additional energy
-
IPDPS '13 IEEE
-
Demmel, J., Gearhart, A., Lipshitz, B., Schwartz, O. Perfect strong scaling using no additional energy. In Proceedings of the 27th IEEE International Parallel & Distributed Processing Symposium, IPDPS '13 (2013), IEEE.
-
(2013)
Proceedings of the 27th IEEE International Parallel & Distributed Processing Symposium
-
-
Demmel, J.1
Gearhart, A.2
Lipshitz, B.3
Schwartz, O.4
-
16
-
-
79959806789
-
-
The National Academies Press, Washington, D.C
-
Fuller, S.H., Millett, L.I., eds. The Future of Computing Performance: Game Over or Next Level? The National Academies Press, Washington, D.C., 2011, 200 pages, http://www.nap.edu.
-
(2011)
The Future of Computing Performance: Game over or Next Level?
, pp. 200
-
-
Fuller, S.H.1
Millett, L.I.2
-
17
-
-
85014324703
-
-
Report of National Research Council of the National Academies Sciences. The National Academies Press, Washington, D.C
-
Graham, S.L., Snir, M., Patterson, C.A., eds. Getting up to Speed: The Future of Supercomputing. Report of National Research Council of the National Academies Sciences. The National Academies Press, Washington, D.C., 2004, 289 pages, http://www.nap.edu.
-
(2004)
Getting Up to Speed: The Future of Supercomputing
, pp. 289
-
-
Graham, S.L.1
Snir, M.2
Patterson, C.A.3
-
18
-
-
84971853043
-
I/O complexity: The red-blue pebble game
-
ACM, New York, NY, USA
-
Hong, J.W., Kung, H.T. I/O complexity: The red-blue pebble game. In STOC '81: Proceedings of the 13th annual ACM Symposium on Theory of Computing (1981), ACM, New York, NY, USA, 326-333.
-
(1981)
STOC '81: Proceedings of the 13th Annual ACM Symposium on Theory of Computing
, pp. 326-333
-
-
Hong, J.W.1
Kung, H.T.2
-
19
-
-
33749002225
-
Expander graphs and their applications
-
Hoory, S., Linial, N., Wigderson, A. Expander graphs and their applications. Bull. AMS 43(4), (2006), 439-561.
-
(2006)
Bull. AMS
, vol.43
, Issue.4
, pp. 439-561
-
-
Hoory, S.1
Linial, N.2
Wigderson, A.3
-
20
-
-
10844258198
-
Communication lower bounds for distributed-memory matrix multiplication
-
I rony, D., Toledo, S., Tiskin, A. Communication lower bounds for distributed-memory matrix multiplication. J. Parallel Distrib. Comput. 64, 9, (2004), 1017-1026.
-
(2004)
J. Parallel Distrib. Comput.
, vol.64
, Issue.9
, pp. 1017-1026
-
-
Irony, D.1
Toledo, S.2
Tiskin, A.3
-
21
-
-
84877716093
-
Communication-avoiding parallel Strassen: Implementation and performance
-
IEEE Computer Society Press, Los Alamitos, CA, USA
-
Lipshitz, B., Ballard, G., Demmel, J., Schwartz, O. Communication- avoiding parallel Strassen: Implementation and performance. In Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis, (2012), IEEE Computer Society Press, Los Alamitos, CA, USA, 101:1-101:11.
-
(2012)
Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis
, pp. 1011-10111
-
-
Lipshitz, B.1
Ballard, G.2
Demmel, J.3
Schwartz, O.4
-
22
-
-
0000743020
-
Memoryefficient matrix multiplication in the BSP model
-
McColl, W.F., Tiskin, A. Memoryefficient matrix multiplication in the BSP model. Algorithmica 24 (1999), 287-297.
-
(1999)
Algorithmica
, vol.24
, pp. 287-297
-
-
McColl, W.F.1
Tiskin, A.2
-
24
-
-
34250487811
-
Gaussian elimination is not optimal
-
Strassen, V. Gaussian elimination is not optimal. Numer. Math. 13 (1969), 354-356.
-
(1969)
Numer. Math.
, vol.13
, pp. 354-356
-
-
Strassen, V.1
-
25
-
-
84862595166
-
Multiplying matrices faster than Coppersmith-Winograd
-
STO C '12 ACM, New York, NY, USA
-
Williams, V.V. Multiplying matrices faster than Coppersmith-Winograd. In Proceedings of the 44th Symposium on Theory of Computing, STO C '12 (2012), ACM, New York, NY, USA, 887-898.
-
(2012)
Proceedings of the 44th Symposium on Theory of Computing
, pp. 887-898
-
-
Williams V, V.1
|