메뉴 건너뛰기




Volumn 34, Issue 1, 2012, Pages

Communication-optimal parallel and sequential QR and LU factorizations

Author keywords

Linear algebra; LU factorization; QR factorization

Indexed keywords

LOWER-UPPER DECOMPOSITION;

EID: 84861354409     PISSN: 10648275     EISSN: None     Source Type: Journal    
DOI: 10.1137/080731992     Document Type: Article
Times cited : (299)

References (49)
  • 3
    • 2442576081 scopus 로고    scopus 로고
    • Algorithm 827: Irbleigs: A MATLAB program for computing a few eigenpairs of a large sparse Hermitian matrix
    • J. Baglama, D. Calvetti, and L. Reichel, Algorithm 827: Irbleigs: A MATLAB program for computing a few eigenpairs of a large sparse Hermitian matrix, ACM Trans. Math. Software, 29 (2003), pp. 337-348.
    • (2003) ACM Trans. Math. Software , vol.29 , pp. 337-348
    • Baglama, J.1    Calvetti, D.2    Reichel, L.3
  • 4
    • 84861395314 scopus 로고    scopus 로고
    • Block Arnoldi method
    • Z. Bai, J. W. Demmel, J. J. Dongarra, A. Ruhe, and H. van der Vorst, eds., SIAM, Philadelphia
    • Z. Bai and D. Day, Block Arnoldi method, in Templates for the Solution of Algebraic Eigenvalue Problems: A Practical Guide, Z. Bai, J. W. Demmel, J. J. Dongarra, A. Ruhe, and H. van der Vorst, eds., SIAM, Philadelphia, 2000, pp. 196-204.
    • (2000) Templates for the Solution of Algebraic Eigenvalue Problems: A Practical Guide , pp. 196-204
    • Bai, Z.1    Day, D.2
  • 11
    • 0001023112 scopus 로고
    • Parallel QR decomposition of a rectangular matrix
    • M. Cosnard, J.-M. Muller, and Y. Robert, Parallel QR decomposition of a rectangular matrix, Numer. Math., 48 (1986), pp. 239-249.
    • (1986) Numer. Math. , vol.48 , pp. 239-249
    • Cosnard, M.1    Muller, J.-M.2    Robert, Y.3
  • 12
    • 0000538288 scopus 로고
    • Fast parallel matrix inversion algorithms
    • L. Csanky, Fast parallel matrix inversion algorithms, SIAM J. Comput., 5 (1976), pp. 618-623.
    • (1976) SIAM J. Comput. , vol.5 , pp. 618-623
    • Csanky, L.1
  • 13
    • 84861360773 scopus 로고    scopus 로고
    • New parallel (rank-revealing) QR factorization algorithms
    • Parallel Processing: Eighth International Euro-Par Conference, Paderborn, Germany
    • R. D. D. Cunha, D. Becker, and J. C. Patterson, New parallel (rank-revealing) QR factorization algorithms, in Proceedings of the Euro-Par 2002. Parallel Processing: Eighth International Euro-Par Conference, Paderborn, Germany, 2002.
    • (2002) Proceedings of the Euro-Par 2002
    • Cunha, R.D.D.1    Becker, D.2    Patterson, J.C.3
  • 15
    • 0034487070 scopus 로고    scopus 로고
    • The design and implementation of the parallel out-ofcore ScaLAPACK LU, QR, and Cholesky factorization routines
    • E. D'Azevedo and J. Dongarra, The design and implementation of the parallel out-ofcore ScaLAPACK LU, QR, and Cholesky factorization routines, Concurrency Practice Experience, 12 (2000), pp. 1481-1483.
    • (2000) Concurrency Practice Experience , vol.12 , pp. 1481-1483
    • D'Azevedo, E.1    Dongarra, J.2
  • 17
    • 74049121700 scopus 로고    scopus 로고
    • Nonnegative diagonals and high performance on low-profile matrices from Householder QR
    • J. W. Demmel, M. Hoemmen, Y. Hida, and E. J. Riedy, Nonnegative diagonals and high performance on low-profile matrices from Householder QR, SIAM J. Sci. Comput., 31 (2009), pp. 2832-2841.
    • (2009) SIAM J. Sci. Comput. , vol.31 , pp. 2832-2841
    • Demmel, J.W.1    Hoemmen, M.2    Hida, Y.3    Riedy, E.J.4
  • 20
    • 84947936389 scopus 로고    scopus 로고
    • New serial and parallel recursive QR factorization algorithms for SMP systems
    • B. Kågström, E. Elmroth, J. Dongarra, and J. Wasniewski, eds., Lecture Notes in Comput. Sci., Springer, New York
    • E. Elmroth and F. Gustavson, New serial and parallel recursive QR factorization algorithms for SMP systems, in Proceedings of the Fourth International Workshop on Applied Parallel Computing, Large Scale Scientific and Industrial Problems, B. Kågström, E. Elmroth, J. Dongarra, and J. Wasniewski, eds., Lecture Notes in Comput. Sci. 1541, Springer, New York, 1998, pp. 120-128.
    • (1998) Proceedings of the Fourth International Workshop on Applied Parallel Computing, Large Scale Scientific and Industrial Problems , vol.1541 , pp. 120-128
    • Elmroth, E.1    Gustavson, F.2
  • 21
    • 0034224207 scopus 로고    scopus 로고
    • Applying recursion to serial and parallel QR factorization leads to better performance
    • E. Elmroth and F. Gustavson, Applying recursion to serial and parallel QR factorization leads to better performance, IBM J. Res. Develop., 44 (2000), pp. 605-624.
    • (2000) IBM J. Res. Develop. , vol.44 , pp. 605-624
    • Elmroth, E.1    Gustavson, F.2
  • 22
    • 0038716587 scopus 로고    scopus 로고
    • QR factorization with Morton-ordered quadtree matrices for memory re-use and parallelism
    • J. D. Frens and D. S. Wise, QR factorization with Morton-ordered quadtree matrices for memory re-use and parallelism, SIGPLAN Not., 38 (2003), pp. 144-154.
    • (2003) SIGPLAN Not. , vol.38 , pp. 144-154
    • Frens, J.D.1    Wise, D.S.2
  • 23
    • 0009598276 scopus 로고    scopus 로고
    • A block QMR algorithm for non-Hermitian linear systems with multiple right-hand sides
    • PII S0024379596005290
    • R. W. Freund and M. Malhotra, A block QMR algorithm for non-Hermitian linear systems with multiple right-hand sides, Linear Algebra Appl., 254 (1997), pp. 119-157. (Pubitemid 127377532)
    • (1997) Linear Algebra and Its Applications , vol.254 , Issue.1-3 , pp. 119-157
    • Freund, R.W.1    Malhotra, M.2
  • 24
    • 77953973267 scopus 로고
    • Parallel block schemes for large-scale leastsquares computations
    • R. B. Wilhelmson, ed., University of Illinois Press, Chicago, IL
    • G. H. Golub, R. J. Plemmons, and A. Sameh, Parallel block schemes for large-scale leastsquares computations, in High-Speed Computing: Scientific Applications and Algorithm Design, R. B. Wilhelmson, ed., University of Illinois Press, Chicago, IL, 1988, pp. 171-179.
    • (1988) High-Speed Computing: Scientific Applications and Algorithm Design , pp. 171-179
    • Golub, G.H.1    Plemmons, R.J.2    Sameh, A.3
  • 28
    • 17644368925 scopus 로고    scopus 로고
    • Parallel out-of-core computation and updating of the QR factorization
    • DOI 10.1145/1055531.1055534
    • B. C. Gunter and R. A. van de Geijn, Parallel out-of-core computation and updating of the QR factorization, ACM Trans. Math. Software, 31 (2005), pp. 60-78. (Pubitemid 40557862)
    • (2005) ACM Transactions on Mathematical Software , vol.31 , Issue.1 , pp. 60-78
    • Gunter, B.C.1    Van De Geijn, R.A.2
  • 29
    • 33748688428 scopus 로고    scopus 로고
    • Basis selection in LOBPCG
    • DOI 10.1016/j.jcp.2006.02.007, PII S0021999106000866
    • U. Hetmaniuk and R. Lehoucq, Basis selection in LOBPCG, J. Comput. Phys., 218 (2006), pp. 324-332. (Pubitemid 44389052)
    • (2006) Journal of Computational Physics , vol.218 , Issue.1 , pp. 324-332
    • Hetmaniuk, U.1    Lehoucq, R.2
  • 32
    • 10844258198 scopus 로고    scopus 로고
    • Communication lower bounds for distributed-memory matrix multiplication
    • DOI 10.1016/j.jpdc.2004.03.021
    • D. Irony, S. Toledo, and A. Tiskin, Communication lower bounds for distributed-memory matrix multiplication, J. Parallel Distrib. Comput., 64 (2004), pp. 1017-1026. (Pubitemid 40000755)
    • (2004) Journal of Parallel and Distributed Computing , vol.64 , Issue.9 , pp. 1017-1026
    • Irony, D.1    Toledo, S.2    Tiskin, A.3
  • 34
    • 33746412371 scopus 로고    scopus 로고
    • A. V. Knyazev, BLOPEX, http://www-math.cudenver.edu/~aknyazev/software/ BLOPEX.
    • BLOPEX
    • Knyazev, A.V.1
  • 37
    • 0033323425 scopus 로고    scopus 로고
    • Parallel complexity of numerically accurate linear system solvers
    • M. Leoncini, G. Manzini, and L. Margara, Parallel complexity of numerically accurate linear system solvers, SIAM J. Comput., 28 (1999), pp. 2030-2058. (Pubitemid 30530990)
    • (1999) SIAM Journal on Computing , vol.28 , Issue.6 , pp. 2030-2058
    • Leoncini, M.1    Manzini, G.2    Margara, L.3
  • 40
    • 0001084178 scopus 로고
    • The block conjugate gradient algorithm and related methods
    • D. P. O'Leary, The block conjugate gradient algorithm and related methods, Linear Algebra Appl., 29 (1980), pp. 293-322.
    • (1980) Linear Algebra Appl. , vol.29 , pp. 293-322
    • O'Leary, D.P.1
  • 41
    • 0004481424 scopus 로고
    • Distributed orthogonal factorization: Givens and Householder algorithms
    • A. Pothen and P. Raghavan, Distributed orthogonal factorization: Givens and Householder algorithms, SIAM J. Sci. Statist. Comput., 10 (1989), pp. 1113-1134.
    • (1989) SIAM J. Sci. Statist. Comput. , vol.10 , pp. 1113-1134
    • Pothen, A.1    Raghavan, P.2
  • 44
    • 0344153336 scopus 로고    scopus 로고
    • On the complexity of matrix product
    • R. Raz, On the complexity of matrix product, SIAM J. Comput., 32 (2003), pp. 1356-1369.
    • (2003) SIAM J. Comput. , vol.32 , pp. 1356-1369
    • Raz, R.1
  • 45
    • 0003078924 scopus 로고
    • A storage efficient WY representation for products of Householder transformations
    • R. Schreiber and C. Van Loan, A storage efficient WY representation for products of Householder transformations, SIAM J. Sci. Statist. Comput., 10 (1989), pp. 53-57.
    • (1989) SIAM J. Sci. Statist. Comput. , vol.10 , pp. 53-57
    • Schreiber, R.1    Van Loan, C.2
  • 47
    • 0031496750 scopus 로고    scopus 로고
    • Locality of reference in LU decomposition with partial pivoting
    • S. Toledo, Locality of reference in LU decomposition with partial pivoting, SIAM J. Matrix Anal. Appl., 18 (1997), pp. 1065-1081.
    • (1997) SIAM J. Matrix Anal. Appl. , vol.18 , pp. 1065-1081
    • Toledo, S.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.