메뉴 건너뛰기




Volumn 32, Issue 6, 2010, Pages 3495-3523

Communication-optimal parallel and sequential Cholesky decomposition

Author keywords

Algorithm; Bandwidth; Cholesky decomposition; Communication avoiding; Latency; Lower bound

Indexed keywords

COSTS; FACTORIZATION; LINEAR SYSTEMS; LOWER-UPPER DECOMPOSITION; MEMORY ARCHITECTURE;

EID: 79251563454     PISSN: 10648275     EISSN: None     Source Type: Journal    
DOI: 10.1137/090760969     Document Type: Article
Times cited : (40)

References (30)
  • 1
    • 69149088136 scopus 로고    scopus 로고
    • IEEE standard for floating-point arithmetic
    • IEEE standard for floating-point arithmetic, IEEE Std. 754-2008, (2008), pp. 1-58.
    • (2008) IEEE Std. , vol.754-2008 , pp. 1-58
  • 2
    • 0024082546 scopus 로고
    • The input/output complexity of sorting and related problems
    • A. Aggarwal and J. S. Vitter, The input/output complexity of sorting and related problems, Commun. ACM, 31 (1988), pp. 1116-1127.
    • (1988) Commun. ACM , vol.31 , pp. 1116-1127
    • Aggarwal, A.1    Vitter, J.S.2
  • 4
    • 18044400448 scopus 로고    scopus 로고
    • A recursive formulation of Cholesky factorization of a matrix in packed storage format
    • B. S. Andersen, F. G. Gustavson, and J. Wasniewski, A recursive formulation of Cholesky factorization of a matrix in packed storage format, ACM Trans. Math. Software, 27 (2001), pp. 214-244.
    • (2001) ACM Trans. Math. Software , vol.27 , pp. 214-244
    • Andersen, B.S.1    Gustavson, F.G.2    Wasniewski, J.3
  • 6
    • 45449120592 scopus 로고    scopus 로고
    • Hardware-oriented implementation of cache oblivious matrix operations based on space-filling curves
    • Parallel Processing and Applied Mathematics, 7th International Conference, PPAM, Springer-Verlag, New York
    • M. Bader, R. Franz, S. Guenther, and A. Heinecke, Hardware-oriented implementation of cache oblivious matrix operations based on space-filling curves, in Parallel Processing and Applied Mathematics, 7th International Conference, PPAM 2007, Lecture Notes in Comput. Sci. 4967, Springer-Verlag, New York, 2008, pp. 628-638.
    • (2007) Lecture Notes in Comput. Sci. , vol.4967 , Issue.2008 , pp. 628-638
    • Bader, M.1    Franz, R.2    Guenther, S.3    Heinecke, A.4
  • 10
    • 70449440599 scopus 로고    scopus 로고
    • Out-of-core implementations of Cholesky factorization: Loop-based versus recursive algorithms
    • N. Béreux, Out-of-core implementations of Cholesky factorization: Loop-based versus recursive algorithms, SIAM J. Matrix Anal. Appl., 30 (2008), pp. 1302-1319.
    • (2008) SIAM J. Matrix Anal. Appl. , vol.30 , pp. 1302-1319
    • Béreux, N.1
  • 13
    • 77953980008 scopus 로고    scopus 로고
    • Communication-optimal parallel and sequential QR and LU factorizations
    • Technical report EECS- 2008-89 University of California Berkeley, Berkeley, CA. submitted
    • J. Demmel, L. Grigori, M. Hoemmen, and J. Langou, Communication-optimal Parallel and Sequential QR and LU Factorizations, Technical report EECS-2008-89, University of California Berkeley, Berkeley, CA, 2008, SIAM. J. Sci. Comput., submitted.
    • (2008) SIAM. J. Sci. Comput.
    • Demmel, J.1    Grigori, L.2    Hoemmen, M.3    Langou, J.4
  • 14
    • 85140867620 scopus 로고    scopus 로고
    • Implementing communication-optimal parallel and sequential QR and LU factorizations
    • submitted
    • J. Demmel, L. Grigori, M. Hoemmen, and J. Langou, Implementing communication-optimal parallel and sequential QR and LU factorizations, SIAM. J. Sci. Comput., submitted.
    • SIAM. J. Sci. Comput.
    • Demmel, J.1    Grigori, L.2    Hoemmen, M.3    Langou, J.4
  • 16
    • 1842832833 scopus 로고    scopus 로고
    • Recursive blocked algorithms and hybrid data structures for dense matrix library software
    • E. Elmroth, F. Gustavson, I. Jonsson, and B. Ka°gström, Recursive blocked algorithms and hybrid data structures for dense matrix library software, SIAM Rev., 46 (2004), pp. 3-45.
    • (2004) SIAM Rev. , vol.46 , pp. 3-45
    • Elmroth, E.1    Gustavson, F.2    Jonsson, I.3    Kagström, B.4
  • 19
    • 79251581739 scopus 로고    scopus 로고
    • Personal communication
    • L. Grigori. Personal communication, 2009.
    • (2009)
    • Grigori., L.1
  • 20
    • 0031273280 scopus 로고    scopus 로고
    • Recursion leads to automatic variable blocking for dense linear-algebra algorithms
    • F. G. Gustavson, Recursion leads to automatic variable blocking for dense linear-algebra algorithms, IBM J. Res. Dev., 41 (1997), pp. 737-756.
    • (1997) IBM J. Res. Dev. , vol.41 , pp. 737-756
    • Gustavson, F.G.1
  • 21
    • 84956987224 scopus 로고    scopus 로고
    • High performance Cholesky factorization via blocking and recursion that uses minimal storage
    • New Paradigms for HPC in Industry and Academia, London, UK Springer-Verlag
    • F. G. Gustavson and I. Jonsson, High performance Cholesky factorization via blocking and recursion that uses minimal storage, in PARA '00: Proceedings of the 5th International Workshop on Applied Parallel Computing, New Paradigms for HPC in Industry and Academia, London, UK, 2001, Springer-Verlag, pp. 82-91.
    • (2001) PARA '00: Proceedings of the 5th International Workshop on Applied Parallel Computing , pp. 82-91
    • Gustavson, F.G.1    Jonsson, I.2
  • 25
    • 10844258198 scopus 로고    scopus 로고
    • Communication lower bounds for distributed-memory matrix multiplication
    • D. Irony, S. Toledo, and A. Tiskin, Communication lower bounds for distributed-memory matrix multiplication, J. Parallel Distrib. Comput., 64 (2004), pp. 1017-1026.
    • (2004) J. Parallel Distrib. Comput. , vol.64 , pp. 1017-1026
    • Irony, D.1    Toledo, S.2    Tiskin, A.3
  • 26
    • 84957579840 scopus 로고
    • Extending the Hong-Kung model to memory hierarchies
    • J. E. Savage, Extending the Hong-Kung model to memory hierarchies, in COCOON, 1995, pp. 270-281.
    • (1995) COCOON , pp. 270-281
    • Savage, J.E.1
  • 28
    • 0031496750 scopus 로고    scopus 로고
    • Locality of reference in LU decomposition with partial pivoting
    • S. Toledo, Locality of reference in LU decomposition with partial pivoting, SIAM J. Matrix Anal. Appl., 18 (1997), pp. 1065-1081.
    • (1997) SIAM J. Matrix Anal. Appl. , vol.18 , pp. 1065-1081
    • Toledo, S.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.