메뉴 건너뛰기




Volumn 20, Issue 3, 2004, Pages 425-440

Parallel and fully recursive multifrontal sparse Cholesky

Author keywords

Block layouts; Cilk; Multifrontal factorizations; Parallel Cholesky factorization; Recursive factorizations; Recursive layouts; Sparse Cholesky factorization

Indexed keywords

BLOCK CODES; COMBINATORIAL MATHEMATICS; DATA REDUCTION; MATRIX ALGEBRA; MICROPROCESSOR CHIPS; PARALLEL PROCESSING SYSTEMS; TREES (MATHEMATICS);

EID: 1642372163     PISSN: 0167739X     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.future.2003.07.007     Document Type: Conference Paper
Times cited : (23)

References (42)
  • 1
    • 0028513316 scopus 로고
    • Exploiting functional parallelism of POWER2 to design high-performance numerical algorithms
    • Agarwal R.C., Gustavson F.G., Zubair M. Exploiting functional parallelism of POWER2 to design high-performance numerical algorithms. IBM J. Res. Dev. 38(5):1994;563-576.
    • (1994) IBM J. Res. Dev. , vol.38 , Issue.5 , pp. 563-576
    • Agarwal, R.C.1    Gustavson, F.G.2    Zubair, M.3
  • 2
    • 0028427170 scopus 로고
    • Improving performance of linear algebra algorithms for dense matrices using algorithmic prefetch
    • Agarwal R.C., Gustavson F.G., Zubair M. Improving performance of linear algebra algorithms for dense matrices using algorithmic prefetch. IBM J. Res. Dev. 38(3):1994;265-275.
    • (1994) IBM J. Res. Dev. , vol.38 , Issue.3 , pp. 265-275
    • Agarwal, R.C.1    Gustavson, F.G.2    Zubair, M.3
  • 5
    • 18044400448 scopus 로고    scopus 로고
    • A recursive formulation of Cholesky factorization of a matrix in packed storage
    • Andersen B.S., Waśniewski J., Gustavson F.G. A recursive formulation of Cholesky factorization of a matrix in packed storage. ACM Trans. Math. Softw. 27:2001;214-244.
    • (2001) ACM Trans. Math. Softw. , vol.27 , pp. 214-244
    • Andersen, B.S.1    Waśniewski, J.2    Gustavson, F.G.3
  • 6
    • 0024901312 scopus 로고
    • The influence of relaxed supernode partitions on the multifrontal method
    • Ashcraft C., Grimes R. The influence of relaxed supernode partitions on the multifrontal method. ACM Trans. Math. Softw. 15(4):1989;291-309.
    • (1989) ACM Trans. Math. Softw. , vol.15 , Issue.4 , pp. 291-309
    • Ashcraft, C.1    Grimes, R.2
  • 9
    • 0003459808 scopus 로고
    • Ph.D. Thesis, MIT Department of Electrical Engineering and Computer Science, September
    • R.D. Blumofe, Executing multithreaded programs efficiently, Ph.D. Thesis, MIT Department of Electrical Engineering and Computer Science, September 1995.
    • (1995) Executing Multithreaded Programs Efficiently
    • Blumofe, R.D.1
  • 13
    • 0012493293 scopus 로고
    • A User's Guide to the Blacs v1.0
    • Technical Report UT CS-95-281, University of Tennessee
    • J. Dongarra, R. Whaley, A User's Guide to the Blacs v1.0, Technical Report UT CS-95-281, LAPACK Working Note 94, University of Tennessee, 1995. http://www.netlib.org/blacs/.
    • (1995) LAPACK Working Note , vol.94
    • Dongarra, J.1    Whaley, R.2
  • 17
    • 0020822138 scopus 로고
    • The multifrontal solution of indefinite sparse symmetric linear equations
    • Duff I., Reid J. The multifrontal solution of indefinite sparse symmetric linear equations. ACM Trans. Math. Softw. 9:1983;302-325.
    • (1983) ACM Trans. Math. Softw. , vol.9 , pp. 302-325
    • Duff, I.1    Reid, J.2
  • 18
    • 0022754738 scopus 로고
    • Parallel implementation of multifrontal schemes
    • I.S. Duff, Parallel implementation of multifrontal schemes, Parallel Comput. 3 (1986).
    • (1986) Parallel Comput. , vol.3
    • Duff, I.S.1
  • 19
    • 0034224207 scopus 로고    scopus 로고
    • Applying recursion to serial and parallel QR factorization leads to better performance
    • Elmroth E., Gustavson F. Applying recursion to serial and parallel QR factorization leads to better performance. IBM J. Res. Dev. 44(4):2000;605-624.
    • (2000) IBM J. Res. Dev. , vol.44 , Issue.4 , pp. 605-624
    • Elmroth, E.1    Gustavson, F.2
  • 20
    • 0012536008 scopus 로고    scopus 로고
    • A faster and simpler recursive algorithm for the LAPACK routine DGELS
    • Elmroth E., Gustavson F.G. A faster and simpler recursive algorithm for the LAPACK routine DGELS. BIT. 41:2001;936-949.
    • (2001) BIT , vol.41 , pp. 936-949
    • Elmroth, E.1    Gustavson, F.G.2
  • 21
    • 0031622953 scopus 로고    scopus 로고
    • The implementation of the Cilk-5 multithreaded language
    • Frigo M., Leiserson C.E., Randall K.H. The implementation of the Cilk-5 multithreaded language. ACM SIGPLAN Notices. 33(5):1998;212-223.
    • (1998) ACM SIGPLAN Notices , vol.33 , Issue.5 , pp. 212-223
    • Frigo, M.1    Leiserson, C.E.2    Randall, K.H.3
  • 23
    • 0031140712 scopus 로고    scopus 로고
    • Highly scalable parallel algorithms for sparse matrix factorization
    • Gupta A., Karypis G., Kumar V. Highly scalable parallel algorithms for sparse matrix factorization. IEEE Trans. Parallel Distrib. Syst. 8(5):1997;502-520.
    • (1997) IEEE Trans. Parallel Distrib. Syst. , vol.8 , Issue.5 , pp. 502-520
    • Gupta, A.1    Karypis, G.2    Kumar, V.3
  • 24
    • 84947926251 scopus 로고    scopus 로고
    • Recursive blocked data formats and BLAS's for dense linear algebra algorithms
    • in: B. Kågström, J. Dongarra, E. Elmroth, J. Waśniewski (Eds.), Proceedings of the Fourth International Workshop on Applied Parallel Computing and Large Scale Scientific and Industrial Problems (PARA'98), Springer, Umeå, Sweden, June
    • F. Gustavson, A. Henriksson, I. Jonsson, B. Kågström, P. Ling, Recursive blocked data formats and BLAS's for dense linear algebra algorithms, in: B. Kågström, J. Dongarra, E. Elmroth, J. Waśniewski (Eds.), Proceedings of the Fourth International Workshop on Applied Parallel Computing and Large Scale Scientific and Industrial Problems (PARA'98), Lecture Notes in Computer Science Number 1541, Springer, Umeå, Sweden, June 1998, pp. 574-578.
    • (1998) Lecture Notes in Computer Science Number , vol.1541 , pp. 574-578
    • Gustavson, F.1    Henriksson, A.2    Jonsson, I.3    Kågström, B.4    Ling, P.5
  • 25
    • 0031273280 scopus 로고    scopus 로고
    • Recursion leads to automatic variable blocking for dense linear-algebra algorithms
    • Gustavson F.G. Recursion leads to automatic variable blocking for dense linear-algebra algorithms. IBM J. Res. Dev. 41:1997;737-755.
    • (1997) IBM J. Res. Dev. , vol.41 , pp. 737-755
    • Gustavson, F.G.1
  • 26
    • 0034312453 scopus 로고    scopus 로고
    • Minimal-storage high-performance Cholesky factorization via blocking and recursion
    • Gustavson F.G., Jonsson I. Minimal-storage high-performance Cholesky factorization via blocking and recursion. IBM J. Res. Dev. 44:2000;823-850.
    • (2000) IBM J. Res. Dev. , vol.44 , pp. 823-850
    • Gustavson, F.G.1    Jonsson, I.2
  • 27
    • 0036467470 scopus 로고    scopus 로고
    • PaStiX: A high-performance parallel direct solver for sparse symmetric definite systems
    • Hénon P., Ramet P., Roman J. PaStiX: a high-performance parallel direct solver for sparse symmetric definite systems. Parallel Comput. 28:2002;301-321.
    • (2002) Parallel Comput. , vol.28 , pp. 301-321
    • Hénon, P.1    Ramet, P.2    Roman, J.3
  • 29
    • 1642274432 scopus 로고    scopus 로고
    • Intel, Math Kernel Library (MKL), 2001. http://www.intel.com/software/products/mkl/.
    • (2001) Math Kernel Library (MKL)
  • 31
    • 0003406235 scopus 로고    scopus 로고
    • PSPASES: Scalable parallel direct solver library for sparse symmetric positive definite linear systems
    • User's Manual for Version 1.0.3, Department of Computer Science, University of Minnesota, revised 1999
    • M. Joshi, A. Gupta, F. Gustavson, G. Karypis, V. Kumar, PSPASES: scalable parallel direct solver library for sparse symmetric positive definite linear systems, in: User's Manual for Version 1.0.3, Technical Report TR 97-059, Department of Computer Science, University of Minnesota, 1997, revised 1999.
    • (1997) Technical Report TR 97-059 , vol.TR 97-059
    • Joshi, M.1    Gupta, A.2    Gustavson, F.3    Karypis, G.4    Kumar, V.5
  • 33
    • 0028459839 scopus 로고
    • DXML: A high-performance scientific subroutine library
    • Kamath C., Ho R., Manley D.P. DXML: a high-performance scientific subroutine library. Dig. Tech. J. 6(3):1994;44-56.
    • (1994) Dig. Tech. J. , vol.6 , Issue.3 , pp. 44-56
    • Kamath, C.1    Ho, R.2    Manley, D.P.3
  • 34
    • 0022785798 scopus 로고
    • On the storage requirement in the out-of-core multifrontal method for sparse factorization
    • Liu J.W.H. On the storage requirement in the out-of-core multifrontal method for sparse factorization. ACM Trans. Math. Softw. 12(3):1986;249-264.
    • (1986) ACM Trans. Math. Softw. , vol.12 , Issue.3 , pp. 249-264
    • Liu, J.W.H.1
  • 35
    • 0024877196 scopus 로고
    • The multifrontal method and paging in sparse Cholesky factorization
    • Liu J.W.H. The multifrontal method and paging in sparse Cholesky factorization. ACM Trans. Math. Softw. 15(4):1989;310-325.
    • (1989) ACM Trans. Math. Softw. , vol.15 , Issue.4 , pp. 310-325
    • Liu, J.W.H.1
  • 36
    • 0026840122 scopus 로고
    • The multifrontal method for sparse matrix solution: Theory and practice
    • Liu J.W.H. The multifrontal method for sparse matrix solution: theory and practice. SIAM Rev. 34(1):1992;82-109.
    • (1992) SIAM Rev. , vol.34 , Issue.1 , pp. 82-109
    • Liu, J.W.H.1
  • 39
    • 1642298823 scopus 로고    scopus 로고
    • Cilk-5.3.2 Reference Manual, MIT Laboratory for Computer Science, Cambridge, MA, November
    • Cilk-5.3.2 Reference Manual, Supercomputing Technologies Group, MIT Laboratory for Computer Science, Cambridge, MA, November 2001. http://supertech.lcs.mit.edu/cilk.
    • (2001) Supercomputing Technologies Group
  • 40
    • 0003753533 scopus 로고
    • August, The MathWorks Inc., Natick, MA
    • MATLAB Reference Guide, August 1992, The MathWorks Inc., Natick, MA.
    • (1992) MATLAB Reference Guide
  • 41
    • 0031496750 scopus 로고    scopus 로고
    • Locality of reference in LU decomposition with partial pivoting
    • Toledo S. Locality of reference in LU decomposition with partial pivoting. SIAM J. Matrix Anal. Appl. 18(4):1997;1065-1081.
    • (1997) SIAM J. Matrix Anal. Appl. , vol.18 , Issue.4 , pp. 1065-1081
    • Toledo, S.1
  • 42
    • 0003418094 scopus 로고    scopus 로고
    • Automatically tuned linear algebra software
    • Computer Science Department, University of Tennessee
    • R.C. Whaley, J.J. Dongarra, Automatically tuned linear algebra software, Technical Report, Computer Science Department, University of Tennessee, 1998, http://www.netlib.org/atlas .
    • (1998) Technical Report
    • Whaley, R.C.1    Dongarra, J.J.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.