메뉴 건너뛰기




Volumn 8, Issue 5, 1997, Pages 502-520

Highly scalable parallel algorithms for sparse matrix factorization

Author keywords

Cholesky factorization; High performance computing; Parallel processing; Parallel scientific computing; Scalability analysis; Sparse linear systems; Sparse matrices

Indexed keywords

COMPUTER ARCHITECTURE; LEAST SQUARES APPROXIMATIONS; LINEAR PROGRAMMING; MATRIX ALGEBRA; PARALLEL PROCESSING SYSTEMS; PROBLEM SOLVING; SUPERCOMPUTERS;

EID: 0031140712     PISSN: 10459219     EISSN: None     Source Type: Journal    
DOI: 10.1109/71.598277     Document Type: Article
Times cited : (138)

References (62)
  • 2
    • 0001314661 scopus 로고
    • The Fan-Both Family of Column-Based Distributed Cholesky Factorization Algorithms
    • A. George, J.R. Gilbert, and J.W.-H. Liu, eds. New York: Springer-Verlag
    • C. Ashcraft. "The Fan-Both Family of Column-Based Distributed Cholesky Factorization Algorithms," Graph Theory and Sparse Matrix Computations, A. George, J.R. Gilbert, and J.W.-H. Liu, eds. New York: Springer-Verlag, 1993.
    • (1993) Graph Theory and Sparse Matrix Computations
    • Ashcraft, C.1
  • 4
    • 0025683849 scopus 로고
    • Parallel Nested Dissection
    • J.M. Conroy, "Parallel Nested Dissection," Parallel Computing, vol. 16, pp 139-156, 1990.
    • (1990) Parallel Computing , vol.16 , pp. 139-156
    • Conroy, J.M.1
  • 5
    • 0007899068 scopus 로고
    • Multifrontal Sparse Solvers in Message Passing and Data Parallel Environments - A Comparitive Study
    • J.M. Conroy, S.G. Kratzer, and R.F. Lucas, "Multifrontal Sparse Solvers in Message Passing and Data Parallel Environments - A Comparitive Study," Proc. PARCO, 1993.
    • (1993) Proc. PARCO
    • Conroy, J.M.1    Kratzer, S.G.2    Lucas, R.F.3
  • 7
    • 0020822138 scopus 로고
    • The Multifrontal Solution of Indefinite Sparse Symmetric Linear Equations
    • I.S. Duff and J.K. Reid, "The Multifrontal Solution of Indefinite Sparse Symmetric Linear Equations," ACM Trans. Math. Software, vol. 9, pp. 302-325, 1983.
    • (1983) ACM Trans. Math. Software , vol.9 , pp. 302-325
    • Duff, I.S.1    Reid, J.K.2
  • 8
    • 0003100264 scopus 로고
    • Parallel Algorithms for Dense Linear Algebra Computations
    • Mar. appears in K.A. Gallivan et al. Parallel Algorithms for Matrix Computations. Philadelphia: SIAM, 1990
    • K.A. Gallivan, R.J. Plemmons, and A.H. Sameh, "Parallel Algorithms for Dense Linear Algebra Computations," SIAM Rev., vol. 32, no. 1, pp. 54-135, Mar. 1990. Also appears in K.A. Gallivan et al. Parallel Algorithms for Matrix Computations. Philadelphia: SIAM, 1990.
    • (1990) SIAM Rev. , vol.32 , Issue.1 , pp. 54-135
    • Gallivan, K.A.1    Plemmons, R.J.2    Sameh, A.H.3
  • 9
    • 0024717588 scopus 로고
    • Task Scheduling for Parallel Sparse Cholesky Factorization
    • G.A. Geist and E.G.-Y. Ng, "Task Scheduling for Parallel Sparse Cholesky Factorization," Int'l J. Parallel Programming, vol. 18, no. 4, pp. 291-314, 1989.
    • (1989) Int'l J. Parallel Programming , vol.18 , Issue.4 , pp. 291-314
    • Geist, G.A.1    Ng, E.G.-Y.2
  • 10
    • 0039821547 scopus 로고
    • LU Factorization Algorithms on Distributed-Memory Multiprocessor Architectures
    • Technical Report ORNL/TM-10383, Oak Ridge National Laboratory, Oak Ridge, Tenn., 1987
    • G.A. Geist and C.H. Romine, "LU Factorization Algorithms on Distributed-Memory Multiprocessor Architectures," SIAM J. Scientific and Statistical Computing, vol. 9, no. 4, pp. 639-649, 1988. Also available as Technical Report ORNL/TM-10383, Oak Ridge National Laboratory, Oak Ridge, Tenn., 1987.
    • (1988) SIAM J. Scientific and Statistical Computing , vol.9 , Issue.4 , pp. 639-649
    • Geist, G.A.1    Romine, C.H.2
  • 11
    • 0000264382 scopus 로고
    • Nested Dissection of a Regular Finite-Element Mesh
    • A. George, "Nested Dissection of a Regular Finite-Element Mesh," SIAM J. Numerical Analysis, vol. 10, pp. 345-363, 1973.
    • (1973) SIAM J. Numerical Analysis , vol.10 , pp. 345-363
    • George, A.1
  • 14
    • 4043084524 scopus 로고
    • Communication Reduction in Parallel Sparse Cholesky Factorization on a Hypercube
    • M.T. Heath, ed., Philadelphia: SIAM
    • A. George, J.W.-H. Liu, and E.G.-Y. Ng, "Communication Reduction in Parallel Sparse Cholesky Factorization on a Hypercube," Hypercube Multiprocessors 1987, M.T. Heath, ed., pp. 576-586. Philadelphia: SIAM, 1987.
    • (1987) Hypercube Multiprocessors 1987 , pp. 576-586
    • George, A.1    Liu, J.W.-H.2    Ng, E.G.-Y.3
  • 15
    • 0024666243 scopus 로고
    • Communication Results for Parallel Sparse Cholesky Factorization on a Hypercube
    • May
    • A. George, J.W.-H. Liu, and E.G.-Y. Ng, "Communication Results for Parallel Sparse Cholesky Factorization on a Hypercube," Parallel Computing, vol. 10, no. 3, pp. 287-298, May 1989.
    • (1989) Parallel Computing , vol.10 , Issue.3 , pp. 287-298
    • George, A.1    Liu, J.W.-H.2    Ng, E.G.-Y.3
  • 17
    • 0027649341 scopus 로고
    • Isoefficiency: Measuring the Scalability of Parallel Algorithms and Architectures
    • Aug. Technical Report TR 93-24, Dept. of Computer Science, Univ. of Minnesota, Minneapolis, Minn.
    • A. Grama, A. Gupta, and V. Kumar, "Isoefficiency: Measuring the Scalability of Parallel Algorithms and Architectures," IEEE Parallel and Distributed Technology, vol. 1, no. 3, pp. 12-21, Aug. 1993. Also available as Technical Report TR 93-24, Dept. of Computer Science, Univ. of Minnesota, Minneapolis, Minn.
    • (1993) IEEE Parallel and Distributed Technology , vol.1 , Issue.3 , pp. 12-21
    • Grama, A.1    Gupta, A.2    Kumar, V.3
  • 18
    • 33748075444 scopus 로고
    • An Efficient Block-Oriented Approach to Parallel Sparse Cholesky Factorization
    • A. Gupta and E. Rothberg, "An Efficient Block-Oriented Approach to Parallel Sparse Cholesky Factorization," Proc. Supercomputing '93, 1993.
    • (1993) Proc. Supercomputing '93
    • Gupta, A.1    Rothberg, E.2
  • 21
    • 0039301862 scopus 로고    scopus 로고
    • Technical Report RC 20467 (90480), IBM T.J. Watson Research Center, Yorktown Heights, N.Y., 21 May
    • A. Gupta, "Graph Partitioning Based Sparse Matrix Ordering Algorithms for Interior-Point Methods," Technical Report RC 20467 (90480), IBM T.J. Watson Research Center, Yorktown Heights, N.Y., 21 May 1996. Also available at ftp://ftp.cs.umn.edu/ users/kumar/anshul/IP-ordering.ps.Z.
    • (1996) Graph Partitioning Based Sparse Matrix Ordering Algorithms for Interior-Point Methods
    • Gupta, A.1
  • 22
    • 0040487002 scopus 로고    scopus 로고
    • Technical Report RC 20453 (90427), IBM T. J. Watson Research Center, Yorktown Heights, N.Y., 6 May
    • A. Gupta, "WGPP: Watson Graph Partitioning (and Sparse Matrix Ordering) Package: Users Manual," Technical Report RC 20453 (90427), IBM T. J. Watson Research Center, Yorktown Heights, N.Y., 6 May 1996. Available on the WWW at the IBM Research CyberJournal site at http://watson.ibm.com:8080.
    • (1996) WGPP: Watson Graph Partitioning (and Sparse Matrix Ordering) Package: Users Manual
    • Gupta, A.1
  • 23
    • 38248999718 scopus 로고
    • Performance Properties of Large Scale Parallel Systems
    • Technical Report TR 92-32, Dept. of Computer Science, Univ. of Minnesota, Minneapolis, Minn.
    • A. Gupta and V. Kumar, "Performance Properties of Large Scale Parallel Systems," J. Parallel and Distributed Computing, vol. 19, pp. 234-244, 1993. Also available as Technical Report TR 92-32, Dept. of Computer Science, Univ. of Minnesota, Minneapolis, Minn.
    • (1993) J. Parallel and Distributed Computing , vol.19 , pp. 234-244
    • Gupta, A.1    Kumar, V.2
  • 24
    • 0043015968 scopus 로고
    • A Scalable Parallel Algorithm for Sparse Matrix Factorization
    • Technical Report 94-19, Dept. of Computer Science, Univ. of Minnesota, Minneapolis, Minn., appears in
    • A. Gupta and V. Kumar, "A Scalable Parallel Algorithm for Sparse Matrix Factorization," Technical Report 94-19, Dept. of Computer Science, Univ. of Minnesota, Minneapolis, Minn., 1994. A short version appears in Proc. Supercomputing '94. Technical report available in users/kumar at anonymous FTP site ftp.cs.umn.edu.
    • (1994) Proc. Supercomputing '94
    • Gupta, A.1    Kumar, V.2
  • 25
    • 0004318623 scopus 로고
    • Parallel Algorithms for Forward and Back Substitution in Direct Solution of Sparse Linear Systems
    • Dec.
    • A. Gupta and V. Kumar, "Parallel Algorithms for Forward and Back Substitution in Direct Solution of Sparse Linear Systems," Proc. Supercomputing '95, Dec. 1995.
    • (1995) Proc. Supercomputing '95
    • Gupta, A.1    Kumar, V.2
  • 26
    • 0026219647 scopus 로고
    • Parallel Algorithms for Sparse Linear Systems
    • K. A. Gallivan et al., Parallel Algorithms for Matrix Computations. Philadelphia: SIAM, 1990
    • M.T. Heath, E.G.-Y. Ng, and B.W. Peyton, "Parallel Algorithms for Sparse Linear Systems," SIAM Rev., vol. 33, pp. 420-460, 1991. Also appears in K. A. Gallivan et al., Parallel Algorithms for Matrix Computations. Philadelphia: SIAM, 1990.
    • (1991) SIAM Rev. , vol.33 , pp. 420-460
    • Heath, M.T.1    Ng, E.G.-Y.2    Peyton, B.W.3
  • 28
    • 0012652650 scopus 로고
    • Limiting Communication in Parallel Sparse Cholesky Factorization
    • Sept.
    • L. Hulbert and E. Zmijewski, "Limiting Communication in Parallel Sparse Cholesky Factorization," SIAM J. Scientific and Statistical Computing, vol. 12, no. 5, pp. 1,184-1,197, Sept. 1991.
    • (1991) SIAM J. Scientific and Statistical Computing , vol.12 , Issue.5
    • Hulbert, L.1    Zmijewski, E.2
  • 29
    • 0028742178 scopus 로고
    • Parallel Formulation of Interior Point Algorithms
    • Technical Report 94-20, Dept. of Computer Science, Univ. of Minnesota, Minneapolis, Minn., Apr. A short version appears
    • G. Karypis, A. Gupta, and V. Kumar, "Parallel Formulation of Interior Point Algorithms," Technical Report 94-20, Dept. of Computer Science, Univ. of Minnesota, Minneapolis, Minn., Apr. 1994. A short version appears in Proc. Supercomputing '94.
    • (1994) Proc. Supercomputing '94
    • Karypis, G.1    Gupta, A.2    Kumar, V.3
  • 32
    • 0003735970 scopus 로고
    • Technical Report TR 95-036, Dept. of Computer Science, Univ. of Minnesota
    • G. Karypis and V. Kumar, "Parallel Multilevel Graph Partitioning," Technical Report TR 95-036, Dept. of Computer Science, Univ. of Minnesota, 1995.
    • (1995) Parallel Multilevel Graph Partitioning
    • Karypis, G.1    Kumar, V.2
  • 33
    • 33748033687 scopus 로고
    • Sparse Matrix Factorization on simd Parallel Computers
    • A. George, J.R. Gilbert, and J.W.-H. Liu, eds. New York: Springer-Verlag
    • S.G. Kratzer and A.J. Cleary, "Sparse Matrix Factorization on simd Parallel Computers," Graph Theory and Sparse Matrix Computations, A. George, J.R. Gilbert, and J.W.-H. Liu, eds. New York: Springer-Verlag, 1993.
    • (1993) Graph Theory and Sparse Matrix Computations
    • Kratzer, S.G.1    Cleary, A.J.2
  • 36
    • 3543092493 scopus 로고
    • Analyzing Scalability of Parallel Algorithms and Architectures
    • Technical Report TR 91-18, Dept. of Computer Science Department, Univ. of Minnesota, Minneapolis, Minn.
    • V. Kumar and A. Gupta, "Analyzing Scalability of Parallel Algorithms and Architectures," J. Parallel and Distributed Computing, vol. 22, no. 3, pp. 379-391, 1994. Also available as Technical Report TR 91-18, Dept. of Computer Science Department, Univ. of Minnesota, Minneapolis, Minn.
    • (1994) J. Parallel and Distributed Computing , vol.22 , Issue.3 , pp. 379-391
    • Kumar, V.1    Gupta, A.2
  • 38
    • 0018457301 scopus 로고
    • A Separator Theorem for Planar Graphs
    • R.J. Lipton and R.E. Tarjan, "A Separator Theorem for Planar Graphs," SIAM J. Applied Math., vol. 36, pp. 177-189, 1979.
    • (1979) SIAM J. Applied Math. , vol.36 , pp. 177-189
    • Lipton, R.J.1    Tarjan, R.E.2
  • 39
    • 0026840122 scopus 로고
    • The Multifrontal Method for Sparse Matrix Solution: Theory and Practice
    • Technical Report CS-90-04, York Univ., Ontario, Canada, appears in
    • J.W.-H. Liu, "The Multifrontal Method for Sparse Matrix Solution: Theory and Practice," Technical Report CS-90-04, York Univ., Ontario, Canada, 1990. Also appears in SIAM Rev., vol. 34, pp. 82-109, 1992.
    • (1990) SIAM Rev. , vol.34 , pp. 82-109
    • Liu, J.W.-H.1
  • 41
    • 0000936603 scopus 로고
    • A Parallel Solution Method for Large Sparse Systems of Equations
    • Nov.
    • R.F. Lucas, T. Blank, and J.J. Tiemann, "A Parallel Solution Method for Large Sparse Systems of Equations," IEEE Trans. Computer-Aided Design, vol. 6, no. 6, pp. 981-991, Nov. 1987.
    • (1987) IEEE Trans. Computer-Aided Design , vol.6 , Issue.6 , pp. 981-991
    • Lucas, R.F.1    Blank, T.2    Tiemann, J.J.3
  • 43
    • 33748085997 scopus 로고
    • A Grid-Based Subtree-Subcube Assignment Strategy for Solving Partial Differential Equations on Hypercubes
    • May
    • M. Mu and J.R. Rice, "A Grid-Based Subtree-Subcube Assignment Strategy for Solving Partial Differential Equations on Hypercubes," SIAM J. Scientific and Statistical Computing, vol. 13, no. 3, pp. 826-839, May 1992.
    • (1992) SIAM J. Scientific and Statistical Computing , vol.13 , Issue.3 , pp. 826-839
    • Mu, M.1    Rice, J.R.2
  • 44
    • 33748039072 scopus 로고
    • Data Traffic Reduction Schemes Cholesky Factorization on Aynchronous Multiprocessor Systems
    • Also available as Technical Report RC 14500, IBM T.J. Watson Research Center, Yorktown Heights, N.Y.
    • V.K. Naik and M. Patrick, "Data Traffic Reduction Schemes Cholesky Factorization on Aynchronous Multiprocessor Systems," Proc. Supercomputing '89, 1989. Also available as Technical Report RC 14500, IBM T.J. Watson Research Center, Yorktown Heights, N.Y.
    • (1989) Proc. Supercomputing '89
    • Naik, V.K.1    Patrick, M.2
  • 46
    • 0004483809 scopus 로고
    • Assignment and Scheduling in Parallel Matrix Factorization
    • D.P. O'Leary and G.W. Stewart, "Assignment and Scheduling in Parallel Matrix Factorization," Linear Algebra and its Applications, vol. 77, pp. 275-299, 1986.
    • (1986) Linear Algebra and Its Applications , vol.77 , pp. 275-299
    • O'Leary, D.P.1    Stewart, G.W.2
  • 50
    • 0005809076 scopus 로고
    • Technical Report 92-01, Computer Science Dept., Pennsylvania State Univ., University Park, Penn.
    • A. Pothen, H.D. Simon, and L. Wang, "Spectral Nested Dissection," Technical Report 92-01, Computer Science Dept., Pennsylvania State Univ., University Park, Penn., 1992.
    • (1992) Spectral Nested Dissection
    • Pothen, A.1    Simon, H.D.2    Wang, L.3
  • 55
    • 0028583195 scopus 로고
    • Performance of Panel and Block Approaches to Sparse Cholesky Factorization on the iPSC/860 and Paragon Systems
    • May
    • E. Rothberg, "Performance of Panel and Block Approaches to Sparse Cholesky Factorization on the iPSC/860 and Paragon Systems," Proc. 1994 Scalable High Performance Computing Conf., May 1994.
    • (1994) Proc. 1994 Scalable High Performance Computing Conf.
    • Rothberg, E.1
  • 56
    • 84957663751 scopus 로고
    • An Efficient Block-Oriented Approach to Parallel Sparse Cholesky Factorization
    • E. Rothberg and A. Gupta, "An Efficient Block-Oriented Approach to Parallel Sparse Cholesky Factorization," Proc. Supercomputing '92, 1992.
    • (1992) Proc. Supercomputing '92
    • Rothberg, E.1    Gupta, A.2
  • 57
    • 0028733838 scopus 로고
    • Improved Load Distribution in Parallel Sparse Cholesky Factorization
    • E. Rothberg and R. Schreiber, "Improved Load Distribution in Parallel Sparse Cholesky Factorization," Proc. Supercomputing '94, 1994.
    • (1994) Proc. Supercomputing '94
    • Rothberg, E.1    Schreiber, R.2
  • 58
    • 33748064970 scopus 로고
    • Scalability of Sparse Direct Solvers
    • Technical Report RIACS TR 92.13, NASA Ames Research Center, Moffet Field, Calif., May An IMA Workshop Volume, A. George, J.R. Gilbert, and J.W.-H. Liu, eds. New York: Springer-Verlag
    • R. Schreiber, "Scalability of Sparse Direct Solvers," Technical Report RIACS TR 92.13, NASA Ames Research Center, Moffet Field, Calif., May 1992. Also appears in Sparse Matrix Computations: Graph Theory Issues and Algorithms (An IMA Workshop Volume), A. George, J.R. Gilbert, and J.W.-H. Liu, eds. New York: Springer-Verlag, 1993.
    • (1992) Sparse Matrix Computations: Graph Theory Issues and Algorithms
    • Schreiber, R.1
  • 59
    • 0039713448 scopus 로고
    • Technical Report UIUCDCS-R-78-946, Dept. of Computer Science, Univ. of Illinois, Urbana, Ill., Nov.
    • B. Speelpening, "The Generalized Element Method," Technical Report UIUCDCS-R-78-946, Dept. of Computer Science, Univ. of Illinois, Urbana, Ill., Nov. 1978.
    • (1978) The Generalized Element Method
    • Speelpening, B.1
  • 60
    • 0343208125 scopus 로고
    • Technical Report CTC92TR102, Advanced Computing Research Inst., Center for Theory and Simulation in Science and Eng., Cornell Univ., Ithaca, N.Y., Aug.
    • C. Sun, "Efficient Parallel Solutions of Large Sparse SPD Systems on Distributed-Memory Multiprocessors," Technical Report CTC92TR102, Advanced Computing Research Inst., Center for Theory and Simulation in Science and Eng., Cornell Univ., Ithaca, N.Y., Aug. 1992.
    • (1992) Efficient Parallel Solutions of Large Sparse SPD Systems on Distributed-Memory Multiprocessors
    • Sun, C.1
  • 61
    • 0026274696 scopus 로고
    • Effects of Partitioning and Scheduling Sparse Matrix Factorization on Communication and Load Balance
    • S. Venugopal and V.K. Naik, "Effects of Partitioning and Scheduling Sparse Matrix Factorization on Communication and Load Balance," Proc. Supercomputing '91, pp. 866-875, 1991.
    • (1991) Proc. Supercomputing '91 , pp. 866-875
    • Venugopal, S.1    Naik, V.K.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.