메뉴 건너뛰기




Volumn 40, Issue 1, 2000, Pages 158-176

Efficient sparse LU factorization with left-right looking strategy on shared memory multiprocessors

Author keywords

Asynchronous computation scheduling; Device simulation; Multigrid coarse solver; Parallel sparse LU factorization; Process simulation; SMP parallel computing

Indexed keywords


EID: 0038657447     PISSN: 00063835     EISSN: None     Source Type: Journal    
DOI: 10.1023/A:1022326604210     Document Type: Article
Times cited : (161)

References (35)
  • 1
    • 84973743772 scopus 로고
    • Memory management issues in sparse multifrontal methods on multiprocessors
    • P. Amestoy and I. Duff, Memory management issues in sparse multifrontal methods on multiprocessors, Internat. J. Supercomputer Appl., 7 (1993), pp. 64-82.
    • (1993) Internat. J. Supercomputer Appl. , vol.7 , pp. 64-82
    • Amestoy, P.1    Duff, I.2
  • 4
    • 0003995040 scopus 로고
    • A fast multilevel implementation of recursive spectral bisection for partitioning unstructured problems
    • NASA Ames Research Center
    • S. Barnard and H. Simon, A fast multilevel implementation of recursive spectral bisection for partitioning unstructured problems, Tech. Rep. RNR-92-033, NASA Ames Research Center, 1994.
    • (1994) Tech. Rep. RNR-92-033
    • Barnard, S.1    Simon, H.2
  • 5
    • 84966228742 scopus 로고
    • Some stable methods for calculating inertia and solving symmetric linear systems
    • J. Bunch and L. Kaufman, Some stable methods for calculating inertia and solving symmetric linear systems, Math. Comp., 31 (1977), pp. 162-179.
    • (1977) Math. Comp. , vol.31 , pp. 162-179
    • Bunch, J.1    Kaufman, L.2
  • 6
    • 0039367538 scopus 로고
    • Level 3 BLAS in LU factorization on the CRAY-2, ETA-10P, and IBM 3090-200/VF
    • M. J. Daydé and I. S. Duff, Level 3 BLAS in LU factorization on the CRAY-2, ETA-10P, and IBM 3090-200/VF, Int. J. Supercomputer Appl., 3 (1989), pp. 40-70.
    • (1989) Int. J. Supercomputer Appl. , vol.3 , pp. 40-70
    • Daydé, M.J.1    Duff, I.S.2
  • 7
    • 0003495172 scopus 로고    scopus 로고
    • An asynchronous parallel supernodal algorithm for sparse Gaussian elimination
    • Computer Science Division, University of California, Berkeley, CA
    • J. Demmel, J. Gilbert, and X. Li., An asynchronous parallel supernodal algorithm for sparse Gaussian elimination, Tech. Report CSD-97-943, Computer Science Division, University of California, Berkeley, CA, 1997.
    • (1997) Tech. Report CSD-97-943
    • Demmel, J.1    Gilbert, J.2    Li, X.3
  • 8
    • 0003533609 scopus 로고    scopus 로고
    • Performance of various computers using standard linear equations software, (Linpack benchmark report)
    • Department of Computer Science, University of Tennessee, Knoxville, TN
    • J. Dongarra., Performance of various computers using standard linear equations software, (Linpack benchmark report), Tech. Rep. CS-89-85, Department of Computer Science, University of Tennessee, Knoxville, TN, 1998.
    • (1998) Tech. Rep. CS-89-85
    • Dongarra, J.1
  • 9
    • 0002411520 scopus 로고
    • LAPACK: A portable high-performance numerical library for linear algebra
    • J. Dongarra and J. Demmel, LAPACK: A portable high-performance numerical library for linear algebra, Supercomput., 8 (1991), pp. 33-38.
    • (1991) Supercomput. , vol.8 , pp. 33-38
    • Dongarra, J.1    Demmel, J.2
  • 12
    • 0004109578 scopus 로고
    • Users' guide for the Harwell-Boeing sparse matrix collection, release 1
    • Rutherford Appleton Laboratory, Didcot, UK
    • I. Duff, R. Grimes, and J. Lewis, Users' guide for the Harwell-Boeing sparse matrix collection, release 1, Tech. Report, RAL-92-086, Rutherford Appleton Laboratory, Didcot, UK, 1992.
    • (1992) Tech. Report, RAL-92-086
    • Duff, I.1    Grimes, R.2    Lewis, J.3
  • 13
    • 0024720802 scopus 로고
    • Multiprocessing a sparse matrix code on the Alliant FX/8
    • I. S. Duff, Multiprocessing a sparse matrix code on the Alliant FX/8, J. Comput. Appl. Math., 27 (1989), pp. 229-239.
    • (1989) J. Comput. Appl. Math. , vol.27 , pp. 229-239
    • Duff, I.S.1
  • 14
    • 0031998732 scopus 로고    scopus 로고
    • Efficient sparse LU factorization with partial pivoting on distributed memory architectures
    • C. Fu, X. Jiao, and T. Yang, Efficient sparse LU factorization with partial pivoting on distributed memory architectures, IEEE Trans. Parallel Distrib. Systems, 9 (1998), pp. 109-125.
    • (1998) IEEE Trans. Parallel Distrib. Systems , vol.9 , pp. 109-125
    • Fu, C.1    Jiao, X.2    Yang, T.3
  • 16
    • 0031140712 scopus 로고    scopus 로고
    • Highly scalable parallel algorithms for sparse matrix factorization
    • A. Gupta, G. Karypis, and V. Kumar, Highly scalable parallel algorithms for sparse matrix factorization, IEEE Trans. Parallel Distrib. Systems, 8 (1997), pp. 502-520.
    • (1997) IEEE Trans. Parallel Distrib. Systems , vol.8 , pp. 502-520
    • Gupta, A.1    Karypis, G.2    Kumar, V.3
  • 18
    • 0004199626 scopus 로고    scopus 로고
    • ISE Integrated Systems Engineering AG
    • -ISE Reference Manual, ISE Integrated Systems Engineering AG, 1998.
    • (1998) -ISE Reference Manual
  • 19
    • 0004199626 scopus 로고    scopus 로고
    • ISE Integrated Systems Engineering AG
    • -ISE Reference Manual, ISE Integrated Systems Engineering AG, 1998.
    • (1998) -ISE Reference Manual
  • 20
    • 0003735970 scopus 로고
    • Analysis of multilevel graph algorithms
    • Department of Computer Science, University of Minnesota, Minneapolis, MN
    • G. Karypis and V. Kumar, Analysis of multilevel graph algorithms, Tech. Report MN 95-037, Department of Computer Science, University of Minnesota, Minneapolis, MN, 1995.
    • (1995) Tech. Report MN 95-037
    • Karypis, G.1    Kumar, V.2
  • 21
    • 0004030623 scopus 로고    scopus 로고
    • Multilevel algorithms for multi-constraint graph partitioning
    • Department of Computer Science, University of Minnesota, Minneapolis, MN
    • G. Karypis and V. Kumar, Multilevel algorithms for multi-constraint graph partitioning, Tech. Report MN 98-019, Department of Computer Science, University of Minnesota, Minneapolis, MN, 1998.
    • (1998) Tech. Report MN 98-019
    • Karypis, G.1    Kumar, V.2
  • 22
    • 0042724958 scopus 로고    scopus 로고
    • ParMETIS: Parallel graph partitioning library
    • University of Minnesota, Department of Computer Science.
    • G. Karypis and V. Kumar, ParMETIS: Parallel graph partitioning library, Tech. Report, University of Minnesota, Department of Computer Science. Available via URL http://www-users.cs.umn.edu/∼metis/, Sept. 1998.
    • Tech. Report
    • Karypis, G.1    Kumar, V.2
  • 23
    • 0002806690 scopus 로고    scopus 로고
    • OpenMP: An industry-standard API for shared-memory programming
    • R. M. L. Dagnum, OpenMP: An industry-standard API for shared-memory programming, IEEE Comput. Science Engrg., 1 (1998), pp. 46-55.
    • (1998) IEEE Comput. Science Engrg. , vol.1 , pp. 46-55
    • Dagnum, R.M.L.1
  • 24
    • 0007956992 scopus 로고    scopus 로고
    • Making sparse Gaussian elimination scalable by static pivoting
    • Nov. ACM
    • X. Li and J. Demmel, Making sparse Gaussian elimination scalable by static pivoting, in Proceedings of the Supercomputing 98, Nov. 1998, ACM.
    • (1998) Proceedings of the Supercomputing , vol.98
    • Li, X.1    Demmel, J.2
  • 25
    • 0003950261 scopus 로고    scopus 로고
    • PhD Thesis, UCB//CSD-96-919, Computer Science Division, University of California, Berkeley, CA
    • X. S. Li, Sparse Gaussian Elimination on High Performance Computers, PhD Thesis, UCB//CSD-96-919, Computer Science Division, University of California, Berkeley, CA, 1997.
    • (1997) Sparse Gaussian Elimination on High Performance Computers
    • Li, X.S.1
  • 27
    • 0022077450 scopus 로고
    • Modification of the minimum-degree algorithm by multiple elimination
    • J. W.-H. Liu, Modification of the minimum-degree algorithm by multiple elimination, ACM Trans. Math. Software, 11 (1985), pp. 141-153.
    • (1985) ACM Trans. Math. Software , vol.11 , pp. 141-153
    • Liu, J.W.-H.1
  • 28
    • 0001450272 scopus 로고
    • The role of elimination trees in sparse factorization
    • J. W.-H. Liu, The role of elimination trees in sparse factorization, SIAM J. Matrix Anal. Appl., 11 (1990), pp. 134-172.
    • (1990) SIAM J. Matrix Anal. Appl. , vol.11 , pp. 134-172
    • Liu, J.W.-H.1
  • 29
    • 0029267316 scopus 로고
    • Parallel sparse QR factorization on shared memory architectures
    • P. Matstoms, Parallel sparse QR factorization on shared memory architectures, Parallel Computing, 21 (1995), pp. 473-486.
    • (1995) Parallel Computing , vol.21 , pp. 473-486
    • Matstoms, P.1
  • 30
    • 0042724959 scopus 로고
    • Parallel direct solution of sparse linear systems
    • G. F. Carey, ed., John Wiley, Chichester, UK
    • E. Ng, Parallel direct solution of sparse linear systems, in Parallel Supercomputing: Methods, Algorithms and Applications, G. F. Carey, ed., John Wiley, Chichester, UK, 1989, pp. 157-176.
    • (1989) Parallel Supercomputing: Methods, Algorithms and Applications , pp. 157-176
    • Ng, E.1
  • 31
    • 0000817412 scopus 로고
    • A supernodal Cholesky factorization algorithm for shared-memory multiprocessors
    • E. Ng and B. Peyton, A supernodal Cholesky factorization algorithm for shared-memory multiprocessors, SIAM J. Sci. Comput., 14 (1993), pp. 761-769.
    • (1993) SIAM J. Sci. Comput. , vol.14 , pp. 761-769
    • Ng, E.1    Peyton, B.2
  • 32
    • 0001102965 scopus 로고    scopus 로고
    • Performance of panel and block approaches to sparse Cholesky factorization on the iPSC/860 and Paragon multicomputers
    • E. Rothberg, Performance of panel and block approaches to sparse Cholesky factorization on the iPSC/860 and Paragon multicomputers, SIAM J. Sci. Comput., 17 (1996), pp. 699-711.
    • (1996) SIAM J. Sci. Comput. , vol.17 , pp. 699-711
    • Rothberg, E.1
  • 33
    • 84956861101 scopus 로고    scopus 로고
    • Scalable parallel sparse factorization with left-right looking strategy on shared memory multiprocessors
    • High Performance Computing and Networking, Proceedings of 7th International Conference, HPCN Europe 1999, Amsterdam, P. Sloot, M. Bubak, A. Hoekstra, and B. Hertzberger, eds., Springer-Verlag, Berlin
    • O. Schenk, K. Gärtner, and W. Fichtner, Scalable parallel sparse factorization with left-right looking strategy on shared memory multiprocessors, in High Performance Computing and Networking, Proceedings of 7th International Conference, HPCN Europe 1999, Amsterdam, P. Sloot, M. Bubak, A. Hoekstra, and B. Hertzberger, eds., Lecture Notes in Computational Science Vol. 1593, Springer-Verlag, Berlin, 1999.
    • (1999) Lecture Notes in Computational Science , vol.1593
    • Schenk, O.1    Gärtner, K.2    Fichtner, W.3
  • 34
    • 0041723602 scopus 로고    scopus 로고
    • A parallel sparse direct solver for large structurally symmetric linear systems and parallel multigrid methods for the continuity equations in semiconductor device simulation
    • Integrated Systems Laboratory, ETH, Zürich, Switzerland
    • O. Schenk, K. Gärtner, and W. Fichtner, A parallel sparse direct solver for large structurally symmetric linear systems and parallel multigrid methods for the continuity equations in semiconductor device simulation, Tech. Report 97/17, Integrated Systems Laboratory, ETH, Zürich, Switzerland, 1997.
    • (1997) Tech. Report 97/17
    • Schenk, O.1    Gärtner, K.2    Fichtner, W.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.