메뉴 건너뛰기




Volumn 33, Issue 7-8, 2007, Pages 521-540

Parallel Arnoldi eigensolvers with enhanced scalability via global communications rearrangement

Author keywords

Arnoldi eigensolvers; Distributed memory programming; Iterative Gram Schmidt orthogonalization

Indexed keywords

GLOBAL COMMUNICATIONS; NONSYMMETRIC MATRIX; ORTHOGONALIZATION; PARALLEL ARNOLDI EIGENSOLVERS;

EID: 34447537819     PISSN: 01678191     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.parco.2007.04.004     Document Type: Article
Times cited : (46)

References (35)
  • 1
    • 0002807741 scopus 로고
    • The principle of minimized iterations in the solution of the matrix eigenvalue problem
    • Arnoldi W.E. The principle of minimized iterations in the solution of the matrix eigenvalue problem. Quart. Appl. Math. 9 (1951) 17-29
    • (1951) Quart. Appl. Math. , vol.9 , pp. 17-29
    • Arnoldi, W.E.1
  • 2
    • 49149143501 scopus 로고
    • Variations of Arnoldi's method for computing eigenelements of large unsymmetric matrices
    • Saad Y. Variations of Arnoldi's method for computing eigenelements of large unsymmetric matrices. Linear Algebra Appl. 34 (1980) 269-295
    • (1980) Linear Algebra Appl. , vol.34 , pp. 269-295
    • Saad, Y.1
  • 3
    • 0024663197 scopus 로고
    • Numerical solution of large nonsymmetric eigenvalue problems
    • Saad Y. Numerical solution of large nonsymmetric eigenvalue problems. Comput. Phys. Commun. 53 (1989) 71-90
    • (1989) Comput. Phys. Commun. , vol.53 , pp. 71-90
    • Saad, Y.1
  • 4
    • 0000094594 scopus 로고
    • An iteration method for the solution of the eigenvalue problem of linear differential and integral operators
    • Lanczos C. An iteration method for the solution of the eigenvalue problem of linear differential and integral operators. J. Res. Nat. Bur. Stand. 45 (1950) 255-282
    • (1950) J. Res. Nat. Bur. Stand. , vol.45 , pp. 255-282
    • Lanczos, C.1
  • 5
    • 84966231631 scopus 로고
    • A look-ahead Lánczos algorithm for unsymmetric matrices
    • Parlett B.N., Taylor D.R., and Liu Z.A. A look-ahead Lánczos algorithm for unsymmetric matrices. Math. Comp. 44 169 (1985) 105-124
    • (1985) Math. Comp. , vol.44 , Issue.169 , pp. 105-124
    • Parlett, B.N.1    Taylor, D.R.2    Liu, Z.A.3
  • 6
    • 0031529238 scopus 로고    scopus 로고
    • An efficient implementation of the nonsymmetric Lanczos algorithm
    • Day D. An efficient implementation of the nonsymmetric Lanczos algorithm. SIAM J. Matrix Anal. Appl. 18 3 (1997) 566-589
    • (1997) SIAM J. Matrix Anal. Appl. , vol.18 , Issue.3 , pp. 566-589
    • Day, D.1
  • 7
    • 0030353046 scopus 로고    scopus 로고
    • On restarting the Arnoldi method for large nonsymmetric eigenvalue problems
    • Morgan R.B. On restarting the Arnoldi method for large nonsymmetric eigenvalue problems. Math. Comp. 65 (1996) 1213-1230
    • (1996) Math. Comp. , vol.65 , pp. 1213-1230
    • Morgan, R.B.1
  • 8
    • 33745196196 scopus 로고    scopus 로고
    • SLEPc: a scalable and flexible toolkit for the solution of eigenvalue problems
    • Hernandez V., Roman J.E., and Vidal V. SLEPc: a scalable and flexible toolkit for the solution of eigenvalue problems. ACM Trans. Math. Software 31 3 (2005) 351-362
    • (2005) ACM Trans. Math. Software , vol.31 , Issue.3 , pp. 351-362
    • Hernandez, V.1    Roman, J.E.2    Vidal, V.3
  • 9
    • 34447533031 scopus 로고    scopus 로고
    • V. Hernandez, J.E. Roman, A. Tomas, V. Vidal, SLEPc users manual, Tech. Rep. DSIC-II/24/02 - Revision 2.3.1, D. Sistemas Informáticos y Computación, Universidad Politécnica de Valencia, available from: http://www.grycap.upv.es/slepc, 2006.
  • 10
    • 34447551341 scopus 로고    scopus 로고
    • S. Balay, K. Buschelman, W. Gropp, D. Kaushik, M. Knepley, L.C. McInnes, B. Smith, H. Zhang, PETSc users manual, Tech. Rep. ANL-95/11 - Revision 2.3.1, Argonne National Laboratory, 2006.
  • 11
    • 0002522266 scopus 로고    scopus 로고
    • Parallel empirical pseudopotential electronic structure calculations for million atom systems
    • Canning A., Wang L.W., Williamson A., and Zunger A. Parallel empirical pseudopotential electronic structure calculations for million atom systems. J. Comput. Phys. 160 1 (2000) 29-41
    • (2000) J. Comput. Phys. , vol.160 , Issue.1 , pp. 29-41
    • Canning, A.1    Wang, L.W.2    Williamson, A.3    Zunger, A.4
  • 12
    • 0006551950 scopus 로고
    • A Newton basis GMRES implementation
    • Bai Z., Hu D., and Reichel L. A Newton basis GMRES implementation. IMA J. Numer. Anal. 14 (1994) 563-581
    • (1994) IMA J. Numer. Anal. , vol.14 , pp. 563-581
    • Bai, Z.1    Hu, D.2    Reichel, L.3
  • 13
    • 0031519150 scopus 로고    scopus 로고
    • Alternatives for parallel Krylov subspace basis computation
    • Sidje R.B. Alternatives for parallel Krylov subspace basis computation. Numer. Linear Algebra Appl. 4 4 (1997) 305-331
    • (1997) Numer. Linear Algebra Appl. , vol.4 , Issue.4 , pp. 305-331
    • Sidje, R.B.1
  • 14
    • 0029390218 scopus 로고
    • Reducing the effect of global communication in GMRES(m) and CG on parallel distributed memory computers
    • de Sturler E., and van der Vorst H.A. Reducing the effect of global communication in GMRES(m) and CG on parallel distributed memory computers. App. Numer. Math. 18 4 (1995) 441-459
    • (1995) App. Numer. Math. , vol.18 , Issue.4 , pp. 441-459
    • de Sturler, E.1    van der Vorst, H.A.2
  • 15
    • 0000072705 scopus 로고
    • A parallel GMRES version for general sparse matrices
    • Erhel J. A parallel GMRES version for general sparse matrices. Electron. Trans. Numer. Anal. 3 (1995) 160-176
    • (1995) Electron. Trans. Numer. Anal. , vol.3 , pp. 160-176
    • Erhel, J.1
  • 16
    • 0036447603 scopus 로고    scopus 로고
    • A block orthogonalization procedure with constant synchronization requirements
    • Stathopoulos A., and Wu K. A block orthogonalization procedure with constant synchronization requirements. SIAM J. Sci. Comput. 23 6 (2002) 2165-2182
    • (2002) SIAM J. Sci. Comput. , vol.23 , Issue.6 , pp. 2165-2182
    • Stathopoulos, A.1    Wu, K.2
  • 17
    • 0031210417 scopus 로고    scopus 로고
    • A block variant of the GMRES method on massively parallel processors
    • Li G. A block variant of the GMRES method on massively parallel processors. Parallel Comput. 23 8 (1997) 1005-1019
    • (1997) Parallel Comput. , vol.23 , Issue.8 , pp. 1005-1019
    • Li, G.1
  • 18
    • 0000043097 scopus 로고
    • Implementation of the GMRES method using Householder transformations
    • Walker H.F. Implementation of the GMRES method using Householder transformations. SIAM J. Sci. Statist. Comput. 9 (1988) 152-163
    • (1988) SIAM J. Sci. Statist. Comput. , vol.9 , pp. 152-163
    • Walker, H.F.1
  • 19
    • 0033164616 scopus 로고    scopus 로고
    • Parallel implementation of a multiblock method with approximate subdomain solution
    • Frank J., and Vuik C. Parallel implementation of a multiblock method with approximate subdomain solution. App. Numer. Math. 30 4 (1999) 403-423
    • (1999) App. Numer. Math. , vol.30 , Issue.4 , pp. 403-423
    • Frank, J.1    Vuik, C.2
  • 20
    • 34447498170 scopus 로고    scopus 로고
    • V. Hernandez, J.E. Roman, A. Tomas, V. Vidal, Orthogonalization routines in SLEPc, Tech. Rep. STR-1, available from: http://www.grycap.upv.es/slepc/documentation/reports/str1.pdf, 2006.
  • 21
    • 0034418186 scopus 로고    scopus 로고
    • The influence of orthogonality on the Arnoldi method
    • Braconnier T., Langlois P., and Rioual J.C. The influence of orthogonality on the Arnoldi method. Linear Algebra Appl. 309 1-3 (2000) 307-323
    • (2000) Linear Algebra Appl. , vol.309 , Issue.1-3 , pp. 307-323
    • Braconnier, T.1    Langlois, P.2    Rioual, J.C.3
  • 22
    • 84966217951 scopus 로고
    • Reorthogonalization and stable algorithms for updating the Gram-Schmidt QR factorization
    • Daniel J.W., Gragg W.B., Kaufman L., and Stewart G.W. Reorthogonalization and stable algorithms for updating the Gram-Schmidt QR factorization. Math. Comp. 30 136 (1976) 772-795
    • (1976) Math. Comp. , vol.30 , Issue.136 , pp. 772-795
    • Daniel, J.W.1    Gragg, W.B.2    Kaufman, L.3    Stewart, G.W.4
  • 23
    • 0024776757 scopus 로고
    • Iterative algorithms for Gram-Schmidt orthogonalization
    • Hoffmann W. Iterative algorithms for Gram-Schmidt orthogonalization. Computing 41 4 (1989) 335-348
    • (1989) Computing , vol.41 , Issue.4 , pp. 335-348
    • Hoffmann, W.1
  • 25
    • 0025672456 scopus 로고
    • FORTRAN subroutines for updating the QR decomposition
    • Reichel L., and Gragg W.B. FORTRAN subroutines for updating the QR decomposition. ACM Trans. Math. Software 16 (1990) 369-377
    • (1990) ACM Trans. Math. Software , vol.16 , pp. 369-377
    • Reichel, L.1    Gragg, W.B.2
  • 26
    • 30544434868 scopus 로고    scopus 로고
    • Algorithm 842: a set of GMRES routines for real and complex arithmetics on high performance computers
    • Frayssé V., Giraud L., Gratton S., and Langou J. Algorithm 842: a set of GMRES routines for real and complex arithmetics on high performance computers. ACM Trans. Math. Software 31 2 (2005) 228-238
    • (2005) ACM Trans. Math. Software , vol.31 , Issue.2 , pp. 228-238
    • Frayssé, V.1    Giraud, L.2    Gratton, S.3    Langou, J.4
  • 27
    • 0001609443 scopus 로고
    • Krylov subspace methods on supercomputers
    • Saad Y. Krylov subspace methods on supercomputers. SIAM J. Sci. Statist. Comput. 10 6 (1989) 1200-1232
    • (1989) SIAM J. Sci. Statist. Comput. , vol.10 , Issue.6 , pp. 1200-1232
    • Saad, Y.1
  • 28
    • 0023447568 scopus 로고
    • Multitasking the conjugate gradient method on the CRAY X-MP/48
    • Meurant G. Multitasking the conjugate gradient method on the CRAY X-MP/48. Parallel Comput. 5 3 (1987) 267-280
    • (1987) Parallel Comput. , vol.5 , Issue.3 , pp. 267-280
    • Meurant, G.1
  • 29
    • 30244529284 scopus 로고
    • An efficient parallel algorithm for extreme eigenvalues of sparse nonsymmetric matrices
    • Kim S.K., and Chronopoulos A.T. An efficient parallel algorithm for extreme eigenvalues of sparse nonsymmetric matrices. Int. J. Supercomp. Appl. 6 1 (1992) 98-111
    • (1992) Int. J. Supercomp. Appl. , vol.6 , Issue.1 , pp. 98-111
    • Kim, S.K.1    Chronopoulos, A.T.2
  • 30
    • 34447560660 scopus 로고    scopus 로고
    • V. Hernandez, J.E. Roman, A. Tomas, A parallel variant of the Gram-Schmidt process with reorthogonalization, in: G.R. Joubert, W.E. Nagel, F.J. Peters, O.G. Plata, P. Tirado, E.L. Zapata (Eds.), Proceedings of the International Conference on Parallel Computing (ParCo 2005), vol. 33, Central Institute for Applied Mathematics, Jülich, Germany, 2006, pp. 221-228.
  • 32
    • 34447516433 scopus 로고    scopus 로고
    • Z. Bai, D. Day, J. Demmel, J. Dongarra, A test matrix collection for non-Hermitian eigenvalue problems (release 1.0), Technical Report CS-97-355, Department of Computer Science, University of Tennessee, Knoxville, TN, USA, available from: http://math.nist.gov/MatrixMarket, 1997.
  • 33
    • 34447543637 scopus 로고    scopus 로고
    • T. Davis, University of Florida Sparse Matrix Collection, NA Digest, available from: http://www.cise.ufl.edu/research/sparse/matrices, 1992.
  • 34
    • 38049152894 scopus 로고    scopus 로고
    • V. Hernandez, J.E. Roman, A. Tomas, Evaluation of several variants of explicitly restarted Lanczos eigensolvers and their parallel implementations, in: High Performance Computing for Computational Science - VECPAR 2006, LNCS vol. 4395, 2007, pp. 403-416.


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.