메뉴 건너뛰기




Volumn 29, Issue 11-12 SPEC.ISS., 2003, Pages 1723-1743

Self-adapting software for numerical linear algebra and LAPACK for clusters

Author keywords

High performance computing; LAPACK; Linear algebra; Numerical parallel libraries; PBLAS; ScaLAPACK; Self adapting software

Indexed keywords

ALGORITHMS; COMPUTATIONAL COMPLEXITY; COMPUTER SOFTWARE; LINEAR ALGEBRA; MATHEMATICAL MODELS;

EID: 0242658775     PISSN: 01678191     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.parco.2003.05.014     Document Type: Article
Times cited : (42)

References (69)
  • 1
    • 0000793139 scopus 로고
    • Cramming more components onto integrated circuits
    • Moore G.E. Cramming more components onto integrated circuits. Electronics. 38(8):1965.
    • (1965) Electronics , vol.38 , Issue.8
    • Moore, G.E.1
  • 4
    • 0002687459 scopus 로고    scopus 로고
    • No silver bullet: Essence and accidents of software engineering
    • F.P. Brooks Jr., No silver bullet: essence and accidents of software engineering, Information Processing.
    • Information Processing
    • Brooks Jr., F.P.1
  • 5
    • 0242590446 scopus 로고    scopus 로고
    • Self-adapting numerical software for next generation applications
    • Innovative Computing Laboratory University of Tennessee, August
    • J. Dongarra, V. Eijkhout, Self-adapting numerical software for next generation applications, Technical Report, Innovative Computing Laboratory University of Tennessee, Available from: 〈http://icl.cs.utk.edu/iclprojects/pages/sans.html〉, August 2002.
    • (2002) Technical Report
    • Dongarra, J.1    Eijkhout, V.2
  • 6
    • 0038368778 scopus 로고    scopus 로고
    • Deploying parallel numerical library routines to cluster computing in a self adapting fashion
    • Joubert, Murli, Peters, & Vanneschi. London, England: Imperial College Press
    • Roche K.J., Dongarra J.J. Deploying parallel numerical library routines to cluster computing in a self adapting fashion. Joubert, Murli, Peters, Vanneschi Parallel Computing: Advances and Current Issues. 2002;Imperial College Press, London, England.
    • (2002) Parallel Computing: Advances and Current Issues
    • Roche, K.J.1    Dongarra, J.J.2
  • 9
    • 0024018137 scopus 로고
    • A polynomial approximation scheme for machine scheduling on uniform processors: Using the dual approach
    • Hochbaum D., Shmoys D. A polynomial approximation scheme for machine scheduling on uniform processors: using the dual approach. SIAM Journal of Computing. 17:1988;539-551.
    • (1988) SIAM Journal of Computing , vol.17 , pp. 539-551
    • Hochbaum, D.1    Shmoys, D.2
  • 10
    • 0000438412 scopus 로고
    • Approximation algorithms for scheduling unrelated parallel machines
    • Lenstra J., Shmoys D., Tardos E. Approximation algorithms for scheduling unrelated parallel machines. Mathematical Programming. 46:1990;259-271.
    • (1990) Mathematical Programming , vol.46 , pp. 259-271
    • Lenstra, J.1    Shmoys, D.2    Tardos, E.3
  • 11
    • 84958040361 scopus 로고    scopus 로고
    • Approximation algorithms for dynamic storage allocation
    • Lecture Notes in Computer Science, Springer-Verlag
    • Gergov J. Approximation algorithms for dynamic storage allocation. Proceedings of the 4th Annual European Symposium on Algorithms. Lecture Notes in Computer Science. 1136:1996;52-56 Springer-Verlag.
    • (1996) Proceedings of the 4th Annual European Symposium on Algorithms , vol.1136 , pp. 52-56
    • Gergov, J.1
  • 12
    • 0004493166 scopus 로고    scopus 로고
    • On the approximability of minimizing nonzero variables or unsatisfied relations in linear systems
    • Amaldi E., Kann V. On the approximability of minimizing nonzero variables or unsatisfied relations in linear systems. Theoretical Computer Science. 209:1998;237-260.
    • (1998) Theoretical Computer Science , vol.209 , pp. 237-260
    • Amaldi, E.1    Kann, V.2
  • 13
    • 84947928463 scopus 로고
    • Strong lower bounds on the approximability of some NPO PB - Complete maximization problems
    • Lecture Notes in Computer Science, Springer-Verlag
    • Kann V. Strong lower bounds on the approximability of some NPO PB - complete maximization problems. Proceedings of the 20th International Symposium on Mathematical Foundations of Computer Science. Lecture Notes in Computer Science. 969:1995;227-236 Springer-Verlag.
    • (1995) Proceedings of the 20th International Symposium on Mathematical Foundations of Computer Science , vol.969 , pp. 227-236
    • Kann, V.1
  • 14
    • 34250487811 scopus 로고
    • Gaussian elimination is not optimal
    • Strassen V. Gaussian elimination is not optimal. Numerical Mathematics. 13:1969;354-356.
    • (1969) Numerical Mathematics , vol.13 , pp. 354-356
    • Strassen, V.1
  • 16
    • 0003424372 scopus 로고    scopus 로고
    • Philadelphia: Society for Industrial and Applied Mathematics
    • Demmel J.W. Applied Numerical Linear Algebra. 1997;Society for Industrial and Applied Mathematics, Philadelphia.
    • (1997) Applied Numerical Linear Algebra
    • Demmel, J.W.1
  • 18
    • 0031273280 scopus 로고    scopus 로고
    • Recursion leads to automatic variable blocking for dense linear-algebra algorithms
    • Gustavson F.G. Recursion leads to automatic variable blocking for dense linear-algebra algorithms. IBM Journal of Research and Development. 41(6):1997;737-755.
    • (1997) IBM Journal of Research and Development , vol.41 , Issue.6 , pp. 737-755
    • Gustavson, F.G.1
  • 19
    • 0031496750 scopus 로고    scopus 로고
    • Locality of reference in LU decomposition with partial pivoting
    • Toledo S. Locality of reference in LU decomposition with partial pivoting. SIAM Journal on Matrix Analysis and Applications. 18(4):1997;1065-1081.
    • (1997) SIAM Journal on Matrix Analysis and Applications , vol.18 , Issue.4 , pp. 1065-1081
    • Toledo, S.1
  • 22
    • 0026368758 scopus 로고
    • Using profile information to assist classic code optimization
    • Chang P.P., Mahlke S.A., Hwu W.W. Using profile information to assist classic code optimization. Software Practice and Experience. 21(12):1991;1301-1321.
    • (1991) Software Practice and Experience , vol.21 , Issue.12 , pp. 1301-1321
    • Chang, P.P.1    Mahlke, S.A.2    Hwu, W.W.3
  • 23
    • 85086055276 scopus 로고
    • On the construction of poly-algorithms for automatic numerical analysis
    • M. Klerer, & J. Reinfelds. Academic Press
    • Rice J.R. On the construction of poly-algorithms for automatic numerical analysis. Klerer M., Reinfelds J. Interactive Systems for Experimental Applied Mathematics. 1968;31-313 Academic Press.
    • (1968) Interactive Systems for Experimental Applied Mathematics , pp. 31-313
    • Rice, J.R.1
  • 24
    • 0343462141 scopus 로고    scopus 로고
    • Automated empirical optimizations of software and the ATLAS project
    • Whaley R.C., Petitet A., Dongarra J.J. Automated empirical optimizations of software and the ATLAS project. Parallel Computing. 27(1-2):2001;3-35.
    • (2001) Parallel Computing , vol.27 , Issue.1-2 , pp. 3-35
    • Whaley, R.C.1    Petitet, A.2    Dongarra, J.J.3
  • 37
    • 0031269220 scopus 로고    scopus 로고
    • Improving the memory-system performance of sparse matrix-vector multiplication
    • Toledo S. Improving the memory-system performance of sparse matrix-vector multiplication. IBM Journal of Research and Development. 41(6):1997.
    • (1997) IBM Journal of Research and Development , vol.41 , Issue.6
    • Toledo, S.1
  • 38
    • 3042576437 scopus 로고    scopus 로고
    • Improving performance of sparse matrix-vector multiplication
    • A. Pinar, M.T. Heath, Improving performance of sparse matrix-vector multiplication, in: Proceedings of SC'99, 1999.
    • (1999) Proceedings of SC'99
    • Pinar, A.1    Heath, M.T.2
  • 49
    • 34250261642 scopus 로고
    • Adaptive procedure for estimating parameters for the nonsymmetric Tchebyshev iteration
    • Manteuffel T.A. Adaptive procedure for estimating parameters for the nonsymmetric Tchebyshev iteration. Numerische Mathematik. 31:1978;183-208.
    • (1978) Numerische Mathematik , vol.31 , pp. 183-208
    • Manteuffel, T.A.1
  • 50
    • 0001256129 scopus 로고
    • The Tchebyshev iteration for nonsymmetric linear systems
    • Manteuffel T.A. The Tchebyshev iteration for nonsymmetric linear systems. Numerische Mathematik. 28:1977;307-327.
    • (1977) Numerische Mathematik , vol.28 , pp. 307-327
    • Manteuffel, T.A.1
  • 51
    • 0000659752 scopus 로고
    • A practical termination criterion for the Conjugate Gradient method
    • Kaasschieter E.F. A practical termination criterion for the Conjugate Gradient method. BIT. 28:1988;308-322.
    • (1988) BIT , vol.28 , pp. 308-322
    • Kaasschieter, E.F.1
  • 52
    • 4243917643 scopus 로고
    • Computational variants of the CGS and BiCGSTAB methods
    • Computer Science Department, The University of Tennessee Knoxville, August
    • V. Eijkhout, Computational variants of the CGS and BiCGSTAB methods, Technical Report CS-94-241, Computer Science Department, The University of Tennessee Knoxville, August 1994 (Also LAPACK Working Note No. 78).
    • (1994) Technical Report , vol.CS-94-241
    • Eijkhout, V.1
  • 53
    • 0242422519 scopus 로고    scopus 로고
    • V. Eijkhout, Computational variants of the CGS and BiCGSTAB methods, Technical Report CS-94-241, Computer Science Department, The University of Tennessee Knoxville, August 1994 (Also LAPACK Working Note No. 78).
    • LAPACK Working Note No. 78 , vol.78
  • 54
    • 0003978709 scopus 로고
    • A proposal for a set of parallel basic linear algebra subprograms
    • Technical Report CS-95-292, University of Tennessee Knoxville, May
    • J. Choi, J. Dongarra, S. Ostrouchov, A. Petitet, D. Walker, R.C. Whaley, A proposal for a set of parallel basic linear algebra subprograms, Technical Report CS-95-292, University of Tennessee Knoxville, LAPACK Working Note 100, May 1995.
    • (1995) LAPACK Working Note , vol.100
    • Choi, J.1    Dongarra, J.2    Ostrouchov, S.3    Petitet, A.4    Walker, D.5    Whaley, R.C.6
  • 55
    • 0005713748 scopus 로고    scopus 로고
    • New serial and parallel recursive QR factorization algorithms for SMP systems
    • E. Elmroth, F.G. Gustavson, New serial and parallel recursive QR factorization algorithms for SMP systems, in: Proceedings of PARA 1998, 1998.
    • (1998) Proceedings of PARA 1998
    • Elmroth, E.1    Gustavson, F.G.2
  • 60
    • 0242674324 scopus 로고    scopus 로고
    • MPICH
    • MPICH, Available from: 〈 http://www.mcs.anl.gov/mpi/mpich/〉.
  • 62
    • 0000235223 scopus 로고    scopus 로고
    • The network weather service: A distributed resource performance forecasting service for metacomputing
    • Wolski R., Spring N., Hayes H. The network weather service: a distributed resource performance forecasting service for metacomputing. Future Generation Computing Systems. 14:1998.
    • (1998) Future Generation Computing Systems , vol.14
    • Wolski, R.1    Spring, N.2    Hayes, H.3
  • 63
    • 0242505770 scopus 로고    scopus 로고
    • A framework for performance modeling and prediction
    • IEEE
    • Snavely A.et al. A framework for performance modeling and prediction. Proceedings of Supercomputing 2002. 2002;IEEE.
    • (2002) Proceedings of Supercomputing 2002
    • Snavely, A.1
  • 64
    • 0003487728 scopus 로고
    • High performance fortran language specification
    • H.P. Forum, Center for Research on Parallel Computing, Rice University, Houston, TX, May
    • H.P. Forum, High performance fortran language specification, Technical Report CRPC-TR92225, Center for Research on Parallel Computing, Rice University, Houston, TX, May 1993.
    • (1993) Technical Report , vol.CRPC-TR92225
  • 66
    • 0036467455 scopus 로고    scopus 로고
    • Dense linear algebra kernels on heterogeneous platforms: Redistribution issues
    • Beaumont O., Legrand A., Rastello F., Robert Y. Dense linear algebra kernels on heterogeneous platforms: redistribution issues. Parallel Computing. 28(2):2002;155-185.
    • (2002) Parallel Computing , vol.28 , Issue.2 , pp. 155-185
    • Beaumont, O.1    Legrand, A.2    Rastello, F.3    Robert, Y.4


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.