Volume 50, Issue 2-3, 2006, Pages 223-238

Self-adapting numerical software (SANS) effort

Author keywords

[No Author keywords available]

Indexed keywords

ALGORITHMS; COMPUTER SIMULATION; DECISION THEORY; MANAGEMENT SCIENCE; PARALLEL PROCESSING SYSTEMS;

EID: 33646078910
PISSN: 00188646
EISSN: 00188646
Source Type: Journal
DOI: 10.1147/rd.502.0223
Document Type: Article
Times cited: 28

References (63)
  • 1
    • EID: 84976742719
    • C. L. Lawson, R. J. Hanson, F. T. Krogh, and D. R. Kincaid, "Algorithm 539: Basic Linear Algebra Subprograms for FORTRAN Usage [F1]," ACM Trans. Math. Software 5, No. 3, 324-325 (1979).
  • 2
    • EID: 0343462141
    • R. C. Whaley, A. Petitet, and J. J. Dongarra, "Automated Empirical Optimization of Software and the ATLAS Project," Parallel Computing 27, No. 1/2, 3-35 (2001).
  • 3
    • EID: 0000793139
    • G. E. Moore, "Cramming More Components onto Integrated Circuits," Electronics 38, No. 8, 114-117 (1965).
  • 5
    • EID: 0022874874
    • D. A. Padua and M. J. Wolfe, "Advanced Compiler Optimizations for Supercomputers," Commun. ACM 29, No. 12, 1184-1201 (1986).
  • 7
    • EID: 0003929457
    • R. Schreiber and J. Dongarra, "Automatic Blocking of Nested Loops," Technical Report CS-90-108, Department of Computer Science, University of Tennessee, Knoxville, TN 37996, 1990.
  • 14
    • EID: 0000238336
    • J. A. Nelder and R. Mead, "A Simplex Method for Function Minimization," The Computer J. 7, No. 4, 308-313 (1965).
  • 17
    • EID: 0004493166
    • E. Amaldi and V. Kann, "On the Approximability of Minimizing Nonzero Variables or Unsatisfied Relations in Linear Systems," Theoret. Computer Sci. 209, 237-260 (1998).
  • 21
    • EID: 0024018137
    • D. S. Hochbaum and D. B. Shmoys, "A Polynomial Approximation Scheme for Machine Scheduling on Uniform Processors: Using the Dual Approach," SIAM J. Computing 17, No. 3, 539-551 (1988).
  • 23
    • EID: 0000438412
    • J. Lenstra, D. Shmoys, and E. Tardos, "Approximation Algorithms for Scheduling Unrelated Parallel Machines," Math. Program. 46, No. 3, 259-271 (1990).
  • 24
    • EID: 0038368778
    • K. J. Roche and J. J. Dongarra, "Deploying Parallel Numerical Library Routines to Cluster Computing in a Self-Adapting Fashion," Parallel Computing: Advances and Current Issues, Imperial College Press, London, 2002.
  • 27
    • EID: 0001439335
    • Message Passing Interface Forum, "MPI: A Message-Passing Interface Standard," Intl. J. Supercomputer Appl. & High Perform. Computing 8, No. 3/4, 159-416 (1994).
  • 28
    • EID: 33646105586
    • Message Passing Interface Forum, MPI: A Message-Passing Interface Standard Version 1.1, 1995; see http://www.mpi-forum.org/docs/docs.html.
  • 29
    • EID: 33646113109
    • Message Passing Interface Forum, MPI-2: Extensions to the Message-Passing Interface, 1997; see http://www.mpi-forum.org/docs/mpi2-report.pdf.
  • 30
    • EID: 33646119829
    • MPICH; see http://www.mcs.anl.gov/mpi/mpich/.
  • 31
    • EID: 33646116020
    • LAM/MPI Parallel Computing; see http://www.lam-mpi.org/.
  • 33
    • EID: 0030244536
    • J. Choi, J. J. Dongarra, L. S. Ostrouchov, A. P. Petitet, D. W. Walker, and R. C. Whaley, "Design and Implementation of the ScaLAPACK LU, QR, and Cholesky Factorization Routines," Sci. Program. 5, No. 3, 173-184 (1996).
  • 34
    • EID: 33646092244
    • TOP500 Supercomputer Sites; see http://www.top500.org and http://www.netlib.org/benchmark/top500.html.
  • 36
    • EID: 12444275589
    • V. Eijkhout and E. Fuentes, "A Proposed Standard for Numerical Metadata," Technical Report ICL-UT-03-02, Innovative Computing Laboratory, University of Tennessee, Knoxville, TN 37996, 2003.
  • 37
    • EID: 33646100261
    • Matrix Market; see http://math.nist.gov/MatrixMarket.
  • 39
    • EID: 33646101226
    • The ParMETIS/METIS package; see http://glaros.dtc.umn.edu/gkhome/views/metis/.
  • 40
    • EID: 33646106745
    • V. Eijkhout, "Automatic Determination of Matrix Blocks," Technical Report UT-CS-01-458, Department of Computer Science, University of Tennessee, Knoxville, TN 37996, 2001.
  • 43
    • EID: 0031570636
    • J. S. Plank, Y. Kim, and J. J. Dongarra, "Fault-Tolerant Matrix Operations for Networks of Workstations Using Diskless Checkpointing," J. Parallel & Distr. Computing 43, No. 2, 125-138 (1997).
  • 44
    • EID: 31844452364
    • G. Bosilca, Z. Chen, J. Dongarra, and J. Langou, "Recovery Patterns for Iterative Methods in a Parallel Unstable Environment," Technical Report UT-CS-04-538, Computer Science Department, University of Tennessee, Knoxville, TN 37996, 2004.
  • 46
    • EID: 0037447584
    • P. Sanders and J. F. Sibeyn, "A Bandwidth Latency Tradeoff for Broadcast and Reduction," Info. Process. Lett. 86, No. 1, 33-38 (2003).
  • 47
    • EID: 33646079822
    • C. Engelmann and G. A. Geist, "Development of Naturally Fault Tolerant Algorithms for Computing on 100,000 Processors," see http://www.csm.ornl.gov/~geist/Lyon2002-geist.pdf.
  • 51
    • EID: 0028401457
    • R. W. Hockney, "The Communication Challenge for MPP: Intel Paragon and Meiko CS-2," Parallel Computing 20, No. 3, 389-398 (March 1994).
  • 57
    • EID: 34548696258
    • R. Rabenseifner and J. L. Träff, "More Efficient Reduction Algorithms for Non-Power-of-Two Number of Processors in Message-Passing Parallel Systems," Proceedings of the 11th European PVM/MPI Users' Group Meeting, 2004, pp. 36-46.
  • 60
    • EID: 0141732229
    • M. Bernaschi, G. Iannello, and M. Lauria, "Efficient Implementation of Reduce-Scatter in MPI," J. Syst. Arch. 49, No. 3, 89-108 (2003).


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.