메뉴 건너뛰기




Volumn 22, Issue 3, 1994, Pages 523-537

Scalability issues affecting the design of a dense linear algebra library

Author keywords

[No Author keywords available]

Indexed keywords


EID: 0000778168     PISSN: 07437315     EISSN: None     Source Type: Journal    
DOI: 10.1006/jpdc.1994.1108     Document Type: Article
Times cited : (48)

References (49)
  • 8
    • 0025997771 scopus 로고
    • Using Strassen’s algorithm to accelerate the solution of linear systems
    • Bailey, D. H., Lee, K., and Simon, H. D. Using Strassen’s algorithm to accelerate the solution of linear systems. J. Supercomputing 4 (1990), 357-371.
    • (1990) J. Supercomputing , vol.4 , pp. 357-371
    • Bailey, D.H.1    Lee, K.2    Simon, H.D.3
  • 9
    • 3042648854 scopus 로고    scopus 로고
    • The LINPACK benchmark on the AP 1000: Preliminary report
    • Brent, R. P. The LINPACK benchmark on the AP 1000: Preliminary report. Proceedings of the 2nd CAP Workshop. Nov. 1991.
    • Proceedings of the 2Nd CAP Workshop , pp. 1991
    • Brent, R.P.1
  • 12
    • 35248831050 scopus 로고
    • Electromagnetic scattering calculations on the Intel Touchstone Delta
    • IEEE Comput. Soc. Press
    • Cwik, T., Patterson, J., and Scott, D. Electromagnetic scattering calculations on the Intel Touchstone Delta. Proceedings of Supercomputing '92. IEEE Comput. Soc. Press, 1992. pp. 538-542.
    • (1992) Proceedings of Supercomputing 92 , pp. 538-542
    • Cwik, T.1    Patterson, J.2    Scott, D.3
  • 14
    • 84947657247 scopus 로고
    • LINPACK benchmark: Performance of various computers using standard linear equations software
    • Dongarra, J. J. LINPACK benchmark: Performance of various computers using standard linear equations software. Supercomputing Rev. 5, 3 (March 1992), 54-63.
    • (1992) Supercomputing Rev , vol.5 , Issue.3 , pp. 54-63
    • Dongarra, J.J.1
  • 22
    • 0004060334 scopus 로고
    • Two-dimensional basic linear algebra communication subprograms
    • Computer Science Department, University of Tennessee. Knoxville, TN
    • Dongarra, J. J., and van de Geijn, R. A. Two-dimensional basic linear algebra communication subprograms. Technical Report LAPACK working note 37, Computer Science Department, University of Tennessee. Knoxville, TN, Oct. 1991.
    • (1991) Technical Report LAPACK Working Note , vol.37
    • Dongarra, J.J.1    Van De Geijn, R.A.2
  • 23
    • 0026912004 scopus 로고
    • Reduction to condensed form for the eigenvalue problem on distributed memory architectures
    • Dongarra, J. J. and van de Geijn, R. A. Reduction to condensed form for the eigenvalue problem on distributed memory architectures. Parallel Comput. 18 (1992), 973-982.
    • (1992) Parallel Comput , vol.18 , pp. 973-982
    • Dongarra, J.J.1    Van De Geijn, R.A.2
  • 25
    • 0002663082 scopus 로고
    • GEMMW: A portable level 3 BLAS Winograd variant of Strassen's matrix-matrix multiply algorithm
    • Douglas, C. C., Heroux, M., Slishman, G., and Smith, R. M. GEMMW: A portable level 3 BLAS Winograd variant of Strassen's matrix-matrix multiply algorithm. J. Comput. Phys. 110 (1994), 1-10.
    • (1994) J. Comput. Phys. , vol.110 , pp. 1-10
    • Douglas, C.C.1    Heroux, M.2    Slishman, G.3    Smith, R.M.4
  • 26
    • 84888771978 scopus 로고
    • Large dense numerical linear algebra in 1993: The parallel computing influence
    • Edelman, A. Large dense numerical linear algebra in 1993: The parallel computing influence. Int. J. Supercomputing Appl. 7, 2 (1993).
    • (1993) Int. J. Supercomputing Appl. , vol.2 , pp. 7
    • Edelman, A.1
  • 27
  • 31
    • 0027644684 scopus 로고
    • The scalability of FFT on parallel computers
    • A detailed version is available as Technical Report TR 90-53, Department of Computer Science, University of Minnesota. MN 55455
    • Gupta, A., and Kumar, V. The scalability of FFT on parallel computers. IEEE Trans. Parallel Distrib. Systems 4, 7 (July 1993). A detailed version is available as Technical Report TR 90-53, Department of Computer Science, University of Minnesota. MN 55455.
    • (1993) IEEE Trans. Parallel Distrib. Systems , vol.4 , Issue.7
    • Gupta, A.1    Kumar, V.2
  • 32
    • 0024012163 scopus 로고
    • Reevaluating Amdahl's law
    • Gustafson, J. Reevaluating Amdahl's law. Comm. ACM 31, 5 (1988), 532-533.
    • (1988) Comm. ACM , vol.31 , Issue.5 , pp. 532-533
    • Gustafson, J.1
  • 36
    • 0025637437 scopus 로고
    • Exploiting fast matrix multiplication within the level 3 BLAS
    • Higham, N. J. Exploiting fast matrix multiplication within the level 3 BLAS. ACM Trans. Math. Software 16, 4 (1990), 352-368.
    • (1990) ACM Trans. Math. Software , vol.16 , Issue.4 , pp. 352-368
    • Higham, N.J.1
  • 39
    • 3543092493 scopus 로고
    • Analyzing scalability of parallel algorithms and architectures. Technical report, TR-91-18. Computer Science Department, University of Minnesota. June 1991
    • A short version of the paper, Urbana, IL, Oct, 1991
    • Kumar, V., and Gupta, A. Analyzing scalability of parallel algorithms and architectures. Technical report, TR-91-18. Computer Science Department, University of Minnesota. June 1991. J. Parallel Distrib. Comput. 22, 3 (1994) 379-391. A short version of the paper appears in the Proceedings of the 1991 International Conference on Supercomputing. Germany, and as an invited paper in the Proceedings of the 29th Annual Allerton Conference on Communication, Control and Computing. Urbana, IL, Oct. 1991.
    • (1994) J. Parallel Distrib. Comput , vol.22 , Issue.3 , pp. 379-391
    • Kumar, V.1    Gupta, A.2
  • 45
    • 34250487811 scopus 로고
    • Gaussian elimination is not optimal
    • Strassen, V. Gaussian elimination is not optimal. Ntuner. Math. 13 (1969), 354-356.
    • (1969) Ntuner. Math. , vol.13 , pp. 354-356
    • Strassen, V.1
  • 46
    • 0002853545 scopus 로고
    • Scalable problems and memory-bounded speedup
    • Sun, X.-H., and Ni, L. Scalable problems and memory-bounded speedup. J. Parallel Distrib. Computing 19, 1 (1993). 27-37.
    • (1993) J. Parallel Distrib. Computing , vol.19 , Issue.1 , pp. 27-37
    • Sun, X.-H.1    Ni, L.2
  • 47
    • 85027614460 scopus 로고
    • Cambridge, MA
    • Thinking Machines Corporation. CMS Technical Summary. Cambridge, MA, 1991.
    • (1991) CMS Technical Summary
  • 49
    • 0025639404 scopus 로고
    • Data redistribution and concurrency
    • Van de Velde, E. F. Data redistribution and concurrency. Parallel Comput. 16 (Dec 1990).
    • (1990) Parallel Comput , pp. 16
    • Van De Velde, E.F.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.