메뉴 건너뛰기




Volumn 15, Issue 9, 2003, Pages 803-820

The LINPACK benchmark: Past, present and future

Author keywords

Benchmarking; BLAS; High performance computing; HPL; Linear algebra; LINPACK; TOP500

Indexed keywords

ALGORITHMS; BENCHMARKING; CODES (SYMBOLS); COMPUTATIONAL COMPLEXITY; DIGITAL ARITHMETIC; EXTRAPOLATION; FORTRAN (PROGRAMMING LANGUAGE); LINEAR EQUATIONS; MATRIX ALGEBRA; PARALLEL PROCESSING SYSTEMS; VECTORS;

EID: 0042674307     PISSN: 15320626     EISSN: None     Source Type: Journal    
DOI: 10.1002/cpe.728     Document Type: Article
Times cited : (644)

References (57)
  • 1
    • 0042895870 scopus 로고    scopus 로고
    • The LINPACK 1000x1000 benchmark program
    • The LINPACK 1000x1000 benchmark program. (http://www.netlib.org/benchmark/1000d for source code.).
  • 4
    • 0003533609 scopus 로고    scopus 로고
    • Performance of various computers using standard linear equations software
    • Technical Report CS-89-85, University of Tennessee
    • Dongarra JJ. Performance of various computers using standard linear equations software. Technical Report CS-89-85, University of Tennessee, 2002. (An updated version of this report can be found at benchmark/performance.ps).
    • (2002)
    • Dongarra, J.J.1
  • 5
    • 0015279213 scopus 로고
    • Some notes on speeding up certain loops by software, firmware, and hardware means
    • Pager D. Some notes on speeding up certain loops by software, firmware, and hardware means. IEEE Transactions on Computers 1972; 97-100.
    • (1972) IEEE Transactions on Computers , pp. 97-100
    • Pager, D.1
  • 7
    • 84983965442 scopus 로고
    • An empirical study of Fortran programs
    • Knuth D. An empirical study of Fortran programs. Software-Practice and Experience 1971; 1:105-133.
    • (1971) Software-Practice and Experience , vol.1 , pp. 105-133
    • Knuth, D.1
  • 10
    • 0042895868 scopus 로고
    • Implementing dense linear algebra algorithms using mutlitasking on CRAY X-MP-4
    • Dongarra JJ, Hewitt T. Implementing dense linear algebra algorithms using mutlitasking on CRAY X-MP-4. SIAM Journal of Science and Statistics in Computing 1986; 7(1):347-350.
    • (1986) SIAM Journal of Science and Statistics in Computing , vol.7 , Issue.1 , pp. 347-350
    • Dongarra, J.J.1    Hewitt, T.2
  • 11
    • 84909708535 scopus 로고
    • Linear algebra on high-performance computers
    • Schendel U (ed.). North Holland
    • Dongarra JJ, Sorensen DC. Linear algebra on high-performance computers. Proceedings Parallel Computing 85, Schendel U (ed.). North Holland, 1986; 3-32.
    • (1986) Proceedings Parallel Computing , vol.85 , pp. 3-32
    • Dongarra, J.J.1    Sorensen, D.C.2
  • 13
    • 0012062653 scopus 로고
    • Solution of simultaneous linear equations using a magnetic tape store
    • Barron DW, Swinnerton-Dyer HPF. Solution of simultaneous linear equations using a magnetic tape store. Computer J. 1990; 3:28-33.
    • (1990) Computer J. , vol.3 , pp. 28-33
    • Barron, D.W.1    Swinnerton-Dyer, H.P.F.2
  • 14
    • 0022862642 scopus 로고
    • Block-oriented local-memory-based linear equation solution on the CRAY-2: Uniprocessor algorithms
    • Schendel U (ed). IEEE Computer Society Press
    • Calahan DA. Block-oriented local-memory-based linear equation solution on the CRAY-2: Uniprocessor algorithms. Proceedings International Conference on Parallel Processing, August 1986, Schendel U (ed). IEEE Computer Society Press, 1986; 375-378.
    • (1986) Proceedings International Conference on Parallel Processing, August 1986 , pp. 375-378
    • Calahan, D.A.1
  • 15
    • 0042895869 scopus 로고
    • Adaption of the Jacobi and Givens methods for a computer with magnetic tape backup store
    • Technical Report 8, University of Sydney
    • Chartres B. Adaption of the Jacobi and Givens methods for a computer with magnetic tape backup store. Technical Report 8, University of Sydney, 1960.
    • (1960)
    • Chartres, B.1
  • 16
    • 0041392786 scopus 로고
    • Sparse matrix calculations on the CRAY-2
    • Technical Report CSS 197, AERE Harwell
    • Dave AK, Duff IS. Sparse matrix calculations on the CRAY-2. Technical Report CSS 197, AERE Harwell, 1986.
    • (1986)
    • Dave, A.K.1    Duff, I.S.2
  • 18
    • 84945709131 scopus 로고
    • Organizing matrices and matrix operations for paged memory systems
    • McKellar AC, Coffman EG Jr. Organizing matrices and matrix operations for paged memory systems. Communications of the ACM 1969; 12(3):153-165.
    • (1969) Communications of the ACM , vol.12 , Issue.3 , pp. 153-165
    • McKellar, A.C.1    Coffman E.G., Jr.2
  • 20
    • 0001951009 scopus 로고
    • The WY representation for products of householder matrices
    • Bischof C, Van Loan CF. The WY representation for products of Householder matrices. SIAM SISSC 1987; 8(2).
    • (1987) SIAM SISSC , vol.8 , Issue.2
    • Bischof, C.1    Van Loan, C.F.2
  • 21
    • 0041893750 scopus 로고
    • Linear algebra programs for use on a vector computer with a secondary solid state storage device
    • Vichnevetsky R, Stepleman R (eds). IMACS
    • Bucher I, Jordan T. Linear algebra programs for use on a vector computer with a secondary solid state storage device. Advances in Computer Methods for Practical Differential Equations, Vichnevetsky R, Stepleman R (eds). IMACS, 1984; 546-550.
    • (1984) Advances in Computer Methods for Practical Differential Equations , pp. 546-550
    • Bucher, I.1    Jordan, T.2
  • 23
    • 0042895867 scopus 로고
    • The LU decomposition algorithm and its efficient Fortran implementation on the IBM 3090 vector multiprocessor
    • Technical Report ECSEC Report ICE-0006, IBM, March
    • Robert Y, Suguazerro P. The LU decomposition algorithm and its efficient Fortran implementation on the IBM 3090 vector multiprocessor. Technical Report ECSEC Report ICE-0006, IBM, March 1987.
    • (1987)
    • Robert, Y.1    Suguazerro, P.2
  • 27
    • 0012305554 scopus 로고
    • Auxiliary storage methods for solving finite element systems
    • George A, Rashwan H. Auxiliary storage methods for solving finite element systems. SIAM SISSC 1985; 6:882-910.
    • (1985) SIAM SISSC , vol.6 , pp. 882-910
    • George, A.1    Rashwan, H.2
  • 28
    • 0031273280 scopus 로고    scopus 로고
    • Recursion leads to automatic variable blocking for dense linear-algebra algorithms
    • Gustavson FG. Recursion leads to automatic variable blocking for dense linear-algebra algorithms. IBM Journal of Research and Development 1997; 41(6):737-755.
    • (1997) IBM Journal of Research and Development , vol.41 , Issue.6 , pp. 737-755
    • Gustavson, F.G.1
  • 29
    • 0031496750 scopus 로고    scopus 로고
    • Locality of reference in LU decomposition with partial pivoting
    • Toledo S. Locality of reference in LU decomposition with partial pivoting. SIAM Journal on Matrix Analysis and Applications 1997; 18(4).
    • (1997) SIAM Journal on Matrix Analysis and Applications , vol.18 , Issue.4
    • Toledo, S.1
  • 32
    • 0343462141 scopus 로고    scopus 로고
    • Automated empirical optimization of software and the Atlas project
    • Dongarra JJ, Petitet A, Whaley RC. Automated empirical optimization of software and the Atlas project. Parallel Computing 2001; 27(1-2):3-25.
    • (2001) Parallel Computing , vol.27 , Issue.1-2 , pp. 3-25
    • Dongarra, J.J.1    Petitet, A.2    Whaley, R.C.3
  • 35
    • 0042394908 scopus 로고
    • Ahrendt G. Weekly postings to comp.sys.super [1993].
    • (1993)
    • Ahrendt, G.1
  • 36
    • 0041893749 scopus 로고
    • Kahaner report on supercomputer in Japan
    • Technical Report, The Computer Science Department, University of Arizona
    • Kahaner D. Kahaner report on supercomputer in Japan. Technical Report, The Computer Science Department, University of Arizona, 1992. ftp://ftp.cs.arizona.edu/japan/kahaner.reports/jsuper.92.
    • (1992)
    • Kahaner, D.1
  • 37
    • 0025997771 scopus 로고
    • Using Strassen's algorithm to accelerate the solution of linear systems
    • Bailey D, Lee K, Simon H. Using Strassen's algorithm to accelerate the solution of linear systems. Journal of Supercomputing 1990; 4:357-371.
    • (1990) Journal of Supercomputing , vol.4 , pp. 357-371
    • Bailey, D.1    Lee, K.2    Simon, H.3
  • 38
    • 0030092443 scopus 로고    scopus 로고
    • A high performance parallel Strassen implementation
    • Grayson B, van de Geijn R. A high performance parallel Strassen implementation. Parallel Processing Letters 1996; 6(1):3-12.
    • (1996) Parallel Processing Letters , vol.6 , Issue.1 , pp. 3-12
    • Grayson, B.1    Van De Geijn, R.2
  • 39
    • 0041893748 scopus 로고    scopus 로고
    • Using Strassen's matrix multiplication in high performance solution of linear systems
    • Paprzycki M, Cyphers C. Using Strassen's matrix multiplication in high performance solution of linear systems. Computers and Mathematics with Applications 1996; 31(4/5):55-61.
    • (1996) Computers and Mathematics with Applications , vol.31 , Issue.4-5 , pp. 55-61
    • Paprzycki, M.1    Cyphers, C.2
  • 40
    • 34250487811 scopus 로고
    • Gaussian elimination is not optimal
    • Strassen V. Gaussian elimination is not optimal. Numerical Mathematics 1969; 13:354-356.
    • (1969) Numerical Mathematics , vol.13 , pp. 354-356
    • Strassen, V.1
  • 43
    • 0040675695 scopus 로고
    • High performance fortran language specification. Version 1.1
    • High Performance Fortran Forum; Technical Report, Rice University, November
    • High Performance Fortran Forum. High Performance Fortran Language specification. Version 1.1. Technical Report, Rice University, November 1994.
    • (1994)
  • 44
    • 0003565849 scopus 로고    scopus 로고
    • High performance fortran language specification. Version 2.0
    • High Performance Fortran Forum; Technical Report, Rice University, January
    • High Performance Fortran Forum. High Performance Fortran Language specification. version 2.0. Technical Report, Rice University, January 1997.
    • (1997)
  • 48
    • 84871844180 scopus 로고    scopus 로고
    • Openmp: Simple, portable, scalable smp programming
    • Openmp: Simple, portable, scalable smp programming. http://www.openmp.org/.
  • 51
    • 0003413675 scopus 로고
    • MPI: A message-passing interface standard (version 1.1)
    • Message Passing Interface Forum
    • Message Passing Interface Forum. MPI: A Message-Passing Interface Standard (version 1.1). http://www.mpi-forum.org/ [1995].
    • (1995)
  • 52
    • 0003604499 scopus 로고    scopus 로고
    • MPI-2: Extensions to the message-passing interface
    • Message Passing Interface Forum; [July]
    • Message Passing Interface Forum. MPI-2: Extensions to the Message-Passing Interface. http://www.mpi-forum.org/ [July 1997].
    • (1997)
  • 54
    • 0000778168 scopus 로고
    • Scalability issues in the design of a library for dense linear algebra
    • (Also LAPACK Working Note No. 43)
    • Dongarra JJ, van de Geijn R, Walker DW. Scalability issues in the design of a library for dense linear algebra. Journal of Parallel and Distributed Computing 1994: 22(3):523-537. (Also LAPACK Working Note No. 43).
    • (1994) Journal of Parallel and Distributed Computing , vol.22 , Issue.3 , pp. 523-537
    • Dongarra, J.J.1    Van De Geijn, R.2    Walker, D.W.3
  • 55
    • 0042895859 scopus 로고    scopus 로고
    • Massively parallel LINPACK benchmark on the intel touchstone DELTA and iPSC/860 systems
    • Intel Supercomputer Users Group, 1991
    • van de Geijn R. Massively parallel LINPACK Benchmark on the Intel Touchstone DELTA and iPSC/860 systems. 1991 Annual Users Conference Proceedings, Dallas, Texas. Intel Supercomputer Users Group, 1991.
    • 1991 Annual Users Conference Proceedings, Dallas, Texas
    • Van De Geijn, R.1
  • 56
    • 0031123769 scopus 로고    scopus 로고
    • SUMMA: Scalable universal matrix multiplication algorithm
    • van de Geijn R, Watts J. SUMMA: Scalable universal matrix multiplication algorithm. Concurrency: Practice and Experience 1997; 9(4):255-274.
    • (1997) Concurrency: Practice and Experience , vol.9 , Issue.4 , pp. 255-274
    • Van De Geijn, R.1    Watts, J.2
  • 57
    • 0000793139 scopus 로고
    • Cramming more components onto integrated circuits
    • Moore GE. Cramming more components onto integrated circuits. Electronics 1965; 38(8).
    • (1965) Electronics , vol.38 , Issue.8
    • Moore, G.E.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.