SCOPUS 정보 검색 플랫폼

Concurrency and Computation: Practice and Experience

Volumn 15, Issue 9, 2003, Pages 803-820

The LINPACK benchmark: Past, present and future

(3) Dongarra, Jack J a Luszczek, Piotr a Petite, Antoine b

a University of Tennessee (United States)

b SUN MICROSYSTEMS (United States)

Author keywords

Benchmarking; BLAS; High performance computing; HPL; Linear algebra; LINPACK; TOP500

Indexed keywords

ALGORITHMS; BENCHMARKING; CODES (SYMBOLS); COMPUTATIONAL COMPLEXITY; DIGITAL ARITHMETIC; EXTRAPOLATION; FORTRAN (PROGRAMMING LANGUAGE); LINEAR EQUATIONS; MATRIX ALGEBRA; PARALLEL PROCESSING SYSTEMS; VECTORS;

BASIC LINEAR ALGEBRA SUBROUTINES; HIGH PARALLEL COMPUTING; HIGH PERFORMANCE COMPUTING; SOFTWARE PACKAGE LINPACK;

COMPUTER SOFTWARE SELECTION AND EVALUATION;

EID: 0042674307 PISSN: 15320626 EISSN: None Source Type: Journal
DOI: 10.1002/cpe.728 Document Type: Article

Times cited : (665)

References (57)

1
- 0042895870
- The LINPACK 1000x1000 benchmark program
- The LINPACK 1000x1000 benchmark program. (http://www.netlib.org/benchmark/1000d for source code.).

2
- 0003555195
- SIAM: Philadelphia, PA
- Dongarra JJ, Bunch J, Moller C, Stewart GW. LINPACK User's Guide. SIAM: Philadelphia, PA, 1979.
- (1979) LINPACK User's Guide
- Dongarra, J.J.¹ Bunch, J.² Moller, C.³ Stewart, G.W.⁴

3
- 0018515759
- Basic linear algebra subprograms for fortran usage
- Lawson C, Hanson R, Kincaid D, Krogh F. Basic Linear Algebra Subprograms for Fortran usage. ACM Transactions on Mathematical Software 1979;5:308-323.
- (1979) ACM Transactions on Mathematical Software , vol.5 , pp. 308-323
- Lawson, C.¹ Hanson, R.² Kincaid, D.³ Krogh, F.⁴

4
- 0003533609
- Performance of various computers using standard linear equations software
- Technical Report CS-89-85, University of Tennessee
- Dongarra JJ. Performance of various computers using standard linear equations software. Technical Report CS-89-85, University of Tennessee, 2002. (An updated version of this report can be found at benchmark/performance.ps).
- (2002)
- Dongarra, J.J.¹

5
- 0015279213
- Some notes on speeding up certain loops by software, firmware, and hardware means
- Pager D. Some notes on speeding up certain loops by software, firmware, and hardware means. IEEE Transactions on Computers 1972; 97-100.
- (1972) IEEE Transactions on Computers , pp. 97-100
- Pager, D.¹

6
- 0018440818
- Unrolling loops in Fortran
- Dongarra JJ, Hinds A. Unrolling loops in Fortran. Software-Practice and Experience 1979; 9:219-226.
- (1979) Software-Practice and Experience , vol.9 , pp. 219-226
- Dongarra, J.J.¹ Hinds, A.²

7
- 84983965442
- An empirical study of Fortran programs
- Knuth D. An empirical study of Fortran programs. Software-Practice and Experience 1971; 1:105-133.
- (1971) Software-Practice and Experience , vol.1 , pp. 105-133
- Knuth, D.¹

8
- 0023983122
- An extended set of Fortran basic linear algebra subprograms
- Dongarra JJ, Du Croz J, Hammarling S, Hanson R. An extended set of Fortran Basic Linear Algebra Subprograms. ACM Transactions on Mathematical Software 1988; 14:1-17.
- (1988) ACM Transactions on Mathematical Software , vol.14 , pp. 1-17
- Dongarra, J.J.¹ Du Croz, J.² Hammarling, S.³ Hanson, R.⁴

9
- 0003851784
- SIAM: Philadelphia, PA
- Dongarra JJ, Duff IS, Sorensen DC, van der Vorst HA. Numerical Linear Algebra for High-Performance Computers. SIAM: Philadelphia, PA, 1998.
- (1998) Numerical Linear Algebra for High-Performance Computers
- Dongarra, J.J.¹ Duff, I.S.² Sorensen, D.C.³ Van Der Vorst, H.A.⁴

10
- 0042895868
- Implementing dense linear algebra algorithms using mutlitasking on CRAY X-MP-4
- Dongarra JJ, Hewitt T. Implementing dense linear algebra algorithms using mutlitasking on CRAY X-MP-4. SIAM Journal of Science and Statistics in Computing 1986; 7(1):347-350.
- (1986) SIAM Journal of Science and Statistics in Computing , vol.7 , Issue.1 , pp. 347-350
- Dongarra, J.J.¹ Hewitt, T.²

11
- 84909708535
- Linear algebra on high-performance computers
- Schendel U (ed.). North Holland
- Dongarra JJ, Sorensen DC. Linear algebra on high-performance computers. Proceedings Parallel Computing 85, Schendel U (ed.). North Holland, 1986; 3-32.
- (1986) Proceedings Parallel Computing , vol.85 , pp. 3-32
- Dongarra, J.J.¹ Sorensen, D.C.²

12
- 0025402476
- A set of level 3 Fortran basic linear algebra subprograms
- Dongarra JJ, Du Croz J, Duff IS, Hammarling S. A set of Level 3 Fortran Basic Linear Algebra Subprograms. ACM Transactions on Mathematical Software 1990; 16:1-17.
- (1990) ACM Transactions on Mathematical Software , vol.16 , pp. 1-17
- Dongarra, J.J.¹ Du Croz, J.² Duff, I.S.³ Hammarling, S.⁴

13
- 0012062653
- Solution of simultaneous linear equations using a magnetic tape store
- Barron DW, Swinnerton-Dyer HPF. Solution of simultaneous linear equations using a magnetic tape store. Computer J. 1990; 3:28-33.
- (1990) Computer J. , vol.3 , pp. 28-33
- Barron, D.W.¹ Swinnerton-Dyer, H.P.F.²

14
- 0022862642
- Block-oriented local-memory-based linear equation solution on the CRAY-2: Uniprocessor algorithms
- Schendel U (ed). IEEE Computer Society Press
- Calahan DA. Block-oriented local-memory-based linear equation solution on the CRAY-2: Uniprocessor algorithms. Proceedings International Conference on Parallel Processing, August 1986, Schendel U (ed). IEEE Computer Society Press, 1986; 375-378.
- (1986) Proceedings International Conference on Parallel Processing, August 1986 , pp. 375-378
- Calahan, D.A.¹

15
- 0042895869
- Adaption of the Jacobi and Givens methods for a computer with magnetic tape backup store
- Technical Report 8, University of Sydney
- Chartres B. Adaption of the Jacobi and Givens methods for a computer with magnetic tape backup store. Technical Report 8, University of Sydney, 1960.
- (1960)
- Chartres, B.¹

16
- 0041392786
- Sparse matrix calculations on the CRAY-2
- Technical Report CSS 197, AERE Harwell
- Dave AK, Duff IS. Sparse matrix calculations on the CRAY-2. Technical Report CSS 197, AERE Harwell, 1986.
- (1986)
- Dave, A.K.¹ Duff, I.S.²

17
- 0019661013
- Solving large full sets of linear equations in a paged virtual store
- DuCroz J, Nugent S, Reid J, Taylor D. Solving large full sets of linear equations in a paged virtual store. ACM Transactions on Mathematical Software 1981; 7(4):527-536.
- (1981) ACM Transactions on Mathematical Software , vol.7 , Issue.4 , pp. 527-536
- DuCroz, J.¹ Nugent, S.² Reid, J.³ Taylor, D.⁴

18
- 84945709131
- Organizing matrices and matrix operations for paged memory systems
- McKellar AC, Coffman EG Jr. Organizing matrices and matrix operations for paged memory systems. Communications of the ACM 1969; 12(3):153-165.
- (1969) Communications of the ACM , vol.12 , Issue.3 , pp. 153-165
- McKellar, A.C.¹ Coffman E.G., Jr.²

19
- 31044454349
- Parallel algorithms on Cedar system
- Technical Report Report No. 581, CSRD
- Berry M, Gallivan K, Harrod W, Jalby W, Lo S, Meier U, Philippe B, Sameh A. Parallel algorithms on Cedar system. Technical Report Report No. 581, CSRD, 1986.
- (1986)
- Berry, M.¹ Gallivan, K.² Harrod, W.³ Jalby, W.⁴ Lo, S.⁵ Meier, U.⁶ Philippe, B.⁷ Sameh, A.⁸

20
- 0001951009
- The WY representation for products of householder matrices
- Bischof C, Van Loan CF. The WY representation for products of Householder matrices. SIAM SISSC 1987; 8(2).
- (1987) SIAM SISSC , vol.8 , Issue.2
- Bischof, C.¹ Van Loan, C.F.²

21
- 0041893750
- Linear algebra programs for use on a vector computer with a secondary solid state storage device
- Vichnevetsky R, Stepleman R (eds). IMACS
- Bucher I, Jordan T. Linear algebra programs for use on a vector computer with a secondary solid state storage device. Advances in Computer Methods for Practical Differential Equations, Vichnevetsky R, Stepleman R (eds). IMACS, 1984; 546-550.
- (1984) Advances in Computer Methods for Practical Differential Equations , pp. 546-550
- Bucher, I.¹ Jordan, T.²

22
- 0042895866
- IBM; IBM; Program Number: 5668-863
- IBM. Engineering and Scientific Subroutine Library. IBM, 1986. Program Number: 5668-863.
- (1986) Engineering and Scientific Subroutine Library

23
- 0042895867
- The LU decomposition algorithm and its efficient Fortran implementation on the IBM 3090 vector multiprocessor
- Technical Report ECSEC Report ICE-0006, IBM, March
- Robert Y, Suguazerro P. The LU decomposition algorithm and its efficient Fortran implementation on the IBM 3090 vector multiprocessor. Technical Report ECSEC Report ICE-0006, IBM, March 1987.
- (1987)
- Robert, Y.¹ Suguazerro, P.²

24
- 0042394909
- SAXPY Computer Corporation, 255 San Geronimo Way, Sunnyvale, CA 94086
- Schreiber R. Engineering and Scientific Subroutine Library, Module Design Specification. SAXPY Computer Corporation, 255 San Geronimo Way, Sunnyvale, CA 94086, 1986.
- (1986) Engineering and Scientific Subroutine Library, Module Design Specification
- Schreiber, R.¹

25
- 0035176737
- Recursive approach in sparse matrix LU factorization
- Dongarra JJ, Eijkhout V, Luszczek P. Recursive approach in sparse matrix LU factorization. Scientific Programming 2001; 9(1):51-60.
- (2001) Scientific Programming , vol.9 , Issue.1 , pp. 51-60
- Dongarra, J.J.¹ Eijkhout, V.² Luszczek, P.³

26
- 0013136934
- Full matrix techniques in sparse Gaussian elimination
- Duff IS. Full matrix techniques in sparse Gaussian elimination. Numerical Analysis Proceedings, Dundee, 1981 (Lecture Notes in Mathematics, vol. 912). Springer: Berlin, 1981; 71-84.
- Numerical Analysis Proceedings, Dundee, 1981 (Lecture Notes in Mathematics, Vol. 912). Springer Berlin, 1981 , pp. 71-84
- Duff, I.S.¹

27
- 0012305554
- Auxiliary storage methods for solving finite element systems
- George A, Rashwan H. Auxiliary storage methods for solving finite element systems. SIAM SISSC 1985; 6:882-910.
- (1985) SIAM SISSC , vol.6 , pp. 882-910
- George, A.¹ Rashwan, H.²

28
- 0031273280
- Recursion leads to automatic variable blocking for dense linear-algebra algorithms
- Gustavson FG. Recursion leads to automatic variable blocking for dense linear-algebra algorithms. IBM Journal of Research and Development 1997; 41(6):737-755.
- (1997) IBM Journal of Research and Development , vol.41 , Issue.6 , pp. 737-755
- Gustavson, F.G.¹

29
- 0031496750
- Locality of reference in LU decomposition with partial pivoting
- Toledo S. Locality of reference in LU decomposition with partial pivoting. SIAM Journal on Matrix Analysis and Applications 1997; 18(4).
- (1997) SIAM Journal on Matrix Analysis and Applications , vol.18 , Issue.4
- Toledo, S.¹

30
- 0003706460
- Society for Industrial and Applied Mathematics: Philadelphia, PA
- Anderson E, Bai Z, Bischof C, Blackford SL, Demmel JW, Dongarra JJ, Du Croz J, Greenbaum A, Hammarling S, McKenney A, Sorensen DC. LAPACK User's Guide (3rd edn). Society for Industrial and Applied Mathematics: Philadelphia, PA, 1999.
- (1999) LAPACK User's Guide (3rd Edn)
- Anderson, E.¹ Bai, Z.² Bischof, C.³ Blackford, S.L.⁴ Demmel, J.W.⁵ Dongarra, J.J.⁶ Du Croz, J.⁷ Greenbaum, A.⁸ Hammarling, S.⁹ McKenney, A.¹⁰ Sorensen, D.C.¹¹

31
- 0029694501
- The design and implementation of Solar, a portable library of resealable out-of-core linear algebra computations
- ACM Press
- Toledo S, Gustavson FG. The design and implementation of Solar, a portable library of resealable out-of-core linear algebra computations. Proceedings of the 4th Annual Workshop on I/O in Parallel and Distributed Systems, May 1996. ACM Press, 1996; 28-40.
- (1996) Proceedings of the 4th Annual Workshop on I/O in Parallel and Distributed Systems, May 1996 , pp. 28-40
- Toledo, S.¹ Gustavson, F.G.²

32
- 0343462141
- Automated empirical optimization of software and the Atlas project
- Dongarra JJ, Petitet A, Whaley RC. Automated empirical optimization of software and the Atlas project. Parallel Computing 2001; 27(1-2):3-25.
- (2001) Parallel Computing , vol.27 , Issue.1-2 , pp. 3-25
- Dongarra, J.J.¹ Petitet, A.² Whaley, R.C.³

33
- 0042895865
- Automatically tuned linear algebra software (ATLAS)
- IEEE
- Dongarra JJ, Whaley RC. Automatically tuned linear algebra software (ATLAS). Proceedings SC'89 Conference. IEEE, 1998.
- (1998) Proceedings SC'89 Conference
- Dongarra, J.J.¹ Whaley, R.C.²

34
- 32844469834
- November 2
- Meuer HW, Strohmaier E, Dongarra JJ, Simon HD. Top500 Supercomputer Sites, 17th edition, November 2 2001. (The report can be downloaded from http://www.netlib.org/benchmark/top500.html).
- (2001) Top500 Supercomputer Sites, 17th Edition
- Meuer, H.W.¹ Strohmaier, E.² Dongarra, J.J.³ Simon, H.D.⁴

35
- 0042394908
- Ahrendt G. Weekly postings to comp.sys.super [1993].
- (1993)
- Ahrendt, G.¹

36
- 0041893749
- Kahaner report on supercomputer in Japan
- Technical Report, The Computer Science Department, University of Arizona
- Kahaner D. Kahaner report on supercomputer in Japan. Technical Report, The Computer Science Department, University of Arizona, 1992. ftp://ftp.cs.arizona.edu/japan/kahaner.reports/jsuper.92.
- (1992)
- Kahaner, D.¹

37
- 0025997771
- Using Strassen's algorithm to accelerate the solution of linear systems
- Bailey D, Lee K, Simon H. Using Strassen's algorithm to accelerate the solution of linear systems. Journal of Supercomputing 1990; 4:357-371.
- (1990) Journal of Supercomputing , vol.4 , pp. 357-371
- Bailey, D.¹ Lee, K.² Simon, H.³

38
- 0030092443
- A high performance parallel Strassen implementation
- Grayson B, van de Geijn R. A high performance parallel Strassen implementation. Parallel Processing Letters 1996; 6(1):3-12.
- (1996) Parallel Processing Letters , vol.6 , Issue.1 , pp. 3-12
- Grayson, B.¹ Van De Geijn, R.²

39
- 0041893748
- Using Strassen's matrix multiplication in high performance solution of linear systems
- Paprzycki M, Cyphers C. Using Strassen's matrix multiplication in high performance solution of linear systems. Computers and Mathematics with Applications 1996; 31(4/5):55-61.
- (1996) Computers and Mathematics with Applications , vol.31 , Issue.4-5 , pp. 55-61
- Paprzycki, M.¹ Cyphers, C.²

40
- 34250487811
- Gaussian elimination is not optimal
- Strassen V. Gaussian elimination is not optimal. Numerical Mathematics 1969; 13:354-356.
- (1969) Numerical Mathematics , vol.13 , pp. 354-356
- Strassen, V.¹

41
- 85023205150
- Matrix multiplication via arithmetic progressions
- Coppersmith D, Winograd S. Matrix multiplication via arithmetic progressions. Journal of Symbolic Computation 1990; 9:251-280.
- (1990) Journal of Symbolic Computation , vol.9 , pp. 251-280
- Coppersmith, D.¹ Winograd, S.²

42
- 0041893747
- Innovative Computing Laboratory, September; and http://www.netlib.org/benchmark/hpl/
- Petitet A, Whaley RC, Dongarra JJ, Cleary A. HPL-A Portable Implementation of the High-Performance Linpack Benchmark for Distributed-Memory Computers. Innovative Computing Laboratory, September 2000. Available at http://icl.cs.utk.edu/hpl/ and http://www.netlib.org/benchmark/hpl/.
- (2000) HPL-A Portable Implementation of the High-Performance Linpack Benchmark for Distributed-Memory Computers
- Petitet, A.¹ Whaley, R.C.² Dongarra, J.J.³ Cleary, A.⁴

43
- 0040675695
- High performance fortran language specification. Version 1.1
- High Performance Fortran Forum; Technical Report, Rice University, November
- High Performance Fortran Forum. High Performance Fortran Language specification. Version 1.1. Technical Report, Rice University, November 1994.
- (1994)

44
- 0003565849
- High performance fortran language specification. Version 2.0
- High Performance Fortran Forum; Technical Report, Rice University, January
- High Performance Fortran Forum. High Performance Fortran Language specification. version 2.0. Technical Report, Rice University, January 1997.
- (1997)

45
- 0041392784
- IBM; IBM
- IBM. Parallel Engineering and Scientific Subroutine Library for AIX Version 2 Release 3. IBM, 2001.
- (2001) Parallel Engineering and Scientific Subroutine Library for AIX Version 2 Release 3

46
- 0004247841
- The MIT Press
- van de Geijn RA. Using PLAPACK. The MIT Press. 1997.
- (1997) Using PLAPACK
- Van De Geijn, R.A.¹

47
- 0003615167
- Society for Industrial and Applied Mathematics: Philadelphia, PA
- Blackford LS, Choi J, Cleary A, D'Azevedo E, Demmel JW, Dhillon IS, Dongarra JJ, Hammarling S, Henry G, Petitet A, Stanley K, Walker DW, Whaley RC. ScaLAPACK Users' Guide. Society for Industrial and Applied Mathematics: Philadelphia, PA, 1997.
- (1997) ScaLAPACK Users' Guide
- Blackford, L.S.¹ Choi, J.² Cleary, A.³ D'Azevedo, E.⁴ Demmel, J.W.⁵ Dhillon, I.S.⁶ Dongarra, J.J.⁷ Hammarling, S.⁸ Henry, G.⁹ Petitet, A.¹⁰ Stanley, K.¹¹ Walker, D.W.¹² Whaley, R.C.¹³

48
- 84871844180
- Openmp: Simple, portable, scalable smp programming
- Openmp: Simple, portable, scalable smp programming. http://www.openmp.org/.

49
- 0013025153
- International Organization for Standardization; ISO/IEC 9945-1:1996, Geneva, Switzerland
- International Organization for Standardization. Information technology-Portable operating system interface (POSIX)-Part 1: System Application Programming Interface (API) [C language]. ISO/IEC 9945-1:1996, Geneva, Switzerland, 1996.
- (1996) Information Technology-Portable Operating System Interface (POSIX)-Part 1: System Application Programming Interface (API) [C Language]

50
- 0042394907
- VSIPL 1.02 API
- [26 February]
- Shwartz DA, Judd RR, Harrod WJ, Manley DP. VSIPL 1.02 API. http://www.vsipl.org/ [26 February 2002].
- (2002)
- Shwartz, D.A.¹ Judd, R.R.² Harrod, W.J.³ Manley, D.P.⁴

51
- 0003413675
- MPI: A message-passing interface standard (version 1.1)
- Message Passing Interface Forum
- Message Passing Interface Forum. MPI: A Message-Passing Interface Standard (version 1.1). http://www.mpi-forum.org/ [1995].
- (1995)

52
- 0003604499
- MPI-2: Extensions to the message-passing interface
- Message Passing Interface Forum; [July]
- Message Passing Interface Forum. MPI-2: Extensions to the Message-Passing Interface. http://www.mpi-forum.org/ [July 1997].
- (1997)

53
- 0031221523
- Parallel implementation of BLAS: General techniques for level 3 BLAS
- Chtchelkanova A, Gunnels J, Morrow G, Overfelt J, van de Geijn R. Parallel implementation of BLAS: General techniques for Level 3 BLAS. Concurrency: Practice and Experience 1997, 9(9):837-857.
- (1997) Concurrency: Practice and Experience , vol.9 , Issue.9 , pp. 837-857
- Chtchelkanova, A.¹ Gunnels, J.² Morrow, G.³ Overfelt, J.⁴ Van De Geijn, R.⁵

54
- 0000778168
- Scalability issues in the design of a library for dense linear algebra
- (Also LAPACK Working Note No. 43)
- Dongarra JJ, van de Geijn R, Walker DW. Scalability issues in the design of a library for dense linear algebra. Journal of Parallel and Distributed Computing 1994: 22(3):523-537. (Also LAPACK Working Note No. 43).
- (1994) Journal of Parallel and Distributed Computing , vol.22 , Issue.3 , pp. 523-537
- Dongarra, J.J.¹ Van De Geijn, R.² Walker, D.W.³

55
- 0042895859
- Massively parallel LINPACK benchmark on the intel touchstone DELTA and iPSC/860 systems
- Intel Supercomputer Users Group, 1991
- van de Geijn R. Massively parallel LINPACK Benchmark on the Intel Touchstone DELTA and iPSC/860 systems. 1991 Annual Users Conference Proceedings, Dallas, Texas. Intel Supercomputer Users Group, 1991.
- 1991 Annual Users Conference Proceedings, Dallas, Texas
- Van De Geijn, R.¹

56
- 0031123769
- SUMMA: Scalable universal matrix multiplication algorithm
- van de Geijn R, Watts J. SUMMA: Scalable universal matrix multiplication algorithm. Concurrency: Practice and Experience 1997; 9(4):255-274.
- (1997) Concurrency: Practice and Experience , vol.9 , Issue.4 , pp. 255-274
- Van De Geijn, R.¹ Watts, J.²

57
- 0000793139
- Cramming more components onto integrated circuits
- Moore GE. Cramming more components onto integrated circuits. Electronics 1965; 38(8).
- (1965) Electronics , vol.38 , Issue.8
- Moore, G.E.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.