-
1
-
-
0042895870
-
-
The LINPACK 1000x1000 benchmark program
-
The LINPACK 1000x1000 benchmark program. (http://www.netlib.org/benchmark/1000d for source code.).
-
-
-
-
2
-
-
0003555195
-
-
SIAM: Philadelphia, PA
-
Dongarra JJ, Bunch J, Moller C, Stewart GW. LINPACK User's Guide. SIAM: Philadelphia, PA, 1979.
-
(1979)
LINPACK User's Guide
-
-
Dongarra, J.J.1
Bunch, J.2
Moller, C.3
Stewart, G.W.4
-
4
-
-
0003533609
-
Performance of various computers using standard linear equations software
-
Technical Report CS-89-85, University of Tennessee
-
Dongarra JJ. Performance of various computers using standard linear equations software. Technical Report CS-89-85, University of Tennessee, 2002. (An updated version of this report can be found at benchmark/performance.ps).
-
(2002)
-
-
Dongarra, J.J.1
-
5
-
-
0015279213
-
Some notes on speeding up certain loops by software, firmware, and hardware means
-
Pager D. Some notes on speeding up certain loops by software, firmware, and hardware means. IEEE Transactions on Computers 1972; 97-100.
-
(1972)
IEEE Transactions on Computers
, pp. 97-100
-
-
Pager, D.1
-
7
-
-
84983965442
-
An empirical study of Fortran programs
-
Knuth D. An empirical study of Fortran programs. Software-Practice and Experience 1971; 1:105-133.
-
(1971)
Software-Practice and Experience
, vol.1
, pp. 105-133
-
-
Knuth, D.1
-
9
-
-
0003851784
-
-
SIAM: Philadelphia, PA
-
Dongarra JJ, Duff IS, Sorensen DC, van der Vorst HA. Numerical Linear Algebra for High-Performance Computers. SIAM: Philadelphia, PA, 1998.
-
(1998)
Numerical Linear Algebra for High-Performance Computers
-
-
Dongarra, J.J.1
Duff, I.S.2
Sorensen, D.C.3
Van Der Vorst, H.A.4
-
11
-
-
84909708535
-
Linear algebra on high-performance computers
-
Schendel U (ed.). North Holland
-
Dongarra JJ, Sorensen DC. Linear algebra on high-performance computers. Proceedings Parallel Computing 85, Schendel U (ed.). North Holland, 1986; 3-32.
-
(1986)
Proceedings Parallel Computing
, vol.85
, pp. 3-32
-
-
Dongarra, J.J.1
Sorensen, D.C.2
-
13
-
-
0012062653
-
Solution of simultaneous linear equations using a magnetic tape store
-
Barron DW, Swinnerton-Dyer HPF. Solution of simultaneous linear equations using a magnetic tape store. Computer J. 1990; 3:28-33.
-
(1990)
Computer J.
, vol.3
, pp. 28-33
-
-
Barron, D.W.1
Swinnerton-Dyer, H.P.F.2
-
14
-
-
0022862642
-
Block-oriented local-memory-based linear equation solution on the CRAY-2: Uniprocessor algorithms
-
Schendel U (ed). IEEE Computer Society Press
-
Calahan DA. Block-oriented local-memory-based linear equation solution on the CRAY-2: Uniprocessor algorithms. Proceedings International Conference on Parallel Processing, August 1986, Schendel U (ed). IEEE Computer Society Press, 1986; 375-378.
-
(1986)
Proceedings International Conference on Parallel Processing, August 1986
, pp. 375-378
-
-
Calahan, D.A.1
-
15
-
-
0042895869
-
Adaption of the Jacobi and Givens methods for a computer with magnetic tape backup store
-
Technical Report 8, University of Sydney
-
Chartres B. Adaption of the Jacobi and Givens methods for a computer with magnetic tape backup store. Technical Report 8, University of Sydney, 1960.
-
(1960)
-
-
Chartres, B.1
-
16
-
-
0041392786
-
Sparse matrix calculations on the CRAY-2
-
Technical Report CSS 197, AERE Harwell
-
Dave AK, Duff IS. Sparse matrix calculations on the CRAY-2. Technical Report CSS 197, AERE Harwell, 1986.
-
(1986)
-
-
Dave, A.K.1
Duff, I.S.2
-
17
-
-
0019661013
-
Solving large full sets of linear equations in a paged virtual store
-
DuCroz J, Nugent S, Reid J, Taylor D. Solving large full sets of linear equations in a paged virtual store. ACM Transactions on Mathematical Software 1981; 7(4):527-536.
-
(1981)
ACM Transactions on Mathematical Software
, vol.7
, Issue.4
, pp. 527-536
-
-
DuCroz, J.1
Nugent, S.2
Reid, J.3
Taylor, D.4
-
18
-
-
84945709131
-
Organizing matrices and matrix operations for paged memory systems
-
McKellar AC, Coffman EG Jr. Organizing matrices and matrix operations for paged memory systems. Communications of the ACM 1969; 12(3):153-165.
-
(1969)
Communications of the ACM
, vol.12
, Issue.3
, pp. 153-165
-
-
McKellar, A.C.1
Coffman E.G., Jr.2
-
19
-
-
31044454349
-
Parallel algorithms on Cedar system
-
Technical Report Report No. 581, CSRD
-
Berry M, Gallivan K, Harrod W, Jalby W, Lo S, Meier U, Philippe B, Sameh A. Parallel algorithms on Cedar system. Technical Report Report No. 581, CSRD, 1986.
-
(1986)
-
-
Berry, M.1
Gallivan, K.2
Harrod, W.3
Jalby, W.4
Lo, S.5
Meier, U.6
Philippe, B.7
Sameh, A.8
-
20
-
-
0001951009
-
The WY representation for products of householder matrices
-
Bischof C, Van Loan CF. The WY representation for products of Householder matrices. SIAM SISSC 1987; 8(2).
-
(1987)
SIAM SISSC
, vol.8
, Issue.2
-
-
Bischof, C.1
Van Loan, C.F.2
-
21
-
-
0041893750
-
Linear algebra programs for use on a vector computer with a secondary solid state storage device
-
Vichnevetsky R, Stepleman R (eds). IMACS
-
Bucher I, Jordan T. Linear algebra programs for use on a vector computer with a secondary solid state storage device. Advances in Computer Methods for Practical Differential Equations, Vichnevetsky R, Stepleman R (eds). IMACS, 1984; 546-550.
-
(1984)
Advances in Computer Methods for Practical Differential Equations
, pp. 546-550
-
-
Bucher, I.1
Jordan, T.2
-
23
-
-
0042895867
-
The LU decomposition algorithm and its efficient Fortran implementation on the IBM 3090 vector multiprocessor
-
Technical Report ECSEC Report ICE-0006, IBM, March
-
Robert Y, Suguazerro P. The LU decomposition algorithm and its efficient Fortran implementation on the IBM 3090 vector multiprocessor. Technical Report ECSEC Report ICE-0006, IBM, March 1987.
-
(1987)
-
-
Robert, Y.1
Suguazerro, P.2
-
24
-
-
0042394909
-
-
SAXPY Computer Corporation, 255 San Geronimo Way, Sunnyvale, CA 94086
-
Schreiber R. Engineering and Scientific Subroutine Library, Module Design Specification. SAXPY Computer Corporation, 255 San Geronimo Way, Sunnyvale, CA 94086, 1986.
-
(1986)
Engineering and Scientific Subroutine Library, Module Design Specification
-
-
Schreiber, R.1
-
26
-
-
0013136934
-
Full matrix techniques in sparse Gaussian elimination
-
Duff IS. Full matrix techniques in sparse Gaussian elimination. Numerical Analysis Proceedings, Dundee, 1981 (Lecture Notes in Mathematics, vol. 912). Springer: Berlin, 1981; 71-84.
-
Numerical Analysis Proceedings, Dundee, 1981 (Lecture Notes in Mathematics, Vol. 912). Springer Berlin, 1981
, pp. 71-84
-
-
Duff, I.S.1
-
27
-
-
0012305554
-
Auxiliary storage methods for solving finite element systems
-
George A, Rashwan H. Auxiliary storage methods for solving finite element systems. SIAM SISSC 1985; 6:882-910.
-
(1985)
SIAM SISSC
, vol.6
, pp. 882-910
-
-
George, A.1
Rashwan, H.2
-
28
-
-
0031273280
-
Recursion leads to automatic variable blocking for dense linear-algebra algorithms
-
Gustavson FG. Recursion leads to automatic variable blocking for dense linear-algebra algorithms. IBM Journal of Research and Development 1997; 41(6):737-755.
-
(1997)
IBM Journal of Research and Development
, vol.41
, Issue.6
, pp. 737-755
-
-
Gustavson, F.G.1
-
29
-
-
0031496750
-
Locality of reference in LU decomposition with partial pivoting
-
Toledo S. Locality of reference in LU decomposition with partial pivoting. SIAM Journal on Matrix Analysis and Applications 1997; 18(4).
-
(1997)
SIAM Journal on Matrix Analysis and Applications
, vol.18
, Issue.4
-
-
Toledo, S.1
-
30
-
-
0003706460
-
-
Society for Industrial and Applied Mathematics: Philadelphia, PA
-
Anderson E, Bai Z, Bischof C, Blackford SL, Demmel JW, Dongarra JJ, Du Croz J, Greenbaum A, Hammarling S, McKenney A, Sorensen DC. LAPACK User's Guide (3rd edn). Society for Industrial and Applied Mathematics: Philadelphia, PA, 1999.
-
(1999)
LAPACK User's Guide (3rd Edn)
-
-
Anderson, E.1
Bai, Z.2
Bischof, C.3
Blackford, S.L.4
Demmel, J.W.5
Dongarra, J.J.6
Du Croz, J.7
Greenbaum, A.8
Hammarling, S.9
McKenney, A.10
Sorensen, D.C.11
-
32
-
-
0343462141
-
Automated empirical optimization of software and the Atlas project
-
Dongarra JJ, Petitet A, Whaley RC. Automated empirical optimization of software and the Atlas project. Parallel Computing 2001; 27(1-2):3-25.
-
(2001)
Parallel Computing
, vol.27
, Issue.1-2
, pp. 3-25
-
-
Dongarra, J.J.1
Petitet, A.2
Whaley, R.C.3
-
34
-
-
32844469834
-
-
November 2
-
Meuer HW, Strohmaier E, Dongarra JJ, Simon HD. Top500 Supercomputer Sites, 17th edition, November 2 2001. (The report can be downloaded from http://www.netlib.org/benchmark/top500.html).
-
(2001)
Top500 Supercomputer Sites, 17th Edition
-
-
Meuer, H.W.1
Strohmaier, E.2
Dongarra, J.J.3
Simon, H.D.4
-
35
-
-
0042394908
-
-
Ahrendt G. Weekly postings to comp.sys.super [1993].
-
(1993)
-
-
Ahrendt, G.1
-
36
-
-
0041893749
-
Kahaner report on supercomputer in Japan
-
Technical Report, The Computer Science Department, University of Arizona
-
Kahaner D. Kahaner report on supercomputer in Japan. Technical Report, The Computer Science Department, University of Arizona, 1992. ftp://ftp.cs.arizona.edu/japan/kahaner.reports/jsuper.92.
-
(1992)
-
-
Kahaner, D.1
-
37
-
-
0025997771
-
Using Strassen's algorithm to accelerate the solution of linear systems
-
Bailey D, Lee K, Simon H. Using Strassen's algorithm to accelerate the solution of linear systems. Journal of Supercomputing 1990; 4:357-371.
-
(1990)
Journal of Supercomputing
, vol.4
, pp. 357-371
-
-
Bailey, D.1
Lee, K.2
Simon, H.3
-
38
-
-
0030092443
-
A high performance parallel Strassen implementation
-
Grayson B, van de Geijn R. A high performance parallel Strassen implementation. Parallel Processing Letters 1996; 6(1):3-12.
-
(1996)
Parallel Processing Letters
, vol.6
, Issue.1
, pp. 3-12
-
-
Grayson, B.1
Van De Geijn, R.2
-
39
-
-
0041893748
-
Using Strassen's matrix multiplication in high performance solution of linear systems
-
Paprzycki M, Cyphers C. Using Strassen's matrix multiplication in high performance solution of linear systems. Computers and Mathematics with Applications 1996; 31(4/5):55-61.
-
(1996)
Computers and Mathematics with Applications
, vol.31
, Issue.4-5
, pp. 55-61
-
-
Paprzycki, M.1
Cyphers, C.2
-
40
-
-
34250487811
-
Gaussian elimination is not optimal
-
Strassen V. Gaussian elimination is not optimal. Numerical Mathematics 1969; 13:354-356.
-
(1969)
Numerical Mathematics
, vol.13
, pp. 354-356
-
-
Strassen, V.1
-
43
-
-
0040675695
-
High performance fortran language specification. Version 1.1
-
High Performance Fortran Forum; Technical Report, Rice University, November
-
High Performance Fortran Forum. High Performance Fortran Language specification. Version 1.1. Technical Report, Rice University, November 1994.
-
(1994)
-
-
-
44
-
-
0003565849
-
High performance fortran language specification. Version 2.0
-
High Performance Fortran Forum; Technical Report, Rice University, January
-
High Performance Fortran Forum. High Performance Fortran Language specification. version 2.0. Technical Report, Rice University, January 1997.
-
(1997)
-
-
-
47
-
-
0003615167
-
-
Society for Industrial and Applied Mathematics: Philadelphia, PA
-
Blackford LS, Choi J, Cleary A, D'Azevedo E, Demmel JW, Dhillon IS, Dongarra JJ, Hammarling S, Henry G, Petitet A, Stanley K, Walker DW, Whaley RC. ScaLAPACK Users' Guide. Society for Industrial and Applied Mathematics: Philadelphia, PA, 1997.
-
(1997)
ScaLAPACK Users' Guide
-
-
Blackford, L.S.1
Choi, J.2
Cleary, A.3
D'Azevedo, E.4
Demmel, J.W.5
Dhillon, I.S.6
Dongarra, J.J.7
Hammarling, S.8
Henry, G.9
Petitet, A.10
Stanley, K.11
Walker, D.W.12
Whaley, R.C.13
-
48
-
-
84871844180
-
Openmp: Simple, portable, scalable smp programming
-
Openmp: Simple, portable, scalable smp programming. http://www.openmp.org/.
-
-
-
-
51
-
-
0003413675
-
MPI: A message-passing interface standard (version 1.1)
-
Message Passing Interface Forum
-
Message Passing Interface Forum. MPI: A Message-Passing Interface Standard (version 1.1). http://www.mpi-forum.org/ [1995].
-
(1995)
-
-
-
52
-
-
0003604499
-
MPI-2: Extensions to the message-passing interface
-
Message Passing Interface Forum; [July]
-
Message Passing Interface Forum. MPI-2: Extensions to the Message-Passing Interface. http://www.mpi-forum.org/ [July 1997].
-
(1997)
-
-
-
53
-
-
0031221523
-
Parallel implementation of BLAS: General techniques for level 3 BLAS
-
Chtchelkanova A, Gunnels J, Morrow G, Overfelt J, van de Geijn R. Parallel implementation of BLAS: General techniques for Level 3 BLAS. Concurrency: Practice and Experience 1997, 9(9):837-857.
-
(1997)
Concurrency: Practice and Experience
, vol.9
, Issue.9
, pp. 837-857
-
-
Chtchelkanova, A.1
Gunnels, J.2
Morrow, G.3
Overfelt, J.4
Van De Geijn, R.5
-
54
-
-
0000778168
-
Scalability issues in the design of a library for dense linear algebra
-
(Also LAPACK Working Note No. 43)
-
Dongarra JJ, van de Geijn R, Walker DW. Scalability issues in the design of a library for dense linear algebra. Journal of Parallel and Distributed Computing 1994: 22(3):523-537. (Also LAPACK Working Note No. 43).
-
(1994)
Journal of Parallel and Distributed Computing
, vol.22
, Issue.3
, pp. 523-537
-
-
Dongarra, J.J.1
Van De Geijn, R.2
Walker, D.W.3
-
55
-
-
0042895859
-
Massively parallel LINPACK benchmark on the intel touchstone DELTA and iPSC/860 systems
-
Intel Supercomputer Users Group, 1991
-
van de Geijn R. Massively parallel LINPACK Benchmark on the Intel Touchstone DELTA and iPSC/860 systems. 1991 Annual Users Conference Proceedings, Dallas, Texas. Intel Supercomputer Users Group, 1991.
-
1991 Annual Users Conference Proceedings, Dallas, Texas
-
-
Van De Geijn, R.1
-
57
-
-
0000793139
-
Cramming more components onto integrated circuits
-
Moore GE. Cramming more components onto integrated circuits. Electronics 1965; 38(8).
-
(1965)
Electronics
, vol.38
, Issue.8
-
-
Moore, G.E.1
|