-
1
-
-
0000793139
-
Cramming more components onto integrated circuits
-
Moore G.E. Cramming more components onto integrated circuits. Electronics. 38(8):1965.
-
(1965)
Electronics
, vol.38
, Issue.8
-
-
Moore, G.E.1
-
2
-
-
2142848860
-
-
November
-
H.W. Meuer, E. Strohmaier, J.J. Dongarra, H.D. Simon, Top 500 Supercomputer Sites, 20th ed., The report can be downloaded from: 〈http://www.netlib.org/benchmark/top500.html〉, November 2002.
-
(2002)
Top 500 Supercomputer Sites, 20th Ed.
-
-
Meuer, H.W.1
Strohmaier, E.2
Dongarra, J.J.3
Simon, H.D.4
-
4
-
-
0002687459
-
No silver bullet: Essence and accidents of software engineering
-
F.P. Brooks Jr., No silver bullet: essence and accidents of software engineering, Information Processing.
-
Information Processing
-
-
Brooks Jr., F.P.1
-
5
-
-
0242590446
-
Self-adapting numerical software for next generation applications
-
Innovative Computing Laboratory University of Tennessee, August
-
J. Dongarra, V. Eijkhout, Self-adapting numerical software for next generation applications, Technical Report, Innovative Computing Laboratory University of Tennessee, Available from: 〈http://icl.cs.utk.edu/iclprojects/pages/sans.html〉, August 2002.
-
(2002)
Technical Report
-
-
Dongarra, J.1
Eijkhout, V.2
-
6
-
-
0038368778
-
Deploying parallel numerical library routines to cluster computing in a self adapting fashion
-
Joubert, Murli, Peters, & Vanneschi. London, England: Imperial College Press
-
Roche K.J., Dongarra J.J. Deploying parallel numerical library routines to cluster computing in a self adapting fashion. Joubert, Murli, Peters, Vanneschi Parallel Computing: Advances and Current Issues. 2002;Imperial College Press, London, England.
-
(2002)
Parallel Computing: Advances and Current Issues
-
-
Roche, K.J.1
Dongarra, J.J.2
-
9
-
-
0024018137
-
A polynomial approximation scheme for machine scheduling on uniform processors: Using the dual approach
-
Hochbaum D., Shmoys D. A polynomial approximation scheme for machine scheduling on uniform processors: using the dual approach. SIAM Journal of Computing. 17:1988;539-551.
-
(1988)
SIAM Journal of Computing
, vol.17
, pp. 539-551
-
-
Hochbaum, D.1
Shmoys, D.2
-
10
-
-
0000438412
-
Approximation algorithms for scheduling unrelated parallel machines
-
Lenstra J., Shmoys D., Tardos E. Approximation algorithms for scheduling unrelated parallel machines. Mathematical Programming. 46:1990;259-271.
-
(1990)
Mathematical Programming
, vol.46
, pp. 259-271
-
-
Lenstra, J.1
Shmoys, D.2
Tardos, E.3
-
11
-
-
84958040361
-
Approximation algorithms for dynamic storage allocation
-
Lecture Notes in Computer Science, Springer-Verlag
-
Gergov J. Approximation algorithms for dynamic storage allocation. Proceedings of the 4th Annual European Symposium on Algorithms. Lecture Notes in Computer Science. 1136:1996;52-56 Springer-Verlag.
-
(1996)
Proceedings of the 4th Annual European Symposium on Algorithms
, vol.1136
, pp. 52-56
-
-
Gergov, J.1
-
12
-
-
0004493166
-
On the approximability of minimizing nonzero variables or unsatisfied relations in linear systems
-
Amaldi E., Kann V. On the approximability of minimizing nonzero variables or unsatisfied relations in linear systems. Theoretical Computer Science. 209:1998;237-260.
-
(1998)
Theoretical Computer Science
, vol.209
, pp. 237-260
-
-
Amaldi, E.1
Kann, V.2
-
13
-
-
84947928463
-
Strong lower bounds on the approximability of some NPO PB - Complete maximization problems
-
Lecture Notes in Computer Science, Springer-Verlag
-
Kann V. Strong lower bounds on the approximability of some NPO PB - complete maximization problems. Proceedings of the 20th International Symposium on Mathematical Foundations of Computer Science. Lecture Notes in Computer Science. 969:1995;227-236 Springer-Verlag.
-
(1995)
Proceedings of the 20th International Symposium on Mathematical Foundations of Computer Science
, vol.969
, pp. 227-236
-
-
Kann, V.1
-
14
-
-
34250487811
-
Gaussian elimination is not optimal
-
Strassen V. Gaussian elimination is not optimal. Numerical Mathematics. 13:1969;354-356.
-
(1969)
Numerical Mathematics
, vol.13
, pp. 354-356
-
-
Strassen, V.1
-
16
-
-
0003424372
-
-
Philadelphia: Society for Industrial and Applied Mathematics
-
Demmel J.W. Applied Numerical Linear Algebra. 1997;Society for Industrial and Applied Mathematics, Philadelphia.
-
(1997)
Applied Numerical Linear Algebra
-
-
Demmel, J.W.1
-
18
-
-
0031273280
-
Recursion leads to automatic variable blocking for dense linear-algebra algorithms
-
Gustavson F.G. Recursion leads to automatic variable blocking for dense linear-algebra algorithms. IBM Journal of Research and Development. 41(6):1997;737-755.
-
(1997)
IBM Journal of Research and Development
, vol.41
, Issue.6
, pp. 737-755
-
-
Gustavson, F.G.1
-
19
-
-
0031496750
-
Locality of reference in LU decomposition with partial pivoting
-
Toledo S. Locality of reference in LU decomposition with partial pivoting. SIAM Journal on Matrix Analysis and Applications. 18(4):1997;1065-1081.
-
(1997)
SIAM Journal on Matrix Analysis and Applications
, vol.18
, Issue.4
, pp. 1065-1081
-
-
Toledo, S.1
-
20
-
-
0030645124
-
Exploiting hardware performance counters with flow and context sensitive profiling
-
Las Vegas, Nevada, USA
-
G. Ammons, T. Ball, J.R. Larus, Exploiting hardware performance counters with flow and context sensitive profiling, in: Proceedings of ACM SIGPLAN'97 Conference on Programming Language Design and Implementation, Las Vegas, Nevada, USA, 1997.
-
(1997)
Proceedings of ACM SIGPLAN'97 Conference on Programming Language Design and Implementation
-
-
Ammons, G.1
Ball, T.2
Larus, J.R.3
-
22
-
-
0026368758
-
Using profile information to assist classic code optimization
-
Chang P.P., Mahlke S.A., Hwu W.W. Using profile information to assist classic code optimization. Software Practice and Experience. 21(12):1991;1301-1321.
-
(1991)
Software Practice and Experience
, vol.21
, Issue.12
, pp. 1301-1321
-
-
Chang, P.P.1
Mahlke, S.A.2
Hwu, W.W.3
-
23
-
-
85086055276
-
On the construction of poly-algorithms for automatic numerical analysis
-
M. Klerer, & J. Reinfelds. Academic Press
-
Rice J.R. On the construction of poly-algorithms for automatic numerical analysis. Klerer M., Reinfelds J. Interactive Systems for Experimental Applied Mathematics. 1968;31-313 Academic Press.
-
(1968)
Interactive Systems for Experimental Applied Mathematics
, pp. 31-313
-
-
Rice, J.R.1
-
24
-
-
0343462141
-
Automated empirical optimizations of software and the ATLAS project
-
Whaley R.C., Petitet A., Dongarra J.J. Automated empirical optimizations of software and the ATLAS project. Parallel Computing. 27(1-2):2001;3-35.
-
(2001)
Parallel Computing
, vol.27
, Issue.1-2
, pp. 3-35
-
-
Whaley, R.C.1
Petitet, A.2
Dongarra, J.J.3
-
30
-
-
0003706460
-
-
Philadelphia: Society for Industrial and Applied Mathematics
-
Anderson E., Bai Z., Bischof C., Blackford S.L., Demmel J.W., Dongarra J.J., Croz J.D., Greenbaum A., Hammarling S., McKenney A., Sorensen D.C. LAPACK User's Guide. third ed. 1999;Society for Industrial and Applied Mathematics, Philadelphia.
-
(1999)
LAPACK User's Guide, Third Ed.
-
-
Anderson, E.1
Bai, Z.2
Bischof, C.3
Blackford, S.L.4
Demmel, J.W.5
Dongarra, J.J.6
Croz, J.D.7
Greenbaum, A.8
Hammarling, S.9
McKenney, A.10
Sorensen, D.C.11
-
31
-
-
0030661485
-
Optimizing matrix multiply using PHiPAC: A portable, high-performance, ANSI C coding methodology
-
Vienna, Austria: ACM SIGARC
-
Bilmes J., Asanovic K., Chin C., Demmel J. Optimizing matrix multiply using PHiPAC: a portable, high-performance, ANSI C coding methodology. Proceedings of International Conference on Supercomputing. 1997;ACM SIGARC, Vienna, Austria.
-
(1997)
Proceedings of International Conference on Supercomputing
-
-
Bilmes, J.1
Asanovic, K.2
Chin, C.3
Demmel, J.4
-
33
-
-
0030571238
-
Algorithmic bombardment for the iterative solution of linear systems: A poly-iterative approach
-
Barrett R., Berry M., Dongarra J., Eijkhout V., Romine C. Algorithmic bombardment for the iterative solution of linear systems: a poly-iterative approach. Journal of Computational and Applied Mathematics. 74(1-2):1996;91-109.
-
(1996)
Journal of Computational and Applied Mathematics
, vol.74
, Issue.1-2
, pp. 91-109
-
-
Barrett, R.1
Berry, M.2
Dongarra, J.3
Eijkhout, V.4
Romine, C.5
-
36
-
-
0004972603
-
-
Ph.D. Thesis, University of California, Berkeley, California
-
E.-J. Im, Automatic optimization of sparse matrix-vector multiplication, Ph.D. Thesis, University of California, Berkeley, California, 2000.
-
(2000)
Automatic Optimization of Sparse Matrix-vector Multiplication
-
-
Im, E.-J.1
-
37
-
-
0031269220
-
Improving the memory-system performance of sparse matrix-vector multiplication
-
Toledo S. Improving the memory-system performance of sparse matrix-vector multiplication. IBM Journal of Research and Development. 41(6):1997.
-
(1997)
IBM Journal of Research and Development
, vol.41
, Issue.6
-
-
Toledo, S.1
-
38
-
-
3042576437
-
Improving performance of sparse matrix-vector multiplication
-
A. Pinar, M.T. Heath, Improving performance of sparse matrix-vector multiplication, in: Proceedings of SC'99, 1999.
-
(1999)
Proceedings of SC'99
-
-
Pinar, A.1
Heath, M.T.2
-
41
-
-
0242590439
-
Performance modeling for self adapting collective communications for MPI
-
Sante Fe, New Mexico
-
S. Vadhiyar, G. Fagg, J.J. Dongarra, Performance modeling for self adapting collective communications for MPI, In: Los Alamos Computer Science Institute Symposium (LACSI 2001), Sante Fe, New Mexico, 2001.
-
(2001)
Los Alamos Computer Science Institute Symposium (LACSI 2001)
-
-
Vadhiyar, S.1
Fagg, G.2
Dongarra, J.J.3
-
42
-
-
0042674307
-
The LINPACK benchmark: Past, present, and future
-
Dongarra J.J., Luszczek P., Petitet A. The LINPACK benchmark: past, present, and future. Concurrency and Computation: Practice and Experience. 15:2003;1-18.
-
(2003)
Concurrency and Computation: Practice and Experience
, vol.15
, pp. 1-18
-
-
Dongarra, J.J.1
Luszczek, P.2
Petitet, A.3
-
43
-
-
0031636309
-
FFTW: An adaptive software architecture for the FFT
-
Seattle, Washington, USA
-
M. Frigo, S.G. Johnson, FFTW: an adaptive software architecture for the FFT, in: Proceedings International Conference on Acoustics, Speech, and Signal Processing, Seattle, Washington, USA, 1998.
-
(1998)
Proceedings International Conference on Acoustics, Speech, and Signal Processing
-
-
Frigo, M.1
Johnson, S.G.2
-
45
-
-
0242590430
-
Automatic performance tuning in the UHFFT library
-
San Francisco, California, USA
-
D. Mirkovic, S.L. Johnsson, Automatic performance tuning in the UHFFT library, in: 2001 International Conference on Computational Science, San Francisco, California, USA, 2001.
-
(2001)
2001 International Conference on Computational Science
-
-
Mirkovic, D.1
Johnsson, S.L.2
-
48
-
-
0035551989
-
Numerical libraries and the grid
-
Petitet A., Blackford S., Dongarra J., Ellis B., Fagg G., Roche K., Vadhiyar S. Numerical libraries and the grid. International Journal of High Performance Computing Applications. 15:2001;359-374.
-
(2001)
International Journal of High Performance Computing Applications
, vol.15
, pp. 359-374
-
-
Petitet, A.1
Blackford, S.2
Dongarra, J.3
Ellis, B.4
Fagg, G.5
Roche, K.6
Vadhiyar, S.7
-
49
-
-
34250261642
-
Adaptive procedure for estimating parameters for the nonsymmetric Tchebyshev iteration
-
Manteuffel T.A. Adaptive procedure for estimating parameters for the nonsymmetric Tchebyshev iteration. Numerische Mathematik. 31:1978;183-208.
-
(1978)
Numerische Mathematik
, vol.31
, pp. 183-208
-
-
Manteuffel, T.A.1
-
50
-
-
0001256129
-
The Tchebyshev iteration for nonsymmetric linear systems
-
Manteuffel T.A. The Tchebyshev iteration for nonsymmetric linear systems. Numerische Mathematik. 28:1977;307-327.
-
(1977)
Numerische Mathematik
, vol.28
, pp. 307-327
-
-
Manteuffel, T.A.1
-
51
-
-
0000659752
-
A practical termination criterion for the Conjugate Gradient method
-
Kaasschieter E.F. A practical termination criterion for the Conjugate Gradient method. BIT. 28:1988;308-322.
-
(1988)
BIT
, vol.28
, pp. 308-322
-
-
Kaasschieter, E.F.1
-
52
-
-
4243917643
-
Computational variants of the CGS and BiCGSTAB methods
-
Computer Science Department, The University of Tennessee Knoxville, August
-
V. Eijkhout, Computational variants of the CGS and BiCGSTAB methods, Technical Report CS-94-241, Computer Science Department, The University of Tennessee Knoxville, August 1994 (Also LAPACK Working Note No. 78).
-
(1994)
Technical Report
, vol.CS-94-241
-
-
Eijkhout, V.1
-
53
-
-
0242422519
-
-
V. Eijkhout, Computational variants of the CGS and BiCGSTAB methods, Technical Report CS-94-241, Computer Science Department, The University of Tennessee Knoxville, August 1994 (Also LAPACK Working Note No. 78).
-
LAPACK Working Note No. 78
, vol.78
-
-
-
54
-
-
0003978709
-
A proposal for a set of parallel basic linear algebra subprograms
-
Technical Report CS-95-292, University of Tennessee Knoxville, May
-
J. Choi, J. Dongarra, S. Ostrouchov, A. Petitet, D. Walker, R.C. Whaley, A proposal for a set of parallel basic linear algebra subprograms, Technical Report CS-95-292, University of Tennessee Knoxville, LAPACK Working Note 100, May 1995.
-
(1995)
LAPACK Working Note
, vol.100
-
-
Choi, J.1
Dongarra, J.2
Ostrouchov, S.3
Petitet, A.4
Walker, D.5
Whaley, R.C.6
-
55
-
-
0005713748
-
New serial and parallel recursive QR factorization algorithms for SMP systems
-
E. Elmroth, F.G. Gustavson, New serial and parallel recursive QR factorization algorithms for SMP systems, in: Proceedings of PARA 1998, 1998.
-
(1998)
Proceedings of PARA 1998
-
-
Elmroth, E.1
Gustavson, F.G.2
-
56
-
-
0242674322
-
Communication-efficient parallel dense LU using a 3-dimensional approach
-
Norfolk, Virginia, USA
-
D. Irony, S. Toledo, Communication-efficient parallel dense LU using a 3-dimensional approach, in: Proceedings of the 10th SIAM Conference on Parallel Processing for Scientific Computing, Norfolk, Virginia, USA, 2001.
-
(2001)
Proceedings of the 10th SIAM Conference on Parallel Processing for Scientific Computing
-
-
Irony, D.1
Toledo, S.2
-
60
-
-
0242674324
-
-
MPICH
-
MPICH, Available from: 〈 http://www.mcs.anl.gov/mpi/mpich/〉.
-
-
-
-
62
-
-
0000235223
-
The network weather service: A distributed resource performance forecasting service for metacomputing
-
Wolski R., Spring N., Hayes H. The network weather service: a distributed resource performance forecasting service for metacomputing. Future Generation Computing Systems. 14:1998.
-
(1998)
Future Generation Computing Systems
, vol.14
-
-
Wolski, R.1
Spring, N.2
Hayes, H.3
-
63
-
-
0242505770
-
A framework for performance modeling and prediction
-
IEEE
-
Snavely A.et al. A framework for performance modeling and prediction. Proceedings of Supercomputing 2002. 2002;IEEE.
-
(2002)
Proceedings of Supercomputing 2002
-
-
Snavely, A.1
-
64
-
-
0003487728
-
High performance fortran language specification
-
H.P. Forum, Center for Research on Parallel Computing, Rice University, Houston, TX, May
-
H.P. Forum, High performance fortran language specification, Technical Report CRPC-TR92225, Center for Research on Parallel Computing, Rice University, Houston, TX, May 1993.
-
(1993)
Technical Report
, vol.CRPC-TR92225
-
-
-
65
-
-
0242505768
-
-
Ph.D. Thesis, University of Tennessee, Knoxville, Tennessee
-
A. Petitet, Algorithmic redistribution methods for block cyclic decompositions, Ph.D. Thesis, University of Tennessee, Knoxville, Tennessee, 1996.
-
(1996)
Algorithmic Redistribution Methods for Block Cyclic Decompositions
-
-
Petitet, A.1
-
66
-
-
0036467455
-
Dense linear algebra kernels on heterogeneous platforms: Redistribution issues
-
Beaumont O., Legrand A., Rastello F., Robert Y. Dense linear algebra kernels on heterogeneous platforms: redistribution issues. Parallel Computing. 28(2):2002;155-185.
-
(2002)
Parallel Computing
, vol.28
, Issue.2
, pp. 155-185
-
-
Beaumont, O.1
Legrand, A.2
Rastello, F.3
Robert, Y.4
-
68
-
-
0003615167
-
-
Philadelphia: Society for Industrial and Applied Mathematics
-
Blackford L.S., Choi J., Cleary A., D'Azevedo E.F., Demmel J.W., Dhillon I.S., Dongarra J.J., Hammarling S., Henry G., Petitet A., Stanley K., Walker D.W., Whaley R.C. ScaLAPACK Users' Guide. 1997;Society for Industrial and Applied Mathematics, Philadelphia.
-
(1997)
ScaLAPACK Users' Guide
-
-
Blackford, L.S.1
Choi, J.2
Cleary, A.3
D'Azevedo, E.F.4
Demmel, J.W.5
Dhillon, I.S.6
Dongarra, J.J.7
Hammarling, S.8
Henry, G.9
Petitet, A.10
Stanley, K.11
Walker, D.W.12
Whaley, R.C.13
-
69
-
-
0030244536
-
The design and implementation of the ScaLAPACK LU, QR, and Cholesky factorization routines
-
Choi J., Dongarra J.J., Ostrouchov S., Petitet A., Walker D.W., Whaley R.C. The design and implementation of the ScaLAPACK LU, QR, and Cholesky factorization routines. Scientific Programming. 5:1996;173-184.
-
(1996)
Scientific Programming
, vol.5
, pp. 173-184
-
-
Choi, J.1
Dongarra, J.J.2
Ostrouchov, S.3
Petitet, A.4
Walker, D.W.5
Whaley, R.C.6
|