-
1
-
-
0003473816
-
-
2nd Edition. SIAM, Philadelphia, PA
-
R. Barrett, M. Berry, T. F. Chan, J. Demmel, J. Donato, J. Dongarra, V. Eijkhout, R. Pozo, C. Romine, and H. V. der Vorst. Templates for the Solution of Linear Systems: Building Blocks for Iterative Methods, 2nd Edition. SIAM, Philadelphia, PA, 1994.
-
(1994)
Templates for the Solution of Linear Systems: Building Blocks for Iterative Methods
-
-
Barrett, R.1
Berry, M.2
Chan, T.F.3
Demmel, J.4
Donato, J.5
Dongarra, J.6
Eijkhout, V.7
Pozo, R.8
Romine, C.9
der Vorst, H.V.10
-
2
-
-
0032606267
-
Automatic nonzero structure analysis
-
A. J. C. Bik and H. A. G. Wijshoff. Automatic nonzero structure analysis. SIAM Journal on Computing, 28(5):1576-1587, 1999.
-
(1999)
SIAM Journal on Computing
, vol.28
, Issue.5
, pp. 1576-1587
-
-
Bik, A.J.C.1
Wijshoff, H.A.G.2
-
3
-
-
0030661485
-
Optimizing matrix multiply using PHiPAC: A portable, high-performance, ANSI C coding methodology
-
Vienna, Austria, July ACM SIGARC.
-
J. Bilmes, K. Asanović, C. Chin, and J. Demmel. Optimizing matrix multiply using PHiPAC: a portable, high-performance, ANSI C coding methodology. In Proceedings of the International Conference on Supercomputing, Vienna, Austria, July 1997. ACM SIGARC. see http://www.icsi.berkeley.edu/~bilmes/phipac.
-
(1997)
Proceedings of the International Conference on Supercomputing
-
-
Bilmes, J.1
Asanović, K.2
Chin, C.3
Demmel, J.4
-
4
-
-
84891471315
-
-
S. Blackford, G. Corliss, J. Demmel, J. Dongarra, I. Duff, S. Hammarling, G. Henry, M. Heroux, C. Hu, W. Kahan, L. Kaufman, B. Kearfott, F. Krogh, X. Li, Z. Maany, A. Petitet, R. Pozo, K. Remington, W. Walster, C. Whaley, and J. W. von Gudenberg. Document for the Basic Linear Algebra Subprograms (BLAS) standard: BLAS Technical Forum, 2001. www.netlib.org/blast.
-
(2001)
Document for the Basic Linear Algebra Subprograms (BLAS) Standard: BLAS Technical Forum
-
-
Blackford, S.1
Corliss, G.2
Demmel, J.3
Dongarra, J.4
Duff, I.5
Hammarling, S.6
Henry, G.7
Heroux, M.8
Hu, C.9
Kahan, W.10
Kaufman, L.11
Kearfott, B.12
Krogh, F.13
Li, X.14
Maany, Z.15
Petitet, A.16
Pozo, R.17
Remington, K.18
Walster, W.19
Whaley, C.20
von Gudenberg, J.W.21
more..
-
5
-
-
0000488282
-
The matrix market: A web resource for test matrix collections
-
R. F. Boisvert, editor, London, Chapman and Hall. math.nist.gov/MatrixMarket
-
R. F. Boisvert, R. Pozo, K. Remington, R. Barrett, and J. J. Dongarra. The Matrix Market: A web resource for test matrix collections. In R. F. Boisvert, editor, Quality of Numerical Software, Assessment and Enhancement, pages 125-137, London, 1997. Chapman and Hall. math.nist.gov/MatrixMarket.
-
(1997)
Quality of Numerical Software, Assessment and Enhancement
, pp. 125-137
-
-
Boisvert, R.F.1
Pozo, R.2
Remington, K.3
Barrett, R.4
Dongarra, J.J.5
-
6
-
-
12844276307
-
A scalable cross-platform infrastructure for application performance tuning using hardware counters
-
November
-
S. Browne, J. Dongarra, N. Garner, K. London, and P. Mucci. A scalable cross-platform infrastructure for application performance tuning using hardware counters. In Proceedings of Supercomputing, November 2000.
-
(2000)
Proceedings of Supercomputing
-
-
Browne, S.1
Dongarra, J.2
Garner, N.3
London, K.4
Mucci, P.5
-
7
-
-
84964748976
-
Compiler blockability of numerical algorithms
-
S. Carr and K. Kennedy. Compiler blockability of numerical algorithms. In Proceedings of Supercomputing, pages 114-124, 1992.
-
(1992)
Proceedings of Supercomputing
, pp. 114-124
-
-
Carr, S.1
Kennedy, K.2
-
8
-
-
0034832018
-
Exact analysis of the cache behavior of nested loops
-
Snowbird, UT, USA, June
-
S. Chatterjee, E. Parker, P. J. Hanlon, and A. R. Lebeck. Exact analysis of the cache behavior of nested loops. In Proceedings of the ACM SIGPLAN 2001 Conference on Programming Language Design and Implementation, pages 286-297, Snowbird, UT, USA, June 2001.
-
(2001)
Proceedings of the ACM SIGPLAN 2001 Conference on Programming Language Design and Implementation
, pp. 286-297
-
-
Chatterjee, S.1
Parker, E.2
Hanlon, P.J.3
Lebeck, A.R.4
-
10
-
-
0033189408
-
Memory hierarchy performance prediction for sparse blocked algorithms
-
March
-
B. B. Fraguela, R. Doallo, and E. L. Zapata. Memory hierarchy performance prediction for sparse blocked algorithms. Parallel Processing Letters, 9(3), March 1999.
-
(1999)
Parallel Processing Letters
, vol.9
, Issue.3
-
-
Fraguela, B.B.1
Doallo, R.2
Zapata, E.L.3
-
11
-
-
0001714824
-
Cache miss equations: A compiler framework for analyzing and tuning memory behavior
-
S. Ghosh, M. Martonosi, and S. Malik. Cache miss equations: a compiler framework for analyzing and tuning memory behavior. ACM Transactions on Programming Languages and Systems, 21(4):703-746, 1999.
-
(1999)
ACM Transactions on Programming Languages and Systems
, vol.21
, Issue.4
, pp. 703-746
-
-
Ghosh, S.1
Martonosi, M.2
Malik, S.3
-
12
-
-
0005271318
-
Towards realistic bounds for implicit CFD codes
-
W. D. Gropp, D. K. Kasushik, D. E. Keyes, and B. F. Smith. Towards realistic bounds for implicit CFD codes. In Proceedings of Parallel Computational Fluid Dynamics, pages 241-248, 1999.
-
(1999)
Proceedings of Parallel Computational Fluid Dynamics
, pp. 241-248
-
-
Gropp, W.D.1
Kasushik, D.K.2
Keyes, D.E.3
Smith, B.F.4
-
13
-
-
34547734670
-
Fracture mechanics on the Intel Itanium architecture: A case study
-
Austin, TX, December
-
G. Heber, A. J. Dolgert, M. Alt, K. A. Mazurkiewicz, and L. Stringer. Fracture mechanics on the Intel Itanium architecture: A case study. In Workshop on EPIC Architectures and Compiler Technology (ACM MICRO 34), Austin, TX, December 2001.
-
(2001)
Workshop on EPIC Architectures and Compiler Technology (ACM MICRO 34)
-
-
Heber, G.1
Dolgert, A.J.2
Alt, M.3
Mazurkiewicz, K.A.4
Stringer, L.5
-
14
-
-
35448943938
-
Flexible, high-performance matrix multiply via a self-modifying runtime code
-
University of Texas at Austin, December
-
G. M. Henry. Flexible, high-performance matrix multiply via a self-modifying runtime code. Technical Report TR-2001-46, University of Texas at Austin, December 2001.
-
(2001)
Technical Report TR-2001-46
-
-
Henry, G.M.1
-
15
-
-
22644452418
-
Modeling and improving locality for irregular problems: Sparse matrix-vector product on cache memories as a case study
-
D. B. Heras, V. B. Perez, J. C. C. Dominguez, and F. F. Rivera. Modeling and improving locality for irregular problems: sparse matrix-vector product on cache memories as a case study. In HPCN Europe, pages 201-210, 1999.
-
(1999)
HPCN Europe
, pp. 201-210
-
-
Heras, D.B.1
Perez, V.B.2
Dominguez, J.C.C.3
Rivera, F.F.4
-
17
-
-
84949647432
-
Optimizing sparse matrix computations for register reuse in SPARSITY
-
of LNCS, Springer, May
-
E.-J. Im and K. A. Yelick. Optimizing sparse matrix computations for register reuse in SPARSITY. In Proceedings of the International Conference on Computational Science, volume 2073 of LNCS, pages 127-136. Springer, May 2001.
-
(2001)
Proceedings of the International Conference on Computational Science, Volume
, vol.2073
, pp. 127-136
-
-
Im, E.-J.1
Yelick, K.A.2
-
20
-
-
0038998034
-
Memory bandwidth and machine balance in current high performance computers
-
December
-
J. D. McCalpin. Memory bandwidth and machine balance in current high performance computers. Newsletter of the IEEE Technical Committee on Computer Architecture, December 1995. http://tab.computer.org/tcca/NEWS/DEC95/DEC95.HTM.
-
(1995)
Newsletter of the IEEE Technical Committee on Computer Architecture
-
-
McCalpin, J.D.1
-
22
-
-
0030190854
-
Improving data locality with loop transformations
-
July
-
K. S. McKinley, S. Carr, and C.-W. Tseng. Improving data locality with loop transformations. ACM Transactions on Programming Languages and Systems, 18(4):424-453, July 1996.
-
(1996)
ACM Transactions on Programming Languages and Systems
, vol.18
, Issue.4
, pp. 424-453
-
-
McKinley, K.S.1
Carr, S.2
Tseng, C.-W.3
-
23
-
-
0029713939
-
Algorithms for sparse matrix computations on high-performance workstations
-
Philadelpha, PA, USA, May
-
J. J. Navarro, E. García, J. L. Larriba-Pey, and T. Juan. Algorithms for sparse matrix computations on high-performance workstations. In Proceedings of the 10th ACM International Conference on Supercomputing, pages 301-308, Philadelpha, PA, USA, May 1996.
-
(1996)
Proceedings of the 10th ACM International Conference on Supercomputing
, pp. 301-308
-
-
Navarro, J.J.1
García, E.2
Larriba-Pey, J.L.3
Juan, T.4
-
24
-
-
3042576437
-
Improving performance of sparse matrix-vector multiplication
-
A. Pinar and M. Heath. Improving performance of sparse matrix-vector multiplication. In Proceedings of Supercomputing, 1999.
-
(1999)
Proceedings of Supercomputing
-
-
Pinar, A.1
Heath, M.2
|