-
1
-
-
0013076873
-
Plapack: Parallel linear algebra package
-
Philip Alpatov, Greg Baker, Carter Edwards, John Gunnels, Greg Morrow, James Overfelt, Robert van de Geijn, Yuan-Jye J. Wu. PLAPACK: Parallel Linear Algebra Package. In Proc. of the SIAM Parallel Processing Conference, 1997.
-
(1997)
Proc. Of the SIAM Parallel Processing Conference
-
-
Alpatov, P.1
Baker, G.2
Edwards, C.3
Gunnels, J.4
Morrow, G.5
Overfelt, J.6
van de Geijn, R.7
Wu, Y.-J.J.8
-
2
-
-
0003660984
-
-
Technical Report - Revision 2.1.2, Argonne National Laboratory
-
Satish Balay, William D. Gropp, Lois Curfman McInnes, and Barry F. Smith. PETSc Users Manual. Technical Report ANL-95/11 - Revision 2.1.2, Argonne National Laboratory, 2002.
-
(2002)
PETSc Users Manual
-
-
Balay, S.1
Gropp, W.D.2
McInnes, L.C.3
Smith, B.F.4
-
3
-
-
0030661485
-
Optimizing matrix multiply using PHiPAC
-
J. Bilmes, K. Asanovic, C. Chin, and J. Demmel. Optimizing matrix multiply using PHiPAC. In Proc. ACM Intl. Conf. on Supercomputing, pp. 340-347, 1997.
-
(1997)
Proc. ACM Intl. Conf. On Supercomputing
, pp. 340-347
-
-
Bilmes, J.1
Asanovic, K.2
Chin, C.3
Demmel, J.4
-
5
-
-
0003510632
-
-
CCS-TR-99-157, IDA Center for Computing Sciences
-
W. Carlson, J. Draper, D. Culler, K. Yelick, E. Brooks, and K. Warren. Introduction to UPC and Language Specification. CCS-TR-99-157, IDA Center for Computing Sciences, 1999.
-
(1999)
Introduction to UPC and Language Specification
-
-
Carlson, W.1
Draper, J.2
Culler, D.3
Yelick, K.4
Brooks, E.5
Warren, K.6
-
6
-
-
0039255232
-
-
Technical Report University of Tennessee, Knoxville, Mar
-
J. Choi, J. Demmel, I. Dhillon, J. Dongarra, S. Ostrouchov, A. Petitet, K. Stanley, D. Walker, and R. C. Whaley. ScaLAPACK: A Portable Linear Algebra Library for Distributed Memory Computers - Design Issues and Performance. Technical Report CS-95-283, University of Tennessee, Knoxville, Mar. 1995.
-
(1995)
ScaLAPACK: A Portable Linear Algebra Library for Distributed Memory Computers - Design Issues and Performance
-
-
Choi, J.1
Demmel, J.2
Dhillon, I.3
Dongarra, J.4
Ostrouchov, S.5
Petitet, A.6
Stanley, K.7
Walker, D.8
Whaley, R.C.9
-
7
-
-
28844482601
-
Memory-constrained communication minimization for a class of array computations
-
Jul
-
D. Cociorva, G. Baumgartner, C.-C. Lam, P. Sadayappan, J. Ramanujam. Memory-Constrained Communication Minimization for a Class of Array Computations. In Proc. of the 15th International Workshop on Languages and Compilers for Parallel Computing, Jul. 2002.
-
(2002)
Proc. Of the 15th International Workshop on Languages and Compilers for Parallel Computing
-
-
Cociorva, D.1
Baumgartner, G.2
Lam, C.-C.3
Sadayappan, P.4
Ramanujam, J.5
-
8
-
-
0036041078
-
Space-time trade-off optimization for a class of electronic structure calculations
-
Jun
-
D. Cociorva, G. Baumgartner, C.-C. Lam, P. Sadayappan, J. Ramanujam, M. Nooijen, D.E. Bernholdt, and R. Harrison. Space-Time Trade-Off Optimization for a Class of Electronic Structure Calculations. In Proc. of the ACM SIGPLAN 2002 Conference on Programming Language Design and Implementation, Jun. 2002, pp. 177-186.
-
(2002)
Proc. Of the ACM SIGPLAN 2002 Conference on Programming Language Design and Implementation
, pp. 177-186
-
-
Cociorva, D.1
Baumgartner, G.2
Lam, C.-C.3
Sadayappan, P.4
Ramanujam, J.5
Nooijen, M.6
Bernholdt, D.E.7
Harrison, R.8
-
9
-
-
84947707755
-
Towards automatic synthesis of high-performance codes for electronic structure calculations: Data locality optimization
-
Springer-Verlag
-
D. Cociorva, J. Wilkins, G. Baumgartner, P. Sadayappan, J. Ramanujam, M. Nooijen, D.E. Bernholdt, and R. Harrison. Towards Automatic Synthesis of High-Performance Codes for Electronic Structure Calculations: Data Locality Optimization. Proc. of the Intl. Conf. on High Performance Computing, Dec. 2001, Lecture Notes in Computer Science, Vol. 2228, pp. 237-248, Springer-Verlag, 2001.
-
(2001)
Proc. Of the Intl. Conf. Of High Performance Computing, Dec. 2001, Lecture Notes in Computer Science
, vol.2228
, pp. 237-248
-
-
Cociorva, D.1
Wilkins, J.2
Baumgartner, G.3
Sadayappan, P.4
Ramanujam, J.5
Nooijen, M.6
Bernholdt, D.E.7
Harrison, R.8
-
10
-
-
0034836237
-
Loop optimizations for a class of memory-constrained computations
-
Jun
-
D. Cociorva, J. Wilkins, C.-C. Lam, G. Baumgartner, P. Sadayappan, J. Ramanujam. Loop Optimizations for a Class of Memory-Constrained Computations. In Proc. 15th ACM Intl. Conf. on Supercomputing, Jun. 2001, pp. 103-113.
-
(2001)
Proc. 15th ACM Intl. Conf. On Supercomputing
, pp. 103-113
-
-
Cociorva, D.1
Wilkins, J.2
Lam, C.-C.3
Baumgartner, G.4
Sadayappan, P.5
Ramanujam, J.6
-
12
-
-
0031636309
-
FFTW: An adaptive software architecture for the FFT
-
M. Frigo and S. Johnson. FFTW: An Adaptive Software Architecture for the FFT. In Proc. ICASSP 98, Vol. 3, pp. 1381-1384, 1998, http://www.fftw.org.
-
(1998)
Proc. ICASSP 98
, vol.3
, pp. 1381-1384
-
-
Frigo, M.1
Johnson, S.2
-
13
-
-
0038565371
-
Broadway: A software architecture for scientific computing
-
R.F. Boisvert and T. Tang (eds.), Kluwer Academic Press
-
Samuel Guyer and Calvin Lin. Broadway: A Software Architecture for Scientific Computing. In The Architecture of Scientific Software, R.F. Boisvert and P.T.P. Tang (eds.), Kluwer Academic Press, 2000, pp. 175-192.
-
(2000)
The Architecture of Scientific Software
, pp. 175-192
-
-
Guyer, S.1
Lin, C.2
-
14
-
-
0004341271
-
-
Version 3.3, Pacific Northwest National Laboratory, Richland, WA 99352
-
High Performance Computational Chemistry Group. NWChem, A Computational Chemistry Package for Parallel Computers, Version 3.3, 1999. Pacific Northwest National Laboratory, Richland, WA 99352.
-
(1999)
NWChem, A Computational Chemistry Package for Parallel Computers
-
-
-
16
-
-
0035707468
-
Telescoping languages: A strategy for automatic generation of scientific problem-solving systems from annotated libraries
-
K. Kennedy, B. Broo, K. Cooper, J. Dongarra, R. Fowler, D. Gannon, L. Johnsson, J. Mellor-Crummey, and L. Torczon. Telescoping Languages: A Strategy for Automatic Generation of Scientific Problem-Solving Systems from Annotated Libraries. J. Parallel and Distributed Computing, 2001.
-
(2001)
J. Parallel and Distributed Computing
-
-
Kennedy, K.1
Broo, B.2
Cooper, K.3
Dongarra, J.4
Fowler, R.5
Gannon, D.6
Johnsson, L.7
Mellor-Crummey, J.8
Torczon, L.9
-
18
-
-
22844454114
-
Memory-optimal evaluation of expression trees involving large objects
-
Springer-Verlag
-
C.-C. Lam, D. Cociorva, G. Baumgartner, and P. Sadayappan. Memory-Optimal Evaluation of Expression Trees Involving Large Objects. In Intl. Conf. on High Performance Computing, Dec. 1999, Lecture Notes in Computer Science, Vol. 1745, Springer-Verlag, 1999.
-
(1999)
Intl. Conf. On High Performance Computing, Dec. 1999, Lecture Notes in Computer Science
, vol.1745
-
-
Lam, C.-C.1
Cociorva, D.2
Baumgartner, G.3
Sadayappan, P.4
-
19
-
-
33745180598
-
Optimization of memory usage for a class of loops implementing multi-dimensional integrals
-
Aug. Lecture Notes in Computer Science,. Springer-Verlag, 1999
-
C.-C. Lam, D. Cociorva, G. Baumgartner, and P. Sadayappan. Optimization of Memory Usage for a Class of Loops Implementing Multi-Dimensional Integrals. In Languages and Compilers for Parallel Computing, Aug. 1999, Lecture Notes in Computer Science, Vol. 1863, Springer-Verlag, 1999.
-
(1999)
Languages and Compilers for Parallel Computing
, vol.1863
-
-
Lam, C.-C.1
Cociorva, D.2
Baumgartner, G.3
Sadayappan, P.4
-
20
-
-
0000523695
-
On optimizing a class of multi-dimensional loops with reductions for parallel execution
-
C.-C. Lam, P. Sadayappan, and R. Wenger. On Optimizing a Class of Multi-Dimensional Loops with Reductions for Parallel Execution. Parallel Processing Letters, Vol. 7, No. 2, pp. 157-168, 1997.
-
(1997)
Parallel Processing Letters
, vol.7
, Issue.2
, pp. 157-168
-
-
Lam, C.-C.1
Sadayappan, P.2
Wenger, R.3
-
23
-
-
0000533836
-
-
R. Schleyer, Eds, Wiley & Sons, Berne (Switzerland).
-
J. M. L. Martin. In P. v. R. Schleyer, P. R. Schreiner, N. L. Allinger, T. Clark, J. Gasteiger, P. Kollman, H. F. Schaefer III (Eds.), Encyclopedia of Computational Chemistry, Wiley & Sons, Berne (Switzerland). Vol. 1, pp. 115-128, 1998.
-
(1998)
Encyclopedia of Computational Chemistry
, vol.1
, pp. 115-128
-
-
Martin, J.M.L.1
Schreiner, P.R.2
Allinger, N.L.3
Clark, T.4
Gasteiger, J.5
Kollman, P.6
Schaefer, H.F.7
-
27
-
-
84949653778
-
Automatic performance tuning in the UHFFT library
-
Springer-Verlag
-
D. Mirkovic and L. Johnsson. Automatic Performance Tuning in the UHFFT Library. In Proc. International Conference on Computational Science, Lecture Notes in Computer Science, Vol. 2073, pp. 71-80, Springer-Verlag, 2001.
-
(2001)
Proc. International Conference on Computational Science, Lecture Notes in Computer Science
, vol.2073
, pp. 71-80
-
-
Mirkovic, D.1
Johnsson, L.2
-
28
-
-
0004014209
-
-
J. Moura, J. Johnson, R. Johnson, D. Padua, V. Prasanna, M. Puschel, and M. Veloso. SPIRAL: Portable Library of Optimized Signal Processing Algorithms, 1998. http://www.ece.cmu.edu/~spiral.
-
(1998)
SPIRAL: Portable Library of Optimized Signal Processing Algorithms
-
-
Moura, J.1
Johnson, J.2
Johnson, R.3
Padua, D.4
Prasanna, V.5
Puschel, M.6
Veloso, M.7
-
29
-
-
0030157365
-
Global arrays: A nonuniform memory access programming model for high-performance computers
-
J. Nieplocha, R. J. Harrison, and R. J. Littlefield. Global Arrays: A Nonuniform Memory Access Programming Model for High-Performance Computers. The Journal of Supercomputing, Vol. 10, pp. 197-220, 1996.
-
(1996)
The Journal of Supercomputing
, vol.10
, pp. 197-220
-
-
Nieplocha, J.1
Harrison, R.J.2
Littlefield, R.J.3
-
30
-
-
0002081678
-
Co-array Fortran for parallel programming
-
R.W. Numrich and J.K. Reid. Co-Array Fortran for Parallel Programming. Fortran Forum, Vol. 17, No. 2, 1998.
-
(1998)
Fortran Forum
, vol.17
, Issue.2
-
-
Numrich, R.W.1
Reid, J.K.2
-
36
-
-
0032155556
-
Titanium: A high-performance Java dialect
-
Sept.-Nov
-
K. A. Yelick, L. Semenzato, G. Pike, C. Miyamoto, B. Liblit, A. Krishnamurthy, P. N. Hilfinger, S. L. Graham, D. Gay, P. Colella, and A. Aiken. Titanium: A High-Performance Java Dialect, Concurrency: Practice and Experience, Vol. 10, No. 11-13, Sept.-Nov. 1998.
-
(1998)
Concurrency: Practice and Experience
, vol.10
, Issue.11-13
-
-
Yelick, K.A.1
Semenzato, L.2
Pike, G.3
Miyamoto, C.4
Liblit, B.5
Krishnamurthy, A.6
Hilfinger, P.N.7
Graham, S.L.8
Gay, D.9
Colella, P.10
Aiken, A.11
|