-
5
-
-
12444342261
-
Optimization of memory usage and communication requirements for a class of loops implementing multidimensional integrals
-
C. Lam, D. Cociorva, G. Baumgartner and P. Sadayappan. Optimization of Memory Usage and Communication Requirements for a Class of Loops Implementing MultiDimensional Integrals. In Proc. of Twelfth LCPC Workshop, 1999.
-
(1999)
Proc. of Twelfth LCPC Workshop
-
-
Lam, C.1
Cociorva, D.2
Baumgartner, G.3
Sadayappan, P.4
-
6
-
-
0000523695
-
On optimizing a class of multi-dimensional loops with reductions for parallel execution
-
C. Lam, P. Sadayappan and R. Wenger. On Optimizing a Class of Multi-Dimensional Loops with Reductions for Parallel Execution. Parallel Processing Letters, 7(2):157-168, 1997.
-
(1997)
Parallel Processing Letters
, vol.7
, Issue.2
, pp. 157-168
-
-
Lam, C.1
Sadayappan, P.2
Wenger, R.3
-
8
-
-
0036041078
-
Space-time trade-off optimization for a class of electronic structure calculations
-
D. Cociorva, G. Baumgartner, C. Lam, P. Sadayappan, J. Ramanujam, M. Nooijen, D. Bernholdt, and R. Harrison. Space-Time Trade-Off Optimization for a Class of Electronic Structure Calculations. In Proc. of ACM SIGPLAN 2002 Conference on Programming Language Design and Implementation (PLDI), pages 177-186, 2002.
-
(2002)
Proc. of ACM SIGPLAN 2002 Conference on Programming Language Design and Implementation (PLDI)
, pp. 177-186
-
-
Cociorva, D.1
Baumgartner, G.2
Lam, C.3
Sadayappan, P.4
Ramanujam, J.5
Nooijen, M.6
Bernholdt, D.7
Harrison, R.8
-
9
-
-
0005363622
-
Loop optimization for a class of memory-constrained computations
-
D. Cociorva, J. Wilkins, C. Lam, G. Baumgartner, P. Sadayappan, and J. Ramanujam. Loop optimization for a class of memory-constrained computations. In Proc. of the Fifteenth ACM International Conference on Supercomputing (ICS'01), pages 500-509, 2001.
-
(2001)
Proc. of the Fifteenth ACM International Conference on Supercomputing (ICS'01)
, pp. 500-509
-
-
Cociorva, D.1
Wilkins, J.2
Lam, C.3
Baumgartner, G.4
Sadayappan, P.5
Ramanujam, J.6
-
10
-
-
84947707755
-
Towards automatic synthesis of high-performance codes for electronic structure calculations: Data locality optimization
-
Springer-Verlag
-
D. Cociorva, J. Wilkins, G. Baumgartner, P. Sadayappan, J. Ramanujam, M. Nooijen, D. E. Bernholdt, and R. Harrison. Towards Automatic Synthesis of High-Performance Codes for Electronic Structure Calculations: Data Locality Optimization. In Proc. of the Intl. Conf. on High Performance Computing, volume 2228, pages 237-248. Springer-Verlag, 2001.
-
(2001)
Proc. of the Intl. Conf. on High Performance Computing
, vol.2228
, pp. 237-248
-
-
Cociorva, D.1
Wilkins, J.2
Baumgartner, G.3
Sadayappan, P.4
Ramanujam, J.5
Nooijen, M.6
Bernholdt, D.E.7
Harrison, R.8
-
11
-
-
84947277042
-
Global communication optimization for tensor contraction expressions under memory constraints
-
D. Cociorva, X. Gao, S. Krishnan, G. Baumgartner, C. Lam, P. Sadayappan, J. Ramanujam. Global Communication Optimization for Tensor Contraction Expressions under Memory Constraints. In Proc. of Seventeenth International Parallel and Distributed Processing Symposium (IPDPS), 2003.
-
(2003)
Proc. of Seventeenth International Parallel and Distributed Processing Symposium (IPDPS)
-
-
Cociorva, D.1
Gao, X.2
Krishnan, S.3
Baumgartner, G.4
Lam, C.5
Sadayappan, P.6
Ramanujam, J.7
-
13
-
-
85117163262
-
A high-level approach to synthesis of high-performance codes for quantum chemistry
-
November
-
G. Baumgartner and D.E. Bernholdt and D. Cociorva and R. Harrison and S. Hirata and C. Lam and M. Nooijen and R. Pitzer and J. Ramanujam and P. Sadayappan, A High-Level Approach to Synthesis of High-Performance Codes for Quantum Chemistry. In Proc. of Supercomputing 2002, November 2002.
-
(2002)
Proc. of Supercomputing 2002
-
-
Baumgartner, G.1
Bernholdt, D.E.2
Cociorva, D.3
Harrison, R.4
Hirata, S.5
Lam, C.6
Nooijen, M.7
Pitzer, R.8
Ramanujam, J.9
Sadayappan, P.10
-
18
-
-
0000533836
-
-
P. v. R. Schleyer, P. R. Schreiner, N. L. Allinger, T. Clark, J. Gasteiger, P. Kollman, H. F. Schaefer III (Eds.)
-
J. M. L. Martin. In P. v. R. Schleyer, P. R. Schreiner, N. L. Allinger, T. Clark, J. Gasteiger, P. Kollman, H. F. Schaefer III (Eds.). Encyclopedia of Computational Chemistry, 1:115-128, 1998.
-
(1998)
Encyclopedia of Computational Chemistry
, vol.1
, pp. 115-128
-
-
Martin, J.M.L.1
-
20
-
-
0030157365
-
Global arrays: A nonuniform memory access programming model for high-performance computers
-
J. Nieplocha, I. J. Harrison and R. J. Littlefield. Global Arrays: A Nonuniform Memory Access Programming Model for High-Performance Computers. The Journal of Supercomputing, 10:197-220, 1996.
-
(1996)
The Journal of Supercomputing
, vol.10
, pp. 197-220
-
-
Nieplocha, J.1
Harrison, I.J.2
Littlefield, R.J.3
-
21
-
-
0030190854
-
Improving data locality with loop transformations
-
July
-
K. S. McKinley, S. Carr and C.-W. Tseng. Improving Data Locality with Loop Transformations. ACM TOPLAS, 18(4):424-453, July 1996.
-
(1996)
ACM TOPLAS
, vol.18
, Issue.4
, pp. 424-453
-
-
McKinley, K.S.1
Carr, S.2
Tseng, C.-W.3
-
24
-
-
0036500189
-
An I/O conscious tiling strategy for disk-resident data sets
-
M. Kandemir, A. Choudhary, and J. Ramanujam. An I/O conscious tiling strategy for disk-resident data sets. The Journal of Supercomputing, 21(3):257-284, 2002.
-
(2002)
The Journal of Supercomputing
, vol.21
, Issue.3
, pp. 257-284
-
-
Kandemir, M.1
Choudhary, A.2
Ramanujam, J.3
-
25
-
-
0034228831
-
A unified framework for optimizing locality, parallelism, and communication in out-of-core computations
-
July
-
M. Kandemir, A. Choudhary, J. Ramanujam, and M. Kandaswamy. A unified framework for optimizing locality, parallelism, and communication in out-of-core computations. IEEE Transactions of Parallel and Distributed Systems, 11(7):648-668, July 2000.
-
(2000)
IEEE Transactions of Parallel and Distributed Systems
, vol.11
, Issue.7
, pp. 648-668
-
-
Kandemir, M.1
Choudhary, A.2
Ramanujam, J.3
Kandaswamy, M.4
-
26
-
-
0032066688
-
Compilation techniques for out-of-core parallel computations
-
June
-
M. Kandemir, A. Choudhary, J. Ramanujam and R. Bordawekar. Compilation techniques for out-of-core parallel computations. Parallel Computing, 24(3-4):597-628, June 1998.
-
(1998)
Parallel Computing
, vol.24
, Issue.3-4
, pp. 597-628
-
-
Kandemir, M.1
Choudhary, A.2
Ramanujam, J.3
Bordawekar, R.4
-
27
-
-
0003831392
-
Compiler support for out-of-core arrays on parallel machines
-
Rice University, Houston, TX, December
-
M. Paleczny, K. Kennedy, and C. Koelbel. Compiler Support for Out-of-Core Arrays on Parallel Machines. Technical Report 94509-S, Rice University, Houston, TX, December 1994.
-
(1994)
Technical Report 94509-S
-
-
Paleczny, M.1
Kennedy, K.2
Koelbel, C.3
-
30
-
-
0032308685
-
Quantifying the multi-level nature of tiling interactions
-
June
-
N. Mitchell, K. Högstedt, L. Carter, and J. Ferrante. Quantifying the multi-level nature of tiling interactions. Intl. Journal of Parallel Programming, 26(6):641-670, June 1998.
-
(1998)
Intl. Journal of Parallel Programming
, vol.26
, Issue.6
, pp. 641-670
-
-
Mitchell, N.1
Högstedt, K.2
Carter, L.3
Ferrante, J.4
-
33
-
-
0029181135
-
A model and compilation strategy for out-of-core data-parallel programs
-
R. Bordawekar, A. Choudhary, K. Kennedy, C. Koelbel, and M. Paleczny, A Model and Compilation Strategy for Out-of-Core Data-Parallel Programs. In Proc. of the Fifth ACM Symposium on Principles and Practice of Parallel Programming, 1995.
-
(1995)
Proc. of the Fifth ACM Symposium on Principles and Practice of Parallel Programming
-
-
Bordawekar, R.1
Choudhary, A.2
Kennedy, K.3
Koelbel, C.4
Paleczny, M.5
-
34
-
-
84877054900
-
PASSION runtime library for parallel I/O
-
R. Thakur, R. Bordawekar, A. Choudhary, R. Ponnusamy, and T. Singh. PASSION Runtime Library for Parallel I/O. In Proc. of Scalable Parallel Libraries Conference, pages 119-128, 1994.
-
(1994)
Proc. of Scalable Parallel Libraries Conference
, pp. 119-128
-
-
Thakur, R.1
Bordawekar, R.2
Choudhary, A.3
Ponnusamy, R.4
Singh, T.5
-
37
-
-
12444319714
-
On efficient Out-of-core matrix transposition
-
The Ohio State University, Columbus, OH, September
-
S. Krishnamoorthy, G. Baumgartner, D. Cociorva, C. Lam and P. Sadayappan. On Efficient Out-of-core Matrix Transposition. Technical Report OSU-CIRSC-9/03-T52, The Ohio State University, Columbus, OH, September 2003.
-
(2003)
Technical Report
, vol.OSU-CIRSC-9-03-T52
-
-
Krishnamoorthy, S.1
Baumgartner, G.2
Cociorva, D.3
Lam, C.4
Sadayappan, P.5
-
39
-
-
12444287248
-
Data locality optimization for synthesis of efficient out-of-core algorithms
-
S. Krishnan, S. Krishnamoorthy, G. Baumgartner, D. Cociorva, C. Lam, P. Sadayappan, J. Ramanujam, D. E. Bernholdt, and V. Choppella. Data Locality Optimization for Synthesis of Efficient Out-of-Core Algorithms. In Proc. of the Intl. Conf. on High Performance Computing, 2003.
-
(2003)
Proc. of the Intl. Conf. on High Performance Computing
-
-
Krishnan, S.1
Krishnamoorthy, S.2
Baumgartner, G.3
Cociorva, D.4
Lam, C.5
Sadayappan, P.6
Ramanujam, J.7
Bernholdt, D.E.8
Choppella, V.9
-
40
-
-
0003570955
-
ViC*: A preprocessor for virtualmemory C*
-
Dartmouth College, November
-
T. Cormen and A. Colvin. ViC*: A Preprocessor for VirtualMemory C*. Technical Report PCS-TR94-243, Dartmouth College, November 1994.
-
(1994)
Technical Report
, vol.PCS-TR94-243
-
-
Cormen, T.1
Colvin, A.2
-
44
-
-
12444302637
-
-
The Ohio Supercomputer Center
-
The Ohio Supercomputer Center.
-
-
-
-
48
-
-
0032635362
-
New tiling techniques to improve cache temporal locality
-
Y. Song and Z. Li. New Tiling Techniques to Improve Cache Temporal Locality. In Proc. of ACM SIGPLAN PLDI, 1999.
-
(1999)
Proc. of ACM SIGPLAN PLDI
-
-
Song, Y.1
Li, Z.2
|