-
2
-
-
85086807294
-
Synthesizing transformations for locality enhancement of imperfectly nested loops
-
N. Ahmed, N. Mateev, and K. Pingali. Synthesizing transformations for locality enhancement of imperfectly nested loops. In Proc. of ACM ICS, 2000.
-
(2000)
Proc. of ACM ICS
-
-
Ahmed, N.1
Mateev, N.2
Pingali, K.3
-
5
-
-
26444441816
-
A high-level approach to synthesis of high-performance codes for quantum chemistry
-
G. Baumgartner, D. Bernholdt, D. Cociorva, R. Harrison, S. Hirata, C. Lam, M. Nooijen, R. Pitzer, J. Ramanujam, and P. Sadayappan. A High-Level Approach to Synthesis of High-Performance Codes for Quantum Chemistry. In Proc. of SC, 2002.
-
(2002)
Proc. of SC
-
-
Baumgartner, G.1
Bernholdt, D.2
Cociorva, D.3
Harrison, R.4
Hirata, S.5
Lam, C.6
Nooijen, M.7
Pitzer, R.8
Ramanujam, J.9
Sadayappan, P.10
-
6
-
-
1142268809
-
Estimating cache misses and locality using stack distances
-
C. Cascaval and D. A. Padua. Estimating cache misses and locality using stack distances. In Proc. of ICS, 2003.
-
(2003)
Proc. of ICS
-
-
Cascaval, C.1
Padua, D.A.2
-
8
-
-
0036041078
-
Space-time trade-off optimization for a class of electronic structure calculations
-
D. Cociorva, G. Baumgartner, C. Lam, P. Sadayappan, J. Ramanujam, M. Nooijen, D. Bernholdt, and R. Harrison. Space-Time Trade-Off Optimization for a Class of Electronic Structure Calculations. In P roc. of PLDI, 2002.
-
(2002)
P Roc. of PLDI
-
-
Cociorva, D.1
Baumgartner, G.2
Lam, C.3
Sadayappan, P.4
Ramanujam, J.5
Nooijen, M.6
Bernholdt, D.7
Harrison, R.8
-
9
-
-
84947277042
-
Global communication optimization for tensor contraction expressions under memory constraints
-
D. Cociorva, X. Gao, S. Krishnan, G. Baumgartner, C. Lam, P. Sadayappan, and J. Ramanujam. Global Communication Optimization for Tensor Contraction Expressions under Memory Constraints. In Proc. of IPDPS, 2003.
-
(2003)
Proc. of IPDPS
-
-
Cociorva, D.1
Gao, X.2
Krishnan, S.3
Baumgartner, G.4
Lam, C.5
Sadayappan, P.6
Ramanujam, J.7
-
10
-
-
23044531284
-
Towards automatic synthesis of high-performance codes for electronic structure calculations: Data locality optimization
-
D. Cociorva, J. Wilkins, G. Baumgartner, P. Sadayappan, J. Ramanujam, M. Nooijen, D. E. Bernholdt, and R. Harrison. Towards Automatic Synthesis of High-Performance Codes for Electronic Structure Calculations: Data Locality Optimization. In Proc. of HiPC, 2001.
-
(2001)
Proc. of HiPC
-
-
Cociorva, D.1
Wilkins, J.2
Baumgartner, G.3
Sadayappan, P.4
Ramanujam, J.5
Nooijen, M.6
Bernholdt, D.E.7
Harrison, R.8
-
11
-
-
0034836237
-
Loop optimization for a class of memory-constrained computations
-
D. Cociorva, J. Wilkins, C. Lam, G. Baumgartner, P. Sadayappan, and J. Ramanujam. Loop optimization for a class of memory-constrained computations. In Proc. of ICS, 2001.
-
(2001)
Proc. of ICS
-
-
Cociorva, D.1
Wilkins, J.2
Lam, C.3
Baumgartner, G.4
Sadayappan, P.5
Ramanujam, J.6
-
12
-
-
0031611719
-
Precise miss analysis for program transformations with caches of arbitrary associativity
-
ACM Press
-
S. Ghosh, M. Martonosi, and S. Malik. Precise miss analysis for program transformations with caches of arbitrary associativity. In Proc. of ASPLOS, pages 228-239. ACM Press, 1998.
-
(1998)
Proc. of ASPLOS
, pp. 228-239
-
-
Ghosh, S.1
Martonosi, M.2
Malik, S.3
-
13
-
-
0001714824
-
Cache miss equations: A compiler framework for analyzing and tuning memory behavior
-
S. Ghosh, M. Martonosi, and S. Malik. Cache miss equations: a compiler framework for analyzing and tuning memory behavior. ACM Trans. Program. Lang. Syst., 21(4), 1999.
-
(1999)
ACM Trans. Program. Lang. Syst.
, vol.21
, Issue.4
-
-
Ghosh, S.1
Martonosi, M.2
Malik, S.3
-
17
-
-
33746313698
-
Optimization of memory usage and communication requirements for a class of loops implementing multi-dimensional integrals
-
C. Lam, D. Cociorva, G. Baumgartner, and P. Sadayappan. Optimization of Memory Usage and Communication Requirements for a Class of Loops Implementing Multi-Dimensional Integrals. In Proc. of LCPC Workshop, 1999.
-
(1999)
Proc. of LCPC Workshop
-
-
Lam, C.1
Cociorva, D.2
Baumgartner, G.3
Sadayappan, P.4
-
18
-
-
0000523695
-
On optimizing a class of multi-dimensional loops with reductions for parallel execution
-
C. Lam, P. Sadayappan, and R. Wenger. On Optimizing a Class of Multi-Dimensional Loops with Reductions for Parallel Execution. Parallel Processing Letters, 7(2):, 1997.
-
(1997)
Parallel Processing Letters
, vol.7
, Issue.2
-
-
Lam, C.1
Sadayappan, P.2
Wenger, R.3
-
20
-
-
0034823777
-
Blocking and array contraction across arbitrarily nested loops using affine partitioning
-
A. W. Lim, S.-W. Liao, and M. S. Lam. Blocking and array contraction across arbitrarily nested loops using affine partitioning. In Proc. of PPoP, page., 2001.
-
(2001)
Proc. of PPoP
-
-
Lim, A.W.1
Liao, S.-W.2
Lam, M.S.3
-
21
-
-
33746273002
-
Improving data locality with loop transformations
-
K. S. McKinley, S. Carr, and C.-W. Tseng. Improving Data Locality with Loop Transformations. ACM TOPLAS, page, 1996.
-
(1996)
ACM TOPLAS
-
-
McKinley, K.S.1
Carr, S.2
Tseng, C.-W.3
-
22
-
-
33746277058
-
Cache miss characterization and data locality optimization for imperfectly nested loops on shared memory multiprocessors
-
OSU, Columbus, OH
-
S. K. Sahoo, R. Panuganti, S. Krishnamoorthy, and P. Sadayappan. Cache miss characterization and data locality optimization for imperfectly nested loops on shared memory multiprocessors. Technical Report OSU-CISRC-1/05-TR03, OSU, Columbus, OH, 2005.
-
(2005)
Technical Report
, vol.OSU-CISRC-1-05-TR03
-
-
Sahoo, S.K.1
Panuganti, R.2
Krishnamoorthy, S.3
Sadayappan, P.4
|