-
1
-
-
0016567155
-
A computer algorithm for transposing nonsquare arrays
-
W. O. Alltop. A computer algorithm for transposing nonsquare arrays. IEEE Transactions on Computers, 24(10):1038-1040, 1975.
-
(1975)
IEEE Transactions on Computers
, vol.24
, Issue.10
, pp. 1038-1040
-
-
Alltop, W.O.1
-
2
-
-
0018907742
-
A stepwise approach to computing the multidimensional fast fourier transform of large arrays
-
G. L. Anderson. A stepwise approach to computing the multidimensional fast Fourier transform of large arrays. IEEE Transactions on Acoustics and Speech Signal Processing, 28(3):280-284, 1980.
-
(1980)
IEEE Transactions on Acoustics and Speech Signal Processing
, vol.28
, Issue.3
, pp. 280-284
-
-
Anderson, G.L.1
-
3
-
-
0025403252
-
FFTs in external or hierarchical memory
-
D. H. Bailey. FFTs in external or hierarchical memory. Journal of Supercomputing, 4(1):23-35, 1990.
-
(1990)
Journal of Supercomputing
, vol.4
, Issue.1
, pp. 23-35
-
-
Bailey, D.H.1
-
4
-
-
19944430447
-
A high-level approach to synthesis of high-performance codes for quantum chemistry
-
G. Baumgartner, D. Bernholdt, D. Cociorva, R. Harrison, S. Hirata, C. Lam, M. Nooijen, R. Pitzer, J. Ramanujam, and P. Sadayappan. A high-level approach to synthesis of high-performance codes for quantum chemistry. In Proceedings of Supercomputing 2002, 2003.
-
(2003)
Proceedings of Supercomputing 2002
-
-
Baumgartner, G.1
Bernholdt, D.2
Cociorva, D.3
Harrison, R.4
Hirata, S.5
Lam, C.6
Nooijen, M.7
Pitzer, R.8
Ramanujam, J.9
Sadayappan, P.10
-
5
-
-
0029254155
-
Myrinet: A gigabit-per-second local area network
-
February
-
N. J. Boden, D. Cohen, R. E. Felderman, A. E. Kulawik, C. L. Seitz, J. N. Seizovic, and W. Su. Myrinet: A gigabit-per-second local area network. IEEE Micro, 15(1):29-36, February 1995.
-
(1995)
IEEE Micro
, vol.15
, Issue.1
, pp. 29-36
-
-
Boden, N.J.1
Cohen, D.2
Felderman, R.E.3
Kulawik, A.E.4
Seitz, C.L.5
Seizovic, J.N.6
Su, W.7
-
6
-
-
0036041078
-
Space-time trade-off optimization for a class of electronic structure calculations
-
D. Cociorva, G. Baumgartner, C. Lam, P. Sadayappan, J. Ramanujam, M. Nooijen, D. Bernholdt, and R. Harrison. Space-time trade-off optimization for a class of electronic structure calculations. In Proc. of ACM SIG-PLAN 2002 Conference on Programming Language Design and Implementation (PLDI), 2002.
-
(2002)
Proc. of ACM SIG-PLAN 2002 Conference on Programming Language Design and Implementation (PLDI)
-
-
Cociorva, D.1
Baumgartner, G.2
Lam, C.3
Sadayappan, P.4
Ramanujam, J.5
Nooijen, M.6
Bernholdt, D.7
Harrison, R.8
-
7
-
-
84947277042
-
Global communication optimization for tensor contraction expressions under memory constraints
-
D. Cociorva, X. Gao, S. Krishnan, G. Baumgartner, C. Lam, P. Sadayappan, and J. Ramanujam. Global communication optimization for tensor contraction expressions under memory constraints. In Proc. of 17th International Parallel & Distributed Processing Symposium (IPDPS), 2003.
-
(2003)
Proc. of 17th International Parallel & Distributed Processing Symposium (IPDPS)
-
-
Cociorva, D.1
Gao, X.2
Krishnan, S.3
Baumgartner, G.4
Lam, C.5
Sadayappan, P.6
Ramanujam, J.7
-
8
-
-
12444314400
-
Towards automatic synthesis of high-performance codes for electronic structure calculations: Data locality optimization
-
D. Cociorva, J. Wilkins, G. Baumgartner, P. Sadayappan, J. Ramanujam, M. Nooijen, D. Bernholdt, and R. Harrison. Towards automatic synthesis of high-performance codes for electronic structure calculations: Data locality optimization. In Proc. of the Intl. Conf. on High Performance Computing, 2001.
-
(2001)
Proc. of the Intl. Conf. on High Performance Computing
-
-
Cociorva, D.1
Wilkins, J.2
Baumgartner, G.3
Sadayappan, P.4
Ramanujam, J.5
Nooijen, M.6
Bernholdt, D.7
Harrison, R.8
-
9
-
-
0032057868
-
Asymptotically tight bounds for performing BMMC permutations on parallel disk systems
-
T. H. Cormen, T. Sundquist, and L. F. Wisniewski. Asymptotically tight bounds for performing BMMC permutations on parallel disk systems. SIAM Journal on Computing, 28(1):105-136, 1998.
-
(1998)
SIAM Journal on Computing
, vol.28
, Issue.1
, pp. 105-136
-
-
Cormen, T.H.1
Sundquist, T.2
Wisniewski, L.F.3
-
10
-
-
0028750115
-
Index transformation algorithms in a linear algebra framework
-
A. Edelman, S. Heller, and S. L. Johnsson. Index transformation algorithms in a linear algebra framework. IEEE Transactions on Parallel and Distributed Systems, 5(12):1302-1309, 1994.
-
(1994)
IEEE Transactions on Parallel and Distributed Systems
, vol.5
, Issue.12
, pp. 1302-1309
-
-
Edelman, A.1
Heller, S.2
Johnsson, S.L.3
-
11
-
-
0000011164
-
A fast computer method for matrix transposing
-
J. O. Eklundh. A fast computer method for matrix transposing. IEEE Transactions on Computers, 20(7):801-803, 1972.
-
(1972)
IEEE Transactions on Computers
, vol.20
, Issue.7
, pp. 801-803
-
-
Eklundh, J.O.1
-
12
-
-
0027719668
-
Efficient transposition algorithms for large matrices
-
ACM Press
-
S. D. Kaushik, C.-H. Huang, R. W. Johnson, P. Sadayappan, and J. R. Johnson. Efficient transposition algorithms for large matrices. In Proceedings of the 1993 ACM/IEEE conference on Supercomputing, pages 656-665. ACM Press, 1993.
-
(1993)
Proceedings of the 1993 ACM/IEEE Conference on Supercomputing
, pp. 656-665
-
-
Kaushik, S.D.1
Huang, C.-H.2
Johnson, R.W.3
Sadayappan, P.4
Johnson, J.R.5
-
13
-
-
12444319714
-
On efficient out-of-core matrix transposition
-
The Ohio State University, Sept
-
S. Krishnamoorthy, G. Baumgartner, D. Cociorva, C. Lam, and P. Sadayappan. On efficient out-of-core matrix transposition. Technical Report OSU-CIRSC-9/03-T52, School of Computer and Information Science, The Ohio State University, Sept 2003.
-
(2003)
Technical Report OSU-CIRSC-9/03-T52, School of Computer and Information Science
-
-
Krishnamoorthy, S.1
Baumgartner, G.2
Cociorva, D.3
Lam, C.4
Sadayappan, P.5
-
14
-
-
84944907858
-
-
NWChem. http://www.emsl.pnl.gov:2080/docs/nwchem/nwchem.html.
-
NWChem
-
-
-
15
-
-
84944917949
-
-
Ohio Supercomputing Center. http://www.osc.edu.
-
-
-
-
16
-
-
0016657275
-
A generalization of eklundh's algorithm for transposing large matrices
-
H. K. Ramapriyan. A generalization of Eklundh's algorithm for transposing large matrices. IEEE Transactions on Computers, 24(12):1221-1226, 1975.
-
(1975)
IEEE Transactions on Computers
, vol.24
, Issue.12
, pp. 1221-1226
-
-
Ramapriyan, H.K.1
-
17
-
-
0036538429
-
An efficient algorithm for out-of-core matrix transposition
-
April
-
J. Suh and V. K. Prasanna. An efficient algorithm for out-of-core matrix transposition. IEEE Transactions on Computers, 51(4):420-438, April 2002.
-
(2002)
IEEE Transactions on Computers
, vol.51
, Issue.4
, pp. 420-438
-
-
Suh, J.1
Prasanna, V.K.2
-
19
-
-
0016994662
-
An extension of eklundh's matrix transposition algorithm and its application to digital signal processing
-
R. E. Twogood and M. P. Ekstrom. An extension of Eklundh's matrix transposition algorithm and its application to digital signal processing. IEEE Transactions on Computers, 25(12):950-952, 1976.
-
(1976)
IEEE Transactions on Computers
, vol.25
, Issue.12
, pp. 950-952
-
-
Twogood, R.E.1
Ekstrom, M.P.2
-
20
-
-
0028484243
-
Algorithms for parallel memory I: Two-level memories
-
J. S. Vitter and E. A. M. Shriver. Algorithms for parallel memory I: Two-level memories. Algorithmica, 12(2-3):110-147, 1994.
-
(1994)
Algorithmica
, vol.12
, Issue.2-3
, pp. 110-147
-
-
Vitter, J.S.1
Shriver, E.A.M.2
|