-
1
-
-
0030661483
-
Optimizing collective I/O performance on parallel computers: A multisystem study
-
Chen, Y., Foster, I., Nieplocha, J., Winslett, W. Optimizing collective I/O performance on parallel computers: A multisystem study. In: 11th ACM Intl. Conf. on Supercomputing. (1997)
-
(1997)
11th ACM Intl. Conf. on Supercomputing.
-
-
Chen, Y.1
Foster, I.2
Nieplocha, J.3
Winslett, W.4
-
4
-
-
85029767319
-
Disk Resident Arrays: An array-oriented I/O library for out-of-core computations
-
Buyya, R., IEEE Computer Society Press, Buyya, R., Jin, H., Cortes, T., eds.
-
Foster, I., Nieplocha, J. Disk Resident Arrays: An array-oriented I/O library for out-of-core computations. In Buyya, R., Jin, H., Cortes, T., eds. Disk Arrays and Parallel I/O: Theory and Practice. IEEE Computer Society Press (2001)
-
(2001)
Disk Arrays and Parallel I/O: Theory and Practice.
-
-
Foster, I.1
Nieplocha, J.2
-
5
-
-
0018907742
-
A stepwise approach to computing the multidimensional fast Fourier transform of large arrays
-
Anderson, G.L. A stepwise approach to computing the multidimensional fast Fourier transform of large arrays. IEEE Transactions on Acoustics and Speech Signal Processing 28 (1980) 280-284
-
(1980)
IEEE Transactions on Acoustics and Speech Signal Processing
, vol.28
, pp. 280-284
-
-
Anderson, G.L.1
-
6
-
-
0025403252
-
FFTs in external or hierarchical memory
-
Bailey, D.H. FFTs in external or hierarchical memory. Journal of Supercomputing 4 (1990) 23-35
-
(1990)
Journal of Supercomputing
, vol.4
, pp. 23-35
-
-
Bailey, D.H.1
-
7
-
-
33646690671
-
-
School of Computer and Information Science, The Ohio State University
-
Kazhiyur-Mannar, R., Wenger, R., Crawfis, R., Dey, T.K. Adaptive resolution isosurface construction in three and four dimensions. Technical Report OSU-CISRC-7/03-TR38, School of Computer and Information Science, The Ohio State University (2003)
-
(2003)
Adaptive Resolution Isosurface Construction in Three and Four Dimensions. Technical Report OSU-CISRC-7/03-TR38
-
-
Kazhiyur-Mannar, R.1
Wenger, R.2
Crawfis, R.3
Dey, T.K.4
-
9
-
-
19944430447
-
A high-level approach to synthesis of high-performance codes for quantum chemistry
-
Baumgartner, G., Bernholdt, D., Cociorva, D., Harrison, R., Hirata, S., Lam, C., Nooijen, M., Pitzer, R., Ramanujam, J., Sadayappan, P. A high-level approach to synthesis of high-performance codes for quantum chemistry. In: Proceedings of Supercomputing 2002. (2003)
-
(2003)
Proceedings of Supercomputing
, vol.2002
-
-
Baumgartner, G.1
Bernholdt, D.2
Cociorva, D.3
Harrison, R.4
Hirata, S.5
Lam, C.6
Nooijen, M.7
Pitzer, R.8
Ramanujam, J.9
Sadayappan, P.10
-
10
-
-
84947277042
-
Global communication optimization for tensor contraction expressions under memory constraints
-
(IPDPS).
-
Cociorva, D., Gao, X., Krishnan, S., Baumgartner, G., Lam, C., Sadayappan, P., Ramanujam, J. Global communication optimization for tensor contraction expressions under memory constraints. In: 17th International Parallel & Distributed Processing Symposium (IPDPS). (2003)
-
(2003)
17th International Parallel & Distributed Processing Symposium
-
-
Cociorva, D.1
Gao, X.2
Krishnan, S.3
Baumgartner, G.4
Lam, C.5
Sadayappan, P.6
Ramanujam, J.7
-
11
-
-
0036041078
-
Space-time trade-off optimization for a class of electronic structure calculations
-
Cociorva, D., Baumgartner, G., Lam, C., Sadayappan, P., Ramanujam, J., Nooijen, M., Bernholdt, D., Harrison, R. Space-time trade-off optimization for a class of electronic structure calculations. In: Proc. of ACM SIGPLAN PLDI2002. (2002)
-
(2002)
Proc. of ACM SIGPLAN PLDI
, vol.2002
-
-
Cociorva, D.1
Baumgartner, G.2
Lam, C.3
Sadayappan, P.4
Ramanujam, J.5
Nooijen, M.6
Bernholdt, D.7
Harrison, R.8
-
12
-
-
12444314400
-
Towards automatic synthesis of high-performance codes for electronic structure calculations: Data locality optimization
-
Cociorva, D., Wilkins, J., Baumgartner, G., Sadayappan, P., Ramanujam, J., Nooijen, M., Bernholdt, D., Harrison, R. Towards automatic synthesis of high-performance codes for electronic structure calculations: Data locality optimization. In: Proc. of the Intl. Conf. on High Performance Computing. (2001)
-
(2001)
Proc. of the Intl. Conf. on High Performance Computing.
-
-
Cociorva, D.1
Wilkins, J.2
Baumgartner, G.3
Sadayappan, P.4
Ramanujam, J.5
Nooijen, M.6
Bernholdt, D.7
Harrison, R.8
-
13
-
-
21144446087
-
Data locality optimization for synthesis of efficient out-of-core algoritms
-
Krishnan, S., Krishnamoorthy, S., Baumgartner, G., Cociorva, D., Lam, C., Sadayappan, P., Ramanujam, J., Bernholdt, D., Choppella, V. Data locality optimization for synthesis of efficient out-of-core algoritms. In: Proc. of the Intl. Conf. on High Performance Computing. (2003)
-
(2003)
Proc. of the Intl. Conf. on High Performance Computing.
-
-
Krishnan, S.1
Krishnamoorthy, S.2
Baumgartner, G.3
Cociorva, D.4
Lam, C.5
Sadayappan, P.6
Ramanujam, J.7
Bernholdt, D.8
Choppella, V.9
-
14
-
-
12444250054
-
Efficient synthesis of out-of-core algorithms using a nonlinear optimization solver
-
Krishnan, S., Krishnamoorthy, S., Baumgartner, G., Lam, C., Ramanujam, J., Choppella, V., Sadayappan, P. Efficient synthesis of out-of-core algorithms using a nonlinear optimization solver. In: Proc. of 18th Intl. Parallel & Distributed Processing Symposium (IPDPS). (2004)
-
(2004)
Proc. of 18th Intl. Parallel & Distributed Processing Symposium (IPDPS).
-
-
Krishnan, S.1
Krishnamoorthy, S.2
Baumgartner, G.3
Lam, C.4
Ramanujam, J.5
Choppella, V.6
Sadayappan, P.7
-
15
-
-
0242338386
-
-
Pacific Northwest National Laboratory, Richland, Washington 99352-0999, USA.
-
High Performance Computational Chemistry Group NWChem, A Computational Chemistry Package for Parallel Computers, Version 4.6. Pacific Northwest National Laboratory, Richland, Washington 99352-0999, USA. (2004)
-
(2004)
NWChem, A Computational Chemistry Package for Parallel Computers, Version 4.6.
-
-
-
16
-
-
0028732614
-
Global arrays: A portable programming model for distributed memory computers
-
Nieplocha, J., Harrison, R.J., Littlefield, R.J. Global arrays: a portable programming model for distributed memory computers. In: Supercomputing. (1994) 340-349
-
(1994)
Supercomputing.
, pp. 340-349
-
-
Nieplocha, J.1
Harrison, R.J.2
Littlefield, R.J.3
-
17
-
-
0030157365
-
Global arrays: A nonuniform memory access programming model for high-performance computers
-
Nieplocha, J., Harrison, R.J., Littlefield, R.J. Global arrays: A nonuniform memory access programming model for high-performance computers. The Journal of Supercomputing 10 (1996) 169-189
-
(1996)
The Journal of Supercomputing
, vol.10
, pp. 169-189
-
-
Nieplocha, J.1
Harrison, R.J.2
Littlefield, R.J.3
-
19
-
-
0000011164
-
A fast computer method for matrix transposing
-
Eklundh, J.O. A fast computer method for matrix transposing. IEEE Trans. on Computers 20 (1972) 801-803
-
(1972)
IEEE Trans. on Computers
, vol.20
, pp. 801-803
-
-
Eklundh, J.O.1
-
20
-
-
0027719668
-
Efficient transposition algorithms for large matrices
-
ACM Press
-
Kaushik, S.D., Huang, C.H., Johnson, R.W., Sadayappan, P., Johnson, J.R. Efficient transposition algorithms for large matrices. In: Proceedings of the 1993 ACM/IEEE conference on Supercomputing, ACM Press (1993) 656-665
-
(1993)
Proceedings of the 1993 ACM/IEEE Conference on Supercomputing
, pp. 656-665
-
-
Kaushik, S.D.1
Huang, C.H.2
Johnson, R.W.3
Sadayappan, P.4
Johnson, J.R.5
-
21
-
-
0036538429
-
An efficient algorithm for out-of-core matrix transposition
-
Suh, J., Prasanna, V.K. An efficient algorithm for out-of-core matrix transposition. IEEE Trans. on Computers 51 (2002) 420-438
-
(2002)
IEEE Trans. on Computers
, vol.51
, pp. 420-438
-
-
Suh, J.1
Prasanna, V.K.2
-
22
-
-
12444319714
-
On efficient out-of-core matrix transposition
-
School of Computer and Information Science, The Ohio State University
-
Krishnamoorthy, S., Baumgartner, G., Cociorva, D., Lam, C., Sadayappan, P. On efficient out-of-core matrix transposition. Technical Report OSU-CIRSC-9/03-T52, School of Computer and Information Science, The Ohio State University (2003)
-
(2003)
Technical Report OSU-CIRSC-9/03-T52
-
-
Krishnamoorthy, S.1
Baumgartner, G.2
Cociorva, D.3
Lam, C.4
Sadayappan, P.5
-
23
-
-
33646714072
-
Efficient parallel out-of-core matrix transposition
-
IEEE Computer Society Press to appear
-
Krishnamoorthy, S., Baumgartner, G., Cociorva, D., Lam, C.C., Sadayappan, P. Efficient parallel out-of-core matrix transposition. In: Proceedings of the International Conference on Cluster Computing, IEEE Computer Society Press (2003) to appear.
-
(2003)
Proceedings of the International Conference on Cluster Computing
-
-
Krishnamoorthy, S.1
Baumgartner, G.2
Cociorva, D.3
Lam, C.C.4
Sadayappan, P.5
-
24
-
-
35048866258
-
-
The Ohio Supercomputer Center, (http://www.osc.edu)
-
-
-
|