-
1
-
-
0033700781
-
Synthesizing transformations for locality enhancement of imperfectly nested loops
-
AHMED, N., MATEEV, N., AND PINGALI, K. 2000. Synthesizing transformations for locality enhancement of imperfectly nested loops. In Proc. ACM Intl. Conf. on Supercomputing, 141-152.
-
(2000)
Proc. ACM Intl. Conf. on Supercomputing
, pp. 141-152
-
-
AHMED, N.1
MATEEV, N.2
PINGALI, K.3
-
2
-
-
85117163262
-
A High-Level Approach to Synthesis of High-Performance Codes for Quantum Chemistry
-
BAUMGARTNER, G., BERNHOLDT, D., COCIORVA, D., HARRISON, R., HIRATA, S., LAM, C., NOOIJEN, M., PLTZER, R., RAMANUIAM, J., AND SADAYAPPAN, P. 2002. A High-Level Approach to Synthesis of High-Performance Codes for Quantum Chemistry. In Proc. of Supercomputing 2002.
-
(2002)
Proc. of Supercomputing 2002
-
-
BAUMGARTNER, G.1
BERNHOLDT, D.2
COCIORVA, D.3
HARRISON, R.4
HIRATA, S.5
LAM, C.6
NOOIJEN, M.7
PLTZER, R.8
RAMANUIAM, J.9
SADAYAPPAN, P.10
-
3
-
-
84947911090
-
Decomposing irregularly sparse matrices for parallel matrix-vector multiplications
-
Proceedings of 3rd International Symposium on Solving Irregularly Structured Problems in Parallel, Irregular'96, Springer-Verlag, of
-
CATALYÜREK, U. V., AND AYKANAT, C. 1996. Decomposing irregularly sparse matrices for parallel matrix-vector multiplications. In Proceedings of 3rd International Symposium on Solving Irregularly Structured Problems in Parallel, Irregular'96, Springer-Verlag, vol. 1117 of Lecture Notes in Computer Science, 75-86.
-
(1996)
Lecture Notes in Computer Science
, vol.1117
, pp. 75-86
-
-
CATALYÜREK, U.V.1
AYKANAT, C.2
-
4
-
-
0033360524
-
Hypergraph-partitioning based decomposition for parallel spars e-matrix vector multiplication
-
CATALYÜREK, U. V., AND AYKANAT, C. 1999. Hypergraph-partitioning based decomposition for parallel spars e-matrix vector multiplication. IEEE TPDS 10, 7, 673-693.
-
(1999)
IEEE TPDS
, vol.10
, Issue.7
, pp. 673-693
-
-
CATALYÜREK, U.V.1
AYKANAT, C.2
-
5
-
-
0005879552
-
A hypergraph-based workload partitioning strategy for parallel data aggregation
-
SIAM
-
CHANG, C., KURC, T., SUSSMAN, A., ÇATALYÜREK, U. V., AND SALTZ, J. 2001. A hypergraph-based workload partitioning strategy for parallel data aggregation. In Proceedings of the Eleventh SIAM Conference on Parallel Processing for Scientific Computing, SIAM.
-
(2001)
Proceedings of the Eleventh SIAM Conference on Parallel Processing for Scientific Computing
-
-
CHANG, C.1
KURC, T.2
SUSSMAN, A.3
ÇATALYÜREK, U.V.4
SALTZ, J.5
-
6
-
-
34548231133
-
-
CRAWFORD, T., AND III, H. S. 2000. An Introduction to Coupled Cluster Theory for Computational Chemists. In Reviews in Computational Chemistry, K. Lipkowitz and D. Boyd, Ed., 14. John Wiley & Sons, Ltd., 33-136.
-
CRAWFORD, T., AND III, H. S. 2000. An Introduction to Coupled Cluster Theory for Computational Chemists. In Reviews in Computational Chemistry, K. Lipkowitz and D. Boyd, Ed., vol. 14. John Wiley & Sons, Ltd., 33-136.
-
-
-
-
7
-
-
0031223114
-
Level 3 basic linear algebra subprograms for sparse matrices: A user-level interface
-
DUFF, I. S., MARRONE, M., RADICATI, G., AND VITTOLI, C. 1997. Level 3 basic linear algebra subprograms for sparse matrices: a user-level interface. ACM Trans. Math. Softw. 23, 3, 379-401.
-
(1997)
ACM Trans. Math. Softw
, vol.23
, Issue.3
, pp. 379-401
-
-
DUFF, I.S.1
MARRONE, M.2
RADICATI, G.3
VITTOLI, C.4
-
8
-
-
0003573801
-
The Chaco user's guide: Version 2.0
-
Tech. Rep. SAND94-2692, Sandia National Laboratories
-
HENDRICKSON, B., AND LELAND, R. 1994. The Chaco user's guide: Version 2.0. Tech. Rep. SAND94-2692, Sandia National Laboratories.
-
(1994)
-
-
HENDRICKSON, B.1
LELAND, R.2
-
9
-
-
34548251077
-
-
HIGH PERFORMANCE COMPUTATIONAL CHEMISTRY GROUP. 2004. NWChem, A Computational Chemistry Package for Parallel Computers, Version 4.6. Pacific Northwest National Laboratory
-
HIGH PERFORMANCE COMPUTATIONAL CHEMISTRY GROUP. 2004. NWChem, A Computational Chemistry Package for Parallel Computers, Version 4.6. Pacific Northwest National Laboratory.
-
-
-
-
10
-
-
84976817516
-
CHARM++: A Portable Concurrent Object Oriented System Based on C++
-
ACM Press, A.Paepcke, Ed
-
KALÉ, L., AND KRISHNAN, S. 1993. CHARM++: A Portable Concurrent Object Oriented System Based on C++. In Proceedings of OOPSLA'93, ACM Press, A.Paepcke, Ed., 91-108.
-
(1993)
Proceedings of OOPSLA'93
, pp. 91-108
-
-
KALÉ, L.1
KRISHNAN, S.2
-
11
-
-
0030686036
-
Multilevel hypergraph partitioning: Applications in VLSI domain
-
KARYPIS, G., AGGRAWAL, R., KUMAR, V., AND SHEKHAR, S. 1997. Multilevel hypergraph partitioning: Applications in VLSI domain. In Proc. of 34th Design Automation Conference.
-
(1997)
Proc. of 34th Design Automation Conference
-
-
KARYPIS, G.1
AGGRAWAL, R.2
KUMAR, V.3
SHEKHAR, S.4
-
12
-
-
33845325178
-
A Hypergraph Partitioning Based Approach for Scheduling of Tasks with Batchshared I/O
-
To Appear
-
KHANNA, G., VYDYANATHAN, N., KURC, T., CATALYUREK, U., WYCKOFF, P., SALTZ, J., AND SADAYAPPAN, P. 2005. A Hypergraph Partitioning Based Approach for Scheduling of Tasks with Batchshared I/O. In Proceedings of the 5th IEEE/ACM International Symposium on Cluster Computing and the Grid (CCGrid 2005). To Appear.
-
(2005)
Proceedings of the 5th IEEE/ACM International Symposium on Cluster Computing and the Grid (CCGrid 2005)
-
-
KHANNA, G.1
VYDYANATHAN, N.2
KURC, T.3
CATALYUREK, U.4
WYCKOFF, P.5
SALTZ, J.6
SADAYAPPAN, P.7
-
13
-
-
0030685988
-
Data-centric multi-level blocking
-
KODUKULA, I., AHMED, N., AND PINGALI, K. 1997. Data-centric multi-level blocking. In Proc. SIGPLAN Conf. Programming Language Design and Implementation, 346-357.
-
(1997)
Proc. SIGPLAN Conf. Programming Language Design and Implementation
, pp. 346-357
-
-
KODUKULA, I.1
AHMED, N.2
PINGALI, K.3
-
14
-
-
33847143321
-
An extensible global address space framework with decoupled task and data abstractions
-
KRISHNAMOORTHY, S., CATALYUREK, U., NIEPLOCHA, J., ROUNTEV, A., AND SADAYAPPAN, P. 2006. An extensible global address space framework with decoupled task and data abstractions. In Proc. IPDPS Workshop on Next Generation Software.
-
(2006)
Proc. IPDPS Workshop on Next Generation Software
-
-
KRISHNAMOORTHY, S.1
CATALYUREK, U.2
NIEPLOCHA, J.3
ROUNTEV, A.4
SADAYAPPAN, P.5
-
15
-
-
21144446087
-
Data Locality Optimization for Synthesis of Efficient Out-of-Core Algorithms
-
Springer Verlag
-
KRISHNAN, S., KRISHNAMOORTHY, S., BAUMGARTNER, G., COCIORVA, D., LAM, C., SADAYAPPAN, P., RAMANUJAM, J., BERNHOLDT, D., AND CHOPPELLA, V. 2003. Data Locality Optimization for Synthesis of Efficient Out-of-Core Algorithms. In Proc. 10th Annual International Conference on High Performance Computing (HiPC), Springer Verlag, 406-417.
-
(2003)
Proc. 10th Annual International Conference on High Performance Computing (HiPC)
, pp. 406-417
-
-
KRISHNAN, S.1
KRISHNAMOORTHY, S.2
BAUMGARTNER, G.3
COCIORVA, D.4
LAM, C.5
SADAYAPPAN, P.6
RAMANUJAM, J.7
BERNHOLDT, D.8
CHOPPELLA, V.9
-
16
-
-
12444250054
-
Efficient synthesis of out-of-core algorithms for tensor contractions using a nonlinear optimization solver
-
KRISHNAN, S., KRISHNAMOORTHY, S., BAUMGARTNER, G., LAM, C.-C., RAMANUJAM, J., SADAYAPPAN, P., AND CHOPPELLA, V. 2004. Efficient synthesis of out-of-core algorithms for tensor contractions using a nonlinear optimization solver. In The 18th International Parallel and Distributed Processing Symposium.
-
(2004)
The 18th International Parallel and Distributed Processing Symposium
-
-
KRISHNAN, S.1
KRISHNAMOORTHY, S.2
BAUMGARTNER, G.3
LAM, C.-C.4
RAMANUJAM, J.5
SADAYAPPAN, P.6
CHOPPELLA, V.7
-
17
-
-
0032067773
-
-
LIM, A. W., AND LAM, M. S. 1998. Maximizing parallelism and minimizing synchronization with affine partitions. Parallel Computing 24, 3-4 (May), 445-475.
-
LIM, A. W., AND LAM, M. S. 1998. Maximizing parallelism and minimizing synchronization with affine partitions. Parallel Computing 24, 3-4 (May), 445-475.
-
-
-
-
18
-
-
17644395320
-
Blocking and array contraction across arbitrarily nested loops using affine partitioning
-
ACM Press
-
LIM, A., LIAO, S., AND LAM, M. 2001. Blocking and array contraction across arbitrarily nested loops using affine partitioning. In Proc. 8th ACM SIGPLAN Symposium on Principles and Practices of Parallel Programming, ACM Press, 103-112.
-
(2001)
Proc. 8th ACM SIGPLAN Symposium on Principles and Practices of Parallel Programming
, pp. 103-112
-
-
LIM, A.1
LIAO, S.2
LAM, M.3
-
20
-
-
0013398077
-
-
PhD thesis, MIT Department of Electrical Engineering and Computer Science
-
RANDALL, K. H. 1998. Cilk: Efficient Multithreaded Computing. PhD thesis, MIT Department of Electrical Engineering and Computer Science.
-
(1998)
Cilk: Efficient Multithreaded Computing
-
-
RANDALL, K.H.1
-
21
-
-
33845445660
-
Integrated loop optimizations for data locality enhancement of tensor contraction expressions
-
SAHOO, S. K., KRISHNAMOORTHY, S., PANUGANTI, R., AND SADAYAPPAN, P. 2005. Integrated loop optimizations for data locality enhancement of tensor contraction expressions. In Proc. Supercomputing (SC 2005).
-
(2005)
Proc. Supercomputing (SC 2005)
-
-
SAHOO, S.K.1
KRISHNAMOORTHY, S.2
PANUGANTI, R.3
SADAYAPPAN, P.4
-
22
-
-
34548272716
-
-
A manual for the CHAOS runtime library. Tech. Rep. CS-TR-3437 and UMIACS-TR-95-34, University of Maryland, Department of Computer Science and UMIACS, March
-
SALTZ, J., PONNUSAMY, R., SHARMA, S., MOON, B., AND DAS, R. 1995. A manual for the CHAOS runtime library. Tech. Rep. CS-TR-3437 and UMIACS-TR-95-34, University of Maryland, Department of Computer Science and UMIACS, March.
-
(1995)
-
-
SALTZ, J.1
PONNUSAMY, R.2
SHARMA, S.3
MOON, B.4
DAS, R.5
-
24
-
-
0003603271
-
Official Aztec user's guide; Version 2.1
-
Tech. rep, Sandia National Laboratories
-
TUMINARO, R. S., HEROUX, M., HUTCHINSON, S. A., AND SHADID, J. N. 1999. Official Aztec user's guide; Version 2.1. Tech. rep., Sandia National Laboratories.
-
(1999)
-
-
TUMINARO, R.S.1
HEROUX, M.2
HUTCHINSON, S.A.3
SHADID, J.N.4
|