-
1
-
-
0032067773
-
Maximizing parallelism and minimizing synchronization with affine partitions
-
Lim, A.W., Lam, M.S.: Maximizing parallelism and minimizing synchronization with affine partitions. Parallel Computing 24 (1998) 445-475
-
(1998)
Parallel Computing
, vol.24
, pp. 445-475
-
-
Lim, A.W.1
Lam, M.S.2
-
3
-
-
33645983963
-
Advances, applications and performance of the global arrays shared memory programming toolkit
-
to appear
-
Nieplocha, J., Palmer, B., Tipparaju, V., Krishnan, M., Trease, H., Apra, E.: Advances, Applications and Performance of the Global Arrays Shared Memory Programming Toolkit. Intern. J. High Perf. Comp. Applications to appear (2005)
-
(2005)
Intern. J. High Perf. Comp. Applications
-
-
Nieplocha, J.1
Palmer, B.2
Tipparaju, V.3
Krishnan, M.4
Trease, H.5
Apra, E.6
-
4
-
-
85117163262
-
A high-level approach to synthesis of highperformance codes for quantum chemistry
-
Baumgartner, G., Bernholdt, D., Cociorva, D., Harrison, R., Hirata, S., Lam, C., Nooijen, M., Pitzer, R., Ramanujam, J., Sadayappan, P.: A High-Level Approach to Synthesis of HighPerformance Codes for Quantum Chemistry. In: Proc. of Supercomputing 2002. (2002)
-
(2002)
Proc. of Supercomputing 2002
-
-
Baumgartner, G.1
Bernholdt, D.2
Cociorva, D.3
Harrison, R.4
Hirata, S.5
Lam, C.6
Nooijen, M.7
Pitzer, R.8
Ramanujam, J.9
Sadayappan, P.10
-
5
-
-
0027268763
-
Parallel molecular dynamics with the embedded atom method
-
Proc. of Materials Theory and Modelling
-
Plimpton, S.J., Hendrickson, B.A.: Parallel molecular dynamics with the embedded atom method. In: Proc. of Materials Theory and Modelling, MRS Proceedings (1993) 37
-
(1993)
MRS Proceedings
, pp. 37
-
-
Plimpton, S.J.1
Hendrickson, B.A.2
-
11
-
-
0030686036
-
Multilevel hypergraph partitioning: Applications in VLSI domain
-
Karypis, G., Aggrawal, R., Kumar, V., Shekhar, S.: Multilevel hypergraph partitioning: Applications in VLSI domain. In: Proc. of 34th Design Automation Conference. (1997)
-
(1997)
Proc. of 34th Design Automation Conference
-
-
Karypis, G.1
Aggrawal, R.2
Kumar, V.3
Shekhar, S.4
-
12
-
-
0033360524
-
Hypergraph-partitioning based decomposition for parallel spars e-matrix vector multiplication
-
Çatalyürek, U.V., Aykanat, C.: Hypergraph-partitioning based decomposition for parallel spars e-matrix vector multiplication. IEEE TPDS 10 (1999) 673-693
-
(1999)
IEEE TPDS
, vol.10
, pp. 673-693
-
-
Çatalyürek, U.V.1
Aykanat, C.2
-
13
-
-
0345566357
-
Tensor contraction engine: Abstraction and automated parallel implementation of configuration-interaction, coupled-cluster, and many-body perturbation theories
-
Hitara, S.: Tensor contraction engine: Abstraction and automated parallel implementation of configuration-interaction, coupled-cluster, and many-body perturbation theories. J. Phys. Chem. A 107 (2003) 9887-9897
-
(2003)
J. Phys. Chem. A
, vol.107
, pp. 9887-9897
-
-
Hitara, S.1
-
14
-
-
0031223114
-
Level 3 basic linear algebra subprograms for sparse matrices: A user-level interface
-
Duff, I.S., Marrone, M., Radicati, G., Vittoli, C.: Level 3 basic linear algebra subprograms for sparse matrices: a user-level interface. ACM Trans. Math. Softw. 23 (1997) 379-401
-
(1997)
ACM Trans. Math. Softw.
, vol.23
, pp. 379-401
-
-
Duff, I.S.1
Marrone, M.2
Radicati, G.3
Vittoli, C.4
-
15
-
-
0003603271
-
Official Aztec user's guide: Version 2.1
-
Sandia National Laboratories
-
Tuminaro, R.S., Heroux, M., Hutchinson, S.A., Shadid, J.N.: Official Aztec user's guide: Version 2.1. Technical report, Sandia National Laboratories (1999)
-
(1999)
Technical Report
-
-
Tuminaro, R.S.1
Heroux, M.2
Hutchinson, S.A.3
Shadid, J.N.4
-
16
-
-
0003573801
-
The Chaco user's guide: Version 2.0
-
Sandia National Laboratories
-
Hendrickson, B., Leland, R.: The Chaco user's guide: Version 2.0. Technical Report SAND94-2692, Sandia National Laboratories (1994)
-
(1994)
Technical Report
, vol.SAND94-2692
-
-
Hendrickson, B.1
Leland, R.2
-
17
-
-
84949516058
-
A load balancing strategy for prioritized execution of tasks
-
Newport Beach, CA
-
Sinha, A., Kalé, L.: A load balancing strategy for prioritized execution of tasks. In: Seventh International Parallel Processing Symposium, Newport Beach, CA. (1993) 230-237
-
(1993)
Seventh International Parallel Processing Symposium
, pp. 230-237
-
-
Sinha, A.1
Kalé, L.2
-
18
-
-
84976817516
-
CHARM++: A portable concurrent object oriented system based on C++
-
Paepcke, A., ed.: , ACM Press
-
Kalé, L., Krishnan, S.: CHARM++: A Portable Concurrent Object Oriented System Based on C++. In Paepcke, A., ed.: Proceedings of OOPSLA'93, ACM Press (1993) 91-108
-
(1993)
Proceedings of OOPSLA'93
, pp. 91-108
-
-
Kalé, L.1
Krishnan, S.2
-
19
-
-
0013398077
-
-
PhD thesis, MIT Department of Electrical Engineering and Computer Science
-
Randall, K.H.: Cilk: Efficient Multithreaded Computing. PhD thesis, MIT Department of Electrical Engineering and Computer Science (1998)
-
(1998)
Cilk: Efficient Multithreaded Computing
-
-
Randall, K.H.1
-
20
-
-
0005879552
-
A hypergraph-based workload partitioning strategy for parallel data aggregation
-
SIAM
-
Chang, C., Kurc, T., Sussman, A., Çatalyürek, U.V., Saltz, J.: A hypergraph-based workload partitioning strategy for parallel data aggregation. In: Proceedings of the Eleventh SIAM Conference on Parallel Processing for Scientific Computing, SIAM (2001)
-
(2001)
Proceedings of the Eleventh SIAM Conference on Parallel Processing for Scientific Computing
-
-
Chang, C.1
Kurc, T.2
Sussman, A.3
Çatalyürek, U.V.4
Saltz, J.5
|