-
1
-
-
35248859849
-
-
R. Thakur and W. Gropp, Improving the performance of collective operations in MPICH, in Recent Advances in Parallel Virtual Machine and Message Passing Interface, ser. LNCS, J. Dongarra, D. Laforenza, and S. Orlando, Eds., no. 2840. Springer Verlag, 2003, pp. 257-267, 10th European PVM/MPI User's Group Meeting, Venice, Italy.
-
R. Thakur and W. Gropp, "Improving the performance of collective operations in MPICH," in Recent Advances in Parallel Virtual Machine and Message Passing Interface, ser. LNCS, J. Dongarra, D. Laforenza, and S. Orlando, Eds., no. 2840. Springer Verlag, 2003, pp. 257-267, 10th European PVM/MPI User's Group Meeting, Venice, Italy.
-
-
-
-
2
-
-
34548804377
-
Towards an Accurate Model for Collective Communications
-
San Francisco, USA
-
S. Vadhiyar, G. Fagg, and J. Dongarra, "Towards an Accurate Model for Collective Communications," in Proceedings of International Conference on Computational Science (ICCS 2001), San Francisco, USA, 2001.
-
(2001)
Proceedings of International Conference on Computational Science (ICCS 2001)
-
-
Vadhiyar, S.1
Fagg, G.2
Dongarra, J.3
-
3
-
-
18844428650
-
-
T. Kielmann, R. F. H. Hofman, H. E. Bal. A. Plaat. and R. A. F. Bhoedjang, MagPle: MPI's collective communication operations for clustered wide area systems, ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP'99). 34, no. 8, pp. 131-140, May 1999.
-
T. Kielmann, R. F. H. Hofman, H. E. Bal. A. Plaat. and R. A. F. Bhoedjang, "MagPle: MPI's collective communication operations for clustered wide area systems," ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP'99). vol. 34, no. 8, pp. 131-140, May 1999.
-
-
-
-
4
-
-
34548696258
-
-
R. Rabenseifner and J. L. Träff, More efficient reduction algorithms for non-power-of-two number of processors in message-passing parallel systems. in Proceedings of EuroPVM/MPI, set. Lecture Notes in Computer Science. Springer-Verlag, 2004, pp. 36-46.
-
R. Rabenseifner and J. L. Träff, "More efficient reduction algorithms for non-power-of-two number of processors in message-passing parallel systems." in Proceedings of EuroPVM/MPI, set. Lecture Notes in Computer Science. Springer-Verlag, 2004, pp. 36-46.
-
-
-
-
5
-
-
34248373234
-
Star-mpi: Self tuned adaptive routines for mpi collective operations
-
New York, NY, USA: ACM Press
-
A. Faraj, X. Yuan, and D. Lowenthal. "Star-mpi: self tuned adaptive routines for mpi collective operations," in ICS '06: Proceedings of the 20th annual international conference on Supercomputing. New York, NY, USA: ACM Press. 2006, pp. 199-208.
-
(2006)
ICS '06: Proceedings of the 20th annual international conference on Supercomputing
, pp. 199-208
-
-
Faraj, A.1
Yuan, X.2
Lowenthal, D.3
-
7
-
-
33847103649
-
Optimizing bandwidth limited problems using one-sided communication and overlap
-
C. Bell, D. Bonachea, R. Nishtala, and K. Yelick, "Optimizing bandwidth limited problems using one-sided communication and overlap," in 20th International Parallel and Distributed Processing Symposium (IPDPS), 2006.
-
(2006)
20th International Parallel and Distributed Processing Symposium (IPDPS)
-
-
Bell, C.1
Bonachea, D.2
Nishtala, R.3
Yelick, K.4
-
8
-
-
13244279577
-
Minimizing development and maintenance costs in supporting persistently optimized blas
-
R. C. Whaley and A. Petite, "Minimizing development and maintenance costs in supporting persistently optimized blas," Software: Practice and Experience, vol. 35, no. 2, pp. 101-121. 2005.
-
(2005)
Software: Practice and Experience
, vol.35
, Issue.2
, pp. 101-121
-
-
Whaley, R.C.1
Petite, A.2
-
9
-
-
20744449792
-
The Design and Implementation of FFTW3
-
M. Frigo and S. G. Johnson, "The Design and Implementation of FFTW3," Proceedings of IEEE, vol. 93, no. 2, pp. 216-231. 2005.
-
(2005)
Proceedings of IEEE
, vol.93
, Issue.2
, pp. 216-231
-
-
Frigo, M.1
Johnson, S.G.2
-
10
-
-
35048884271
-
Open MPI: Goals, Concept, and Design of a Next Generation MPI Implementation
-
Budapest, Hungary, September
-
E. Gabriel, G. E. Fagg, G. Bosilca, T. Angskun, J. J. Dongarra, J. M. Squyres, V. Sahay, P. Kambadur, B. Barrett, A. Lumsdaine. R. H. Castain, D. J. Daniel, R. L. Graham, and T. S. Woodall, "Open MPI: Goals, Concept, and Design of a Next Generation MPI Implementation," in Proceedings, 1lth European PVM/MPI Users' Group Meeting, Budapest, Hungary, September 2004, pp. 97-104.
-
(2004)
Proceedings, 1lth European PVM/MPI Users' Group Meeting
, pp. 97-104
-
-
Gabriel, E.1
Fagg, G.E.2
Bosilca, G.3
Angskun, T.4
Dongarra, J.J.5
Squyres, J.M.6
Sahay, V.7
Kambadur, P.8
Barrett, B.9
Lumsdaine, A.10
Castain, R.H.11
Daniel, D.J.12
Graham, R.L.13
Woodall, T.S.14
-
11
-
-
11844300540
-
Efficient execution of MPI applications on the grid: Porting and optimization issues
-
R. Keller, E. Gabriel, B. Krammer, M. S. Mller, and M. M. Resch, "Efficient execution of MPI applications on the grid: porting and optimization issues'," Journal of Grid Computing, vol. 1, no. 2, pp. 133-149, 2003.
-
(2003)
Journal of Grid Computing
, vol.1
, Issue.2
, pp. 133-149
-
-
Keller, R.1
Gabriel, E.2
Krammer, B.3
Mller, M.S.4
Resch, M.M.5
|