-
2
-
-
8344267505
-
An analysis of the impact of MPI overlap and independent progress
-
New York, NY, USA: ACM Press
-
R. Brightwell and K. D. Underwood, "An analysis of the impact of MPI overlap and independent progress," in ICS '04: Proceedings of the 18th annual international conference on Supercomputing. New York, NY, USA: ACM Press, 2004, pp. 298-305.
-
(2004)
ICS '04: Proceedings of the 18th annual international conference on Supercomputing
, pp. 298-305
-
-
Brightwell, R.1
Underwood, K.D.2
-
3
-
-
30644479805
-
Overlapping of communication and computation and early binding: Fundamental mechanisms for improving parallel performance on clusters of workstations,
-
Ph.D. dissertation, Mississippi State University
-
R. Dimitrov, "Overlapping of communication and computation and early binding: Fundamental mechanisms for improving parallel performance on clusters of workstations," Ph.D. dissertation, Mississippi State University, 2001.
-
(2001)
-
-
Dimitrov, R.1
-
4
-
-
33745195144
-
Hunting the overlap
-
Washington, DC, USA: IEEE Computer Society
-
C. Iancu, P. Husbands, and P. Hargrove, "Hunting the overlap," in PACT '05: Proceedings of the 14th International Conference on Parallel Architectures and Compilation Techniques (PACT'05). Washington, DC, USA: IEEE Computer Society, 2005, pp. 279-290.
-
(2005)
PACT '05: Proceedings of the 14th International Conference on Parallel Architectures and Compilation Techniques (PACT'05)
, pp. 279-290
-
-
Iancu, C.1
Husbands, P.2
Hargrove, P.3
-
5
-
-
84949094818
-
-
J. W. III and S. Bova, Where's the Overlap? - An Analysis of Popular MPI Implementations, 1999. [Online]. Available: citeseer.ist.psu.edu/white99wheres.html
-
J. W. III and S. Bova, "Where's the Overlap? - An Analysis of Popular MPI Implementations," 1999. [Online]. Available: citeseer.ist.psu.edu/white99wheres.html
-
-
-
-
6
-
-
84949098871
-
-
A. Adelmann, W. P. P. A. Bonelli and, and C. W. Ueberhuber, Communication efficiency of parallel 3d ffts. in High Performance Computing for Computational Science - VECPAR 2004, 6th International Conference, Valencia, Spain, June 28-30, 2004, Revised Selected and Invited Papers, ser. Lecture Notes in Computer Science, 3402. Springer, 2004, pp. 901-907.
-
A. Adelmann, W. P. P. A. Bonelli and, and C. W. Ueberhuber, "Communication efficiency of parallel 3d ffts." in High Performance Computing for Computational Science - VECPAR 2004, 6th International Conference, Valencia, Spain, June 28-30, 2004, Revised Selected and Invited Papers, ser. Lecture Notes in Computer Science, vol. 3402. Springer, 2004, pp. 901-907.
-
-
-
-
8
-
-
0035004646
-
Redistribution strategies for portable parallel FFT: A case study
-
A. Dubey and D. Tessera, "Redistribution strategies for portable parallel FFT: a case study." Concurrency and Computation: Practice and Experience, vol. 13, no. 3, pp. 209-220, 2001.
-
(2001)
Concurrency and Computation: Practice and Experience
, vol.13
, Issue.3
, pp. 209-220
-
-
Dubey, A.1
Tessera, D.2
-
9
-
-
0042532049
-
An efficient 3-dim FFT for plane wave electronic structure calculations on massively parallel machines composed of multiprocessor nodes
-
Aug
-
S. Goedecker, M. Boulet, and T. Deutsch, "An efficient 3-dim FFT for plane wave electronic structure calculations on massively parallel machines composed of multiprocessor nodes," Computer Physics Communications, vol. 154, pp. 105-110, Aug. 2003.
-
(2003)
Computer Physics Communications
, vol.154
, pp. 105-110
-
-
Goedecker, S.1
Boulet, M.2
Deutsch, T.3
-
10
-
-
56749151145
-
-
T. Hoefler, A. Lumsdaine, and W. Rehm, Implementation and Performance Analysis of Non-Blocking Collective Operations for MPI, in In proceedings of the 2007 International Conference on High Performance Computing, Networking, Storage and Analysis, SC07. IEEE Computer Society/ACM, 11 2007. [Online]. Available: ./img/hoefler-sc07.pdf
-
T. Hoefler, A. Lumsdaine, and W. Rehm, "Implementation and Performance Analysis of Non-Blocking Collective Operations for MPI," in In proceedings of the 2007 International Conference on High Performance Computing, Networking, Storage and Analysis, SC07. IEEE Computer Society/ACM, 11 2007. [Online]. Available: ./img/hoefler-sc07.pdf
-
-
-
-
11
-
-
0018515759
-
-
C. L. Lawson, R. J. Hanson, D. Kincaid, and F. T. Krogh, Basic Linear Algebra Subprograms for FORTRAN usage, in In ACM Trans. Math. Soft., 5 (1979), pp. 308-323, 1979.
-
C. L. Lawson, R. J. Hanson, D. Kincaid, and F. T. Krogh, "Basic Linear Algebra Subprograms for FORTRAN usage," in In ACM Trans. Math. Soft., 5 (1979), pp. 308-323, 1979.
-
-
-
-
12
-
-
33750234379
-
-
G. M. Shipman, T. S. Woodall, G. Bosilca, R. ch L. Graham, and A. B. Maccabe, High performance RDMA protocols in HPC, in Proceedings, 13th European PVM/MPI Users' Group Meeting, ser. Lecture Notes in Computer Science. Bonn, Germany: Springer-Verlag, September 2006.
-
G. M. Shipman, T. S. Woodall, G. Bosilca, R. ch L. Graham, and A. B. Maccabe, "High performance RDMA protocols in HPC," in Proceedings, 13th European PVM/MPI Users' Group Meeting, ser. Lecture Notes in Computer Science. Bonn, Germany: Springer-Verlag, September 2006.
-
-
-
-
13
-
-
51049098070
-
-
T. Hoefler and A. Lumsdaine, Optimizing non-blocking Collective Operations for InfiniBand, in Proceedings of the 22nd IEEE International Parallel & Distributed Processing Symposium (IPDPS), 04 2008. [Online]. Available: ./img//hoefler-cac08.pdf
-
T. Hoefler and A. Lumsdaine, "Optimizing non-blocking Collective Operations for InfiniBand," in Proceedings of the 22nd IEEE International Parallel & Distributed Processing Symposium (IPDPS), 04 2008. [Online]. Available: ./img//hoefler-cac08.pdf
-
-
-
-
14
-
-
0013357434
-
Asynchronous mpi messaging on myrinet
-
Washington, DC, USA: IEEE Computer Society
-
C. Keppitiyagama and A. S. Wagner, "Asynchronous mpi messaging on myrinet," in IPDPS '01: Proceedings of the 15th International Parallel & Distributed Processing Symposium. Washington, DC, USA: IEEE Computer Society, 2001, p. 50.
-
(2001)
IPDPS '01: Proceedings of the 15th International Parallel & Distributed Processing Symposium
, pp. 50
-
-
Keppitiyagama, C.1
Wagner, A.S.2
-
15
-
-
84944408245
-
Emp: Zero-copy os-bypass nic-driven gigabit ethernet message passing
-
New York, NY, USA: ACM Press
-
P. Shivam, P. Wyckoff, and D. Panda, "Emp: zero-copy os-bypass nic-driven gigabit ethernet message passing," in Supercomputing '01: Proceedings of the 2001 ACM/IEEE conference on Supercomputing (CDROM). New York, NY, USA: ACM Press, 2001, pp. 57-57.
-
(2001)
Supercomputing '01: Proceedings of the 2001 ACM/IEEE conference on Supercomputing (CDROM)
, pp. 57-57
-
-
Shivam, P.1
Wyckoff, P.2
Panda, D.3
-
17
-
-
12444259728
-
Efficient and scalable barrier over quadrics and myrinet with a new nic-based collective message passing protocol
-
Santa Fe, New Mexico, USA
-
W. Yu, D. Buntinas, R. L. Graham, and D. K. Panda, "Efficient and scalable barrier over quadrics and myrinet with a new nic-based collective message passing protocol." in 18th International Parallel and Distributed Processing Symposium (IPDPS 2004), CD-ROM / Abstracts Proceedings, 26-30 April 2004, Santa Fe, New Mexico, USA, 2004.
-
(2004)
18th International Parallel and Distributed Processing Symposium (IPDPS 2004), CD-ROM / Abstracts Proceedings, 26-30 April 2004
-
-
Yu, W.1
Buntinas, D.2
Graham, R.L.3
Panda, D.K.4
-
18
-
-
20444508120
-
Application-bypass reduction for large-scale clusters
-
IEEE Computer Society, December
-
A. Wagner, D. Buntinas, D. K. Panda, and R. Brightwell, "Application-bypass reduction for large-scale clusters." in 2003 IEEE International Conference on Cluster Computing (CLUSTER 2003). IEEE Computer Society, December 2003, pp. 404-411.
-
(2003)
2003 IEEE International Conference on Cluster Computing (CLUSTER 2003)
, pp. 404-411
-
-
Wagner, A.1
Buntinas, D.2
Panda, D.K.3
Brightwell, R.4
-
19
-
-
57949084992
-
Application-bypass broadcast in mpich over gm
-
Washington, DC, USA: IEEE Computer Society
-
D. Buntinas, D. K. Panda, and R. Brightwell, "Application-bypass broadcast in mpich over gm," in CCGRID '03: Proceedings of the 3st International Symposium on Cluster Computing and the Grid. Washington, DC, USA: IEEE Computer Society, 2003, p. 2.
-
(2003)
CCGRID '03: Proceedings of the 3st International Symposium on Cluster Computing and the Grid
, pp. 2
-
-
Buntinas, D.1
Panda, D.K.2
Brightwell, R.3
-
20
-
-
84949085373
-
-
R. Rabenseifner, Hybrid parallel programming on hpc platforms, in In proceedings of the Fifth European Workshop on OpenMP, EWOMP'03, Aachen, Germany, 2003.
-
R. Rabenseifner, "Hybrid parallel programming on hpc platforms," in In proceedings of the Fifth European Workshop on OpenMP, EWOMP'03, Aachen, Germany, 2003.
-
-
-
-
21
-
-
38449103903
-
-
T. Hoefler, P. Kambadur, R. L. Graham, G. Shipman, and A. Lumsdaine, A Case for Standard Non-Blocking Collective Operations, in Recent Advances in Parallel Virtual Machine and Message Passing Interface, EuroPVM/MPI 2007, 4757. Springer, 10 2007, pp. 125-134. [Online]. Available: ./img/hoefler-nbc-standard.pdf
-
T. Hoefler, P. Kambadur, R. L. Graham, G. Shipman, and A. Lumsdaine, "A Case for Standard Non-Blocking Collective Operations," in Recent Advances in Parallel Virtual Machine and Message Passing Interface, EuroPVM/MPI 2007, vol. 4757. Springer, 10 2007, pp. 125-134. [Online]. Available: ./img/hoefler-nbc-standard.pdf
-
-
-
-
22
-
-
38149121511
-
Netgauge: A Network Performance Measurement Framework
-
Springer, 9
-
T. Hoefler, T. Mehlan, A. Lumsdaine, and W. Rehm, "Netgauge: A Network Performance Measurement Framework," in High Performance Computing and Communications, Third International Conference, HPCC 2007, Houston, USA, September 26-28, 2007, Proceedings, vol. 4782. Springer, 9 2007, pp. 659-671.
-
(2007)
High Performance Computing and Communications, Third International Conference, HPCC 2007, Houston, USA, September 26-28, 2007, Proceedings
, vol.4782
, pp. 659-671
-
-
Hoefler, T.1
Mehlan, T.2
Lumsdaine, A.3
Rehm, W.4
-
23
-
-
51049102790
-
-
T. Hoefler, T. Schneider, and A. Lumsdaine, Accurately Measuring Collective Operations at Massive Scale, in Proceedings of the 22nd IEEE International Parallel & Distributed Processing Symposium (IPDPS), 04 2008. [Online]. Available: ./img/hoefler-pmeo08.pdf
-
T. Hoefler, T. Schneider, and A. Lumsdaine, "Accurately Measuring Collective Operations at Massive Scale," in Proceedings of the 22nd IEEE International Parallel & Distributed Processing Symposium (IPDPS), 04 2008. [Online]. Available: ./img/hoefler-pmeo08.pdf
-
-
-
-
25
-
-
33746274942
-
Performance Analysis of MPI Collective Operations
-
Denver, CO, April
-
J. Pjesivac-Grbovic, T. Angskun, G. Bosilca, G. E. Fagg, E. Gabriel, and J. J. Dongarra, "Performance Analysis of MPI Collective Operations," in Proceedings of the 19th International Parallel and Distributed Processing Symposium, 4th International Workshop on Performance Modeling, Evaluation, and Optimization of Parallel and Distributed Systems (PMEO-PDS 05), Denver, CO, April 2005.
-
(2005)
Proceedings of the 19th International Parallel and Distributed Processing Symposium, 4th International Workshop on Performance Modeling, Evaluation, and Optimization of Parallel and Distributed Systems (PMEO-PDS 05)
-
-
Pjesivac-Grbovic, J.1
Angskun, T.2
Bosilca, G.3
Fagg, G.E.4
Gabriel, E.5
Dongarra, J.J.6
-
26
-
-
50149113637
-
The case against user-level networking
-
Madrid, Spain
-
K. Magoutis, M. I. Seltzer, and E. Gabber, "The case against user-level networking," in Proceedings of Workshop on Novel Uses of System-Area Networks (SAN-3), Madrid, Spain, 2004.
-
(2004)
Proceedings of Workshop on Novel Uses of System-Area Networks (SAN-3)
-
-
Magoutis, K.1
Seltzer, M.I.2
Gabber, E.3
|