SCOPUS 정보 검색 플랫폼

Proceedings - IEEE International Conference on Cluster Computing, ICCC

Volumn Proceedings of the 2008 IEEE International Conference on Cluster Computing, ICCC 2008, Issue , 2008, Pages 213-222

Message progression in parallel computing - To thread or not to thread?

(2) Hoefler, Torsten a Lumsdaine, Andrew a

a INDIANA UNIVERSITY (United States)

Author keywords

[No Author keywords available]

Indexed keywords

BENCHMARKING; PARALLEL PROCESSING SYSTEMS;

CPU CORES; OPERATING SYSTEMS; PARALLEL APPLICATIONS; PARALLEL COMPUTING; PERFORMANCE IMPROVEMENTS; REAL-TIME SCHEDULING;

COMMUNICATION;

EID: 57949089840 PISSN: 15525244 EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/CLUSTR.2008.4663774 Document Type: Conference Paper

Times cited : (99)

References (26)

1
- 33845423787
- Computation-communication overlap on network-of-workstation multiprocessors
- July
- G. Liu and T. Abdelrahman, "Computation-communication overlap on network-of-workstation multiprocessors," in Proc. of the Int'l Conference on Parallel and Distributed Processing Techniques and Applications, July 1998, pp. 1635-1642.
- (1998) Proc. of the Int'l Conference on Parallel and Distributed Processing Techniques and Applications , pp. 1635-1642
- Liu, G.¹ Abdelrahman, T.²

2
- 8344267505
- An analysis of the impact of MPI overlap and independent progress
- New York, NY, USA: ACM Press
- R. Brightwell and K. D. Underwood, "An analysis of the impact of MPI overlap and independent progress," in ICS '04: Proceedings of the 18th annual international conference on Supercomputing. New York, NY, USA: ACM Press, 2004, pp. 298-305.
- (2004) ICS '04: Proceedings of the 18th annual international conference on Supercomputing , pp. 298-305
- Brightwell, R.¹ Underwood, K.D.²

3
- 30644479805
- Overlapping of communication and computation and early binding: Fundamental mechanisms for improving parallel performance on clusters of workstations,
- Ph.D. dissertation, Mississippi State University
- R. Dimitrov, "Overlapping of communication and computation and early binding: Fundamental mechanisms for improving parallel performance on clusters of workstations," Ph.D. dissertation, Mississippi State University, 2001.
- (2001)
- Dimitrov, R.¹

4
- 33745195144
- Hunting the overlap
- Washington, DC, USA: IEEE Computer Society
- C. Iancu, P. Husbands, and P. Hargrove, "Hunting the overlap," in PACT '05: Proceedings of the 14th International Conference on Parallel Architectures and Compilation Techniques (PACT'05). Washington, DC, USA: IEEE Computer Society, 2005, pp. 279-290.
- (2005) PACT '05: Proceedings of the 14th International Conference on Parallel Architectures and Compilation Techniques (PACT'05) , pp. 279-290
- Iancu, C.¹ Husbands, P.² Hargrove, P.³

5
- 84949094818
- J. W. III and S. Bova, Where's the Overlap? - An Analysis of Popular MPI Implementations, 1999. [Online]. Available: citeseer.ist.psu.edu/white99wheres.html
- J. W. III and S. Bova, "Where's the Overlap? - An Analysis of Popular MPI Implementations," 1999. [Online]. Available: citeseer.ist.psu.edu/white99wheres.html

6
- 84949098871
- A. Adelmann, W. P. P. A. Bonelli and, and C. W. Ueberhuber, Communication efficiency of parallel 3d ffts. in High Performance Computing for Computational Science - VECPAR 2004, 6th International Conference, Valencia, Spain, June 28-30, 2004, Revised Selected and Invited Papers, ser. Lecture Notes in Computer Science, 3402. Springer, 2004, pp. 901-907.
- A. Adelmann, W. P. P. A. Bonelli and, and C. W. Ueberhuber, "Communication efficiency of parallel 3d ffts." in High Performance Computing for Computational Science - VECPAR 2004, 6th International Conference, Valencia, Spain, June 28-30, 2004, Revised Selected and Invited Papers, ser. Lecture Notes in Computer Science, vol. 3402. Springer, 2004, pp. 901-907.

7
- 0041140906
- C. Calvin and F. Desprez, "Minimizing communication overhead using pipelining for multidimensional fft on distributed memory machines," 1993.
- (1993) Minimizing communication overhead using pipelining for multidimensional fft on distributed memory machines
- Calvin, C.¹ Desprez, F.²

8
- 0035004646
- Redistribution strategies for portable parallel FFT: A case study
- A. Dubey and D. Tessera, "Redistribution strategies for portable parallel FFT: a case study." Concurrency and Computation: Practice and Experience, vol. 13, no. 3, pp. 209-220, 2001.
- (2001) Concurrency and Computation: Practice and Experience , vol.13 , Issue.3 , pp. 209-220
- Dubey, A.¹ Tessera, D.²

9
- 0042532049
- An efficient 3-dim FFT for plane wave electronic structure calculations on massively parallel machines composed of multiprocessor nodes
- Aug
- S. Goedecker, M. Boulet, and T. Deutsch, "An efficient 3-dim FFT for plane wave electronic structure calculations on massively parallel machines composed of multiprocessor nodes," Computer Physics Communications, vol. 154, pp. 105-110, Aug. 2003.
- (2003) Computer Physics Communications , vol.154 , pp. 105-110
- Goedecker, S.¹ Boulet, M.² Deutsch, T.³

10
- 56749151145
- T. Hoefler, A. Lumsdaine, and W. Rehm, Implementation and Performance Analysis of Non-Blocking Collective Operations for MPI, in In proceedings of the 2007 International Conference on High Performance Computing, Networking, Storage and Analysis, SC07. IEEE Computer Society/ACM, 11 2007. [Online]. Available: ./img/hoefler-sc07.pdf
- T. Hoefler, A. Lumsdaine, and W. Rehm, "Implementation and Performance Analysis of Non-Blocking Collective Operations for MPI," in In proceedings of the 2007 International Conference on High Performance Computing, Networking, Storage and Analysis, SC07. IEEE Computer Society/ACM, 11 2007. [Online]. Available: ./img/hoefler-sc07.pdf

11
- 0018515759
- C. L. Lawson, R. J. Hanson, D. Kincaid, and F. T. Krogh, Basic Linear Algebra Subprograms for FORTRAN usage, in In ACM Trans. Math. Soft., 5 (1979), pp. 308-323, 1979.
- C. L. Lawson, R. J. Hanson, D. Kincaid, and F. T. Krogh, "Basic Linear Algebra Subprograms for FORTRAN usage," in In ACM Trans. Math. Soft., 5 (1979), pp. 308-323, 1979.

12
- 33750234379
- G. M. Shipman, T. S. Woodall, G. Bosilca, R. ch L. Graham, and A. B. Maccabe, High performance RDMA protocols in HPC, in Proceedings, 13th European PVM/MPI Users' Group Meeting, ser. Lecture Notes in Computer Science. Bonn, Germany: Springer-Verlag, September 2006.
- G. M. Shipman, T. S. Woodall, G. Bosilca, R. ch L. Graham, and A. B. Maccabe, "High performance RDMA protocols in HPC," in Proceedings, 13th European PVM/MPI Users' Group Meeting, ser. Lecture Notes in Computer Science. Bonn, Germany: Springer-Verlag, September 2006.

13
- 51049098070
- T. Hoefler and A. Lumsdaine, Optimizing non-blocking Collective Operations for InfiniBand, in Proceedings of the 22nd IEEE International Parallel & Distributed Processing Symposium (IPDPS), 04 2008. [Online]. Available: ./img//hoefler-cac08.pdf
- T. Hoefler and A. Lumsdaine, "Optimizing non-blocking Collective Operations for InfiniBand," in Proceedings of the 22nd IEEE International Parallel & Distributed Processing Symposium (IPDPS), 04 2008. [Online]. Available: ./img//hoefler-cac08.pdf

14
- 0013357434
- Asynchronous mpi messaging on myrinet
- Washington, DC, USA: IEEE Computer Society
- C. Keppitiyagama and A. S. Wagner, "Asynchronous mpi messaging on myrinet," in IPDPS '01: Proceedings of the 15th International Parallel & Distributed Processing Symposium. Washington, DC, USA: IEEE Computer Society, 2001, p. 50.
- (2001) IPDPS '01: Proceedings of the 15th International Parallel & Distributed Processing Symposium , pp. 50
- Keppitiyagama, C.¹ Wagner, A.S.²

15
- 84944408245
- Emp: Zero-copy os-bypass nic-driven gigabit ethernet message passing
- New York, NY, USA: ACM Press
- P. Shivam, P. Wyckoff, and D. Panda, "Emp: zero-copy os-bypass nic-driven gigabit ethernet message passing," in Supercomputing '01: Proceedings of the 2001 ACM/IEEE conference on Supercomputing (CDROM). New York, NY, USA: ACM Press, 2001, pp. 57-57.
- (2001) Supercomputing '01: Proceedings of the 2001 ACM/IEEE conference on Supercomputing (CDROM) , pp. 57-57
- Shivam, P.¹ Wyckoff, P.² Panda, D.³

16
- 84944750609
- pp
- W. Yu, D. Buntinas, and D. K. Panda, "High Performance and Reliable NIC-Based Multicast over Myrinet/GM-2," pp. 197-204, 2003.
- (2003) High Performance and Reliable NIC-Based Multicast over Myrinet/GM-2 , pp. 197-204
- Yu, W.¹ Buntinas, D.² Panda, D.K.³

17
- 12444259728
- Efficient and scalable barrier over quadrics and myrinet with a new nic-based collective message passing protocol
- Santa Fe, New Mexico, USA
- W. Yu, D. Buntinas, R. L. Graham, and D. K. Panda, "Efficient and scalable barrier over quadrics and myrinet with a new nic-based collective message passing protocol." in 18th International Parallel and Distributed Processing Symposium (IPDPS 2004), CD-ROM / Abstracts Proceedings, 26-30 April 2004, Santa Fe, New Mexico, USA, 2004.
- (2004) 18th International Parallel and Distributed Processing Symposium (IPDPS 2004), CD-ROM / Abstracts Proceedings, 26-30 April 2004
- Yu, W.¹ Buntinas, D.² Graham, R.L.³ Panda, D.K.⁴

18
- 20444508120
- Application-bypass reduction for large-scale clusters
- IEEE Computer Society, December
- A. Wagner, D. Buntinas, D. K. Panda, and R. Brightwell, "Application-bypass reduction for large-scale clusters." in 2003 IEEE International Conference on Cluster Computing (CLUSTER 2003). IEEE Computer Society, December 2003, pp. 404-411.
- (2003) 2003 IEEE International Conference on Cluster Computing (CLUSTER 2003) , pp. 404-411
- Wagner, A.¹ Buntinas, D.² Panda, D.K.³ Brightwell, R.⁴

19
- 57949084992
- Application-bypass broadcast in mpich over gm
- Washington, DC, USA: IEEE Computer Society
- D. Buntinas, D. K. Panda, and R. Brightwell, "Application-bypass broadcast in mpich over gm," in CCGRID '03: Proceedings of the 3st International Symposium on Cluster Computing and the Grid. Washington, DC, USA: IEEE Computer Society, 2003, p. 2.
- (2003) CCGRID '03: Proceedings of the 3st International Symposium on Cluster Computing and the Grid , pp. 2
- Buntinas, D.¹ Panda, D.K.² Brightwell, R.³

20
- 84949085373
- R. Rabenseifner, Hybrid parallel programming on hpc platforms, in In proceedings of the Fifth European Workshop on OpenMP, EWOMP'03, Aachen, Germany, 2003.
- R. Rabenseifner, "Hybrid parallel programming on hpc platforms," in In proceedings of the Fifth European Workshop on OpenMP, EWOMP'03, Aachen, Germany, 2003.

21
- 38449103903
- T. Hoefler, P. Kambadur, R. L. Graham, G. Shipman, and A. Lumsdaine, A Case for Standard Non-Blocking Collective Operations, in Recent Advances in Parallel Virtual Machine and Message Passing Interface, EuroPVM/MPI 2007, 4757. Springer, 10 2007, pp. 125-134. [Online]. Available: ./img/hoefler-nbc-standard.pdf
- T. Hoefler, P. Kambadur, R. L. Graham, G. Shipman, and A. Lumsdaine, "A Case for Standard Non-Blocking Collective Operations," in Recent Advances in Parallel Virtual Machine and Message Passing Interface, EuroPVM/MPI 2007, vol. 4757. Springer, 10 2007, pp. 125-134. [Online]. Available: ./img/hoefler-nbc-standard.pdf

22
- 38149121511
- Netgauge: A Network Performance Measurement Framework
- Springer, 9
- T. Hoefler, T. Mehlan, A. Lumsdaine, and W. Rehm, "Netgauge: A Network Performance Measurement Framework," in High Performance Computing and Communications, Third International Conference, HPCC 2007, Houston, USA, September 26-28, 2007, Proceedings, vol. 4782. Springer, 9 2007, pp. 659-671.
- (2007) High Performance Computing and Communications, Third International Conference, HPCC 2007, Houston, USA, September 26-28, 2007, Proceedings , vol.4782 , pp. 659-671
- Hoefler, T.¹ Mehlan, T.² Lumsdaine, A.³ Rehm, W.⁴

23
- 51049102790
- T. Hoefler, T. Schneider, and A. Lumsdaine, Accurately Measuring Collective Operations at Massive Scale, in Proceedings of the 22nd IEEE International Parallel & Distributed Processing Symposium (IPDPS), 04 2008. [Online]. Available: ./img/hoefler-pmeo08.pdf
- T. Hoefler, T. Schneider, and A. Lumsdaine, "Accurately Measuring Collective Operations at Massive Scale," in Proceedings of the 22nd IEEE International Parallel & Distributed Processing Symposium (IPDPS), 04 2008. [Online]. Available: ./img/hoefler-pmeo08.pdf

24
- 3042688013
- Automatic MPI Counter Profiling
- R. Rabenseifner, "Automatic MPI Counter Profiling," in Proceedings of 42nd CUG Conference, 2000.
- (2000) Proceedings of 42nd CUG Conference
- Rabenseifner, R.¹

25
- 33746274942
- Performance Analysis of MPI Collective Operations
- Denver, CO, April
- J. Pjesivac-Grbovic, T. Angskun, G. Bosilca, G. E. Fagg, E. Gabriel, and J. J. Dongarra, "Performance Analysis of MPI Collective Operations," in Proceedings of the 19th International Parallel and Distributed Processing Symposium, 4th International Workshop on Performance Modeling, Evaluation, and Optimization of Parallel and Distributed Systems (PMEO-PDS 05), Denver, CO, April 2005.
- (2005) Proceedings of the 19th International Parallel and Distributed Processing Symposium, 4th International Workshop on Performance Modeling, Evaluation, and Optimization of Parallel and Distributed Systems (PMEO-PDS 05)
- Pjesivac-Grbovic, J.¹ Angskun, T.² Bosilca, G.³ Fagg, G.E.⁴ Gabriel, E.⁵ Dongarra, J.J.⁶

26
- 50149113637
- The case against user-level networking
- Madrid, Spain
- K. Magoutis, M. I. Seltzer, and E. Gabber, "The case against user-level networking," in Proceedings of Workshop on Novel Uses of System-Area Networks (SAN-3), Madrid, Spain, 2004.
- (2004) Proceedings of Workshop on Novel Uses of System-Area Networks (SAN-3)
- Magoutis, K.¹ Seltzer, M.I.² Gabber, E.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.