Volume , Issue , 2008, Pages

Optimizing non-blocking collective operations for InfiniBand

Author keywords

[No Author keywords available]

Indexed keywords

APPLICATION PROGRAMMING INTERFACES (API); APPLICATIONS; COMPUTER NETWORKS; DISTRIBUTED PARAMETER NETWORKS;

EID: 51049098070     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/IPDPS.2008.4536138     Document Type: Conference Paper
Times cited : (25)

References (36)
  • 2
    • 84883859962
    • T. Hoefler, J. Squyres, W. Rehm, and A. Lumsdaine, "A Case for Non-Blocking Collective Operations," in Frontiers of High Performance Computing and Networking - ISPA 2006 Workshops, vol. 4331/2006. Springer Berlin / Heidelberg, 12 2006, pp. 155-164. [Online]. Available: ./img/hoefler-ispa06.pdf
  • 3
    • 84877019178
    • The Case of the Missing Supercomputer Performance: Achieving Optimal Performance on the 8,192 Processors of ASCI Q
    • ACM
    • F. Petrini, D. J. Kerbyson, and S. Pakin, "The Case of the Missing Supercomputer Performance: Achieving Optimal Performance on the 8,192 Processors of ASCI Q," in Proceedings of the ACM/IEEE Supercomputing. ACM, 2003, p. 55.
    • (2003) Proceedings of the ACM/IEEE Supercomputing , pp. 55
    • Petrini, F.1    Kerbyson, D.J.2    Pakin, S.3
  • 4
    • 30644479805
    • Overlapping of communication and computation and early binding: Fundamental mechanisms for improving parallel performance on clusters of workstations
    • Ph.D. dissertation, Mississippi State University
    • R. Dimitrov, "Overlapping of communication and computation and early binding: Fundamental mechanisms for improving parallel performance on clusters of workstations," Ph.D. dissertation, Mississippi State University, 2001.
    • (2001)
    • Dimitrov, R.1
  • 5
    • 1242332596
    • Send-receive considered harmful: Myths and realities of message passing
    • S. Gorlatch, "Send-receive considered harmful: Myths and realities of message passing," ACM Trans. Program. Lang. Syst., vol. 26, no. 1, pp. 47-56, 2004.
    • (2004) ACM Trans. Program. Lang. Syst , vol.26 , Issue.1 , pp. 47-56
    • Gorlatch, S.1
  • 6
    • 51049109755
    • Message Passing Interface Forum, "MPI: A Message Passing Interface Standard," 1995.
  • 7
    • 0003604499
    • MPI-2: Extensions to the Message-Passing Interface
    • Technical Report, University of Tennessee, Knoxville
    • Message Passing Interface Forum, "MPI-2: Extensions to the Message-Passing Interface," Technical Report, University of Tennessee, Knoxville, 1997.
    • (1997)
    • Message Passing Interface Forum
  • 10
    • 84947212732
    • A Framework for Collective Personalized Communication
    • Nice, France, April
    • L. V. Kale, S. Kumar, and K. Varadarajan, "A Framework for Collective Personalized Communication," in Proceedings of IPDPS'03, Nice, France, April 2003.
    • (2003) Proceedings of IPDPS'03
    • Kale, L.V.1    Kumar, S.2    Varadarajan, K.3
  • 14
    • 51049113070
    • J. W. III and S. Bova, "Where's the Overlap? - An Analysis of Popular MPI Implementations," 1999. [Online]. Available: citeseer.ist.psu.edu/white99wheres.html
  • 15
    • 84948981514
    • Comb: A portable benchmark suite for assessing MPI overlap
    • IEEE Computer Society
    • W. Lawry, C. Wilson, A. B. Maccabe, and R. Brightwell, "Comb: A portable benchmark suite for assessing MPI overlap," in CLUSTER. IEEE Computer Society, 2002, pp. 472-475.
    • (2002) CLUSTER , pp. 472-475
    • Lawry, W.1    Wilson, C.2    Maccabe, A.B.3    Brightwell, R.4
  • 17
    • 51049102456
    • The InfiniBand Trade Association, InfiniBand Architecture Specification Volume 1, Release 1.2, InfiniBand Trade Association, 2003.
  • 18
    • 81455128348
    • Assessing Single-Message and Multi-Node Communication Performance of InfiniBand
    • IEEE Computer Society
    • T. Hoefler, C. Viertel, T. Mehlan, F. Mietke, and W. Rehm, "Assessing Single-Message and Multi-Node Communication Performance of InfiniBand," in Proceedings of IEEE PARELEC 2006. IEEE Computer Society, 9 2006, pp. 227-232.
    • (2006) Proceedings of IEEE PARELEC 2006 , pp. 227-232
    • Hoefler, T.1    Viertel, C.2    Mehlan, T.3    Mietke, F.4    Rehm, W.5
  • 22
    • 70350237882
    • Analysis of the Memory Registration Process in the Mellanox InfiniBand Software Stack
    • Springer-Verlag Berlin
    • F. Mietke, R. Baumgartl, R. Rex, T. Mehlan, T. Hoefler, and W. Rehm, "Analysis of the Memory Registration Process in the Mellanox InfiniBand Software Stack," in Euro-Par 2006 Parallel Processing. Springer-Verlag Berlin, 8 2006, pp. 124-133.
    • (2006) Euro-Par 2006 Parallel Processing , pp. 124-133
    • Mietke, F.1    Baumgartl, R.2    Rex, R.3    Mehlan, T.4    Hoefler, T.5    Rehm, W.6
  • 25
    • 33750234379
    • G. M. Shipman, T. S. Woodall, G. Bosilca, R. L. Graham, and A. B. Maccabe, "High performance RDMA protocols in HPC," in Proceedings, 13th European PVM/MPI Users' Group Meeting, ser. Lecture Notes in Computer Science. Bonn, Germany: Springer-Verlag, September 2006.
  • 26
    • 0018515759
    • C. L. Lawson, R. J. Hanson, D. Kincaid, and F. T. Krogh, "Basic Linear Algebra Subprograms for FORTRAN usage," ACM Trans. Math. Soft., vol. 5, pp. 308-323, 1979.
  • 31
    • 38149121511
    • T. Hoefler, T. Mehlan, A. Lumsdaine, and W. Rehm, "Netgauge: A Network Performance Measurement Framework," in Proceedings of Third International Conference, HPCC 2007, vol. 4782. Springer, 9 2007, pp. 659-671. [Online]. Available: ./img/hoefler-netgauge.pdf
  • 32
    • 51049106278
    • T. Hoefler, A. Lichei, and W. Rehm, "Low-Overhead LogGP Parameter Assessment for Modern Interconnection Networks," 03 2007. [Online]. Available: ./img/hoefler-pmeo07.pdf
  • 35
    • 34548793392
    • T. Hoefler, C. Siebert, and W. Rehm, "A practically constant-time MPI Broadcast Algorithm for large-scale InfiniBand Clusters with Multicast," in Proceedings of the 21st IEEE International Parallel & Distributed Processing Symposium. IEEE Computer Society, 03 2007, p. 232. [Online]. Available: ./img/hoefler-cac07.pdf


* This information was extracted and analyzed by KISTI from Elsevier's SCOPUS database.