메뉴 건너뛰기




Volumn , Issue , 2005, Pages 393-402

Automatic generation and tuning of MPI collective communication routines

Author keywords

Cluster of Workstations; Empirical; MPI; Tuning System

Indexed keywords

CLUSTER OF WORKSTATIONS; EMPIRICAL; MESSAGE PASSING INTERFACE (MPI); TUNING SYSTEMS;

EID: 32844458895     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1145/1088149.1088202     Document Type: Conference Paper
Times cited : (75)

References (23)
  • 3
    • 50149106169 scopus 로고    scopus 로고
    • Bandwidth efficient all-to-all broadcast on switched clusters
    • Department of Computer Science, Florida State University, May
    • A. Faraj, P. Patarasuk, and X. Yuan. Bandwidth Efficient All-to-All Broadcast on Switched Clusters. Technical Report, Department of Computer Science, Florida State University, May 2005.
    • (2005) Technical Report
    • Faraj, A.1    Patarasuk, P.2    Yuan, X.3
  • 4
    • 32844460718 scopus 로고    scopus 로고
    • Message scheduling for all-to-all personalized communication on ethernet switched clusters
    • April
    • A. Faraj and X. Yuan. Message Scheduling for All-to-all Personalized Communication on Ethernet Switched Clusters. IEEE IPDPS, April 2005.
    • (2005) IEEE IPDPS
    • Faraj, A.1    Yuan, X.2
  • 7
    • 0037997900 scopus 로고
    • A high-performance, portable implementation of the MPI message passing interface standard
    • W. Gropp, E. Lusk, N. Doss, and A. Skjellum. A High-Performance, Portable Implementation of the MPI Message Passing Interface Standard. In MPI Developers Conference, 1995.
    • (1995) MPI Developers Conference
    • Gropp, W.1    Lusk, E.2    Doss, N.3    Skjellum, A.4
  • 8
    • 24144445542 scopus 로고    scopus 로고
    • Reproducible measurements of MPI performance characteristics
    • Argonne National Labratory, Argonne, IL, June
    • W. Gropp and E. Lusk. Reproducible Measurements of MPI Performance Characteristics. Technical Report ANL/MCS-P755-0699, Argonne National Labratory, Argonne, IL, June 1999.
    • (1999) Technical Report ANL/MCS-P755-0699
    • Gropp, W.1    Lusk, E.2
  • 11
    • 84947212732 scopus 로고    scopus 로고
    • A framework for collective personalized communication
    • April
    • L. V. Kale, S. Kumar, K. Varadarajan, "A Framework for Collective Personalized Communication," IPDPS'03, April 2003.
    • (2003) IPDPS'03
    • Kale, L.V.1    Kumar, S.2    Varadarajan, K.3
  • 12
    • 1442337675 scopus 로고    scopus 로고
    • CC-MPI: A compiled communication capable MPI prototype for ethernet switched clusters
    • June
    • A. Karwande, X. Yuan, and D.K. Lowenthal. CC-MPI: A Compiled Communication Capable MPI Prototype for Ethernet Switched Clusters. In ACM SIGPLAN PPoPP, pages 95-106, June 2003.
    • (2003) ACM SIGPLAN PPoPP , pp. 95-106
    • Karwande, A.1    Yuan, X.2    Lowenthal, D.K.3
  • 13
    • 18844428650 scopus 로고    scopus 로고
    • Magpie: MPI's collective communication operations for clustered wide area systems
    • May
    • T. Kielmann, et. al. Magpie: MPI's Collective Communication Operations for Clustered Wide Area Systems. In ACM SIGPLAN PPoPP, pages 131-140, May 1999.
    • (1999) ACM SIGPLAN PPoPP , pp. 131-140
    • Kielmann, T.1
  • 17
    • 0038674285 scopus 로고    scopus 로고
    • OMPI: Optimizing MPI programs using partial evaluation
    • November
    • H. Ogawa and S. Matsuoka. OMPI: Optimizing MPI Programs Using Partial Evaluation. In Supercomputing'96, November 1996.
    • (1996) Supercomputing'96
    • Ogawa, H.1    Matsuoka, S.2
  • 18
    • 0033463967 scopus 로고    scopus 로고
    • Multi-processor molecular dynamics using the brenner potential: Parallelization of an implicit multi-body potential
    • Feb.
    • I. Rosenblum, J. Adler, and S. Brandon. Multi-processor molecular dynamics using the Brenner potential: Parallelization of an implicit multi-body potential. International Journal of Modern Physics, C 10(1):189-203, Feb. 1999.
    • (1999) International Journal of Modern Physics , vol.10 C , Issue.1 , pp. 189-203
    • Rosenblum, I.1    Adler, J.2    Brandon, S.3
  • 20
    • 0003576826 scopus 로고    scopus 로고
    • Program transformation and runtime support for threaded MPI execution on shared-memory machines
    • July
    • H. Tang, K. Shen, and T. Yang. Program Transformation and Runtime Support for Threaded MPI Execution on Shared-Memory Machines. ACM Transactions on Programming Languages and Systems, 22(4):673-700, July 2000.
    • (2000) ACM Transactions on Programming Languages and Systems , vol.22 , Issue.4 , pp. 673-700
    • Tang, H.1    Shen, K.2    Yang, T.3
  • 21
    • 32844461816 scopus 로고    scopus 로고
    • Optimizing of collective communication operations in MPICH
    • Mathematics and Computer Science Division, Argonne National Laboratory, March
    • R. Thakur, R. Rabenseifner, and W. Gropp. Optimizing of Collective Communication Operations in MPICH. ANL/MCS-P1140-0304, Mathematics and Computer Science Division, Argonne National Laboratory, March 2004.
    • (2004) ANL/MCS-P1140-0304
    • Thakur, R.1    Rabenseifner, R.2    Gropp, W.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.