메뉴 건너뛰기




Volumn 21, Issue 5, 2010, Pages 698-709

Self-consistent MPI performance guidelines

Author keywords

Message passing; Message passing interface; MPI; Parallel processing; Performance guidelines; Performance model; Performance portability; Performance prediction; Public benchmarking

Indexed keywords

MESSAGE PASSING INTERFACE; PARALLEL PROCESSING; PERFORMANCE GUIDELINES; PERFORMANCE MODEL; PERFORMANCE PREDICTION;

EID: 77950627571     PISSN: 10459219     EISSN: None     Source Type: Journal    
DOI: 10.1109/TPDS.2009.120     Document Type: Article
Times cited : (35)

References (41)
  • 5
    • 37549003336 scopus 로고    scopus 로고
    • MapReduce: Simplified data processing on large clusters
    • J. Dean and S. Ghemawat, "MapReduce: Simplified Data Processing on Large Clusters," Comm. ACM, vol.51, no.1, pp. 107-113, 2008.
    • (2008) Comm. ACM , vol.51 , Issue.1 , pp. 107-113
    • Dean, J.1    Ghemawat, S.2
  • 7
    • 70349746305 scopus 로고    scopus 로고
    • PRO: A model for the design and analysis of efficient and scalable parallel algorithms
    • A.H. Gebremedhin, M. Essaïdi, I.G. Lassous, J. Gustedt, and J.A. Telle, "PRO: A Model for the Design and Analysis of Efficient and Scalable Parallel Algorithms," Nordic J. Computing, vol.13, pp. 215- 239, 2006.
    • (2006) Nordic J. Computing , vol.13 , pp. 215-239
    • Gebremedhin, A.H.1    Essaïdi, M.2    Lassous, I.G.3    Gustedt, J.4    Telle, J.A.5
  • 9
    • 0001064241 scopus 로고    scopus 로고
    • Toward formally-based design of message passing programs
    • Mar.
    • S. Gorlatch, "Toward Formally-Based Design of Message Passing Programs," IEEE Trans. Software Eng., vol.26, no.3, pp. 276-288, Mar. 2000.
    • (2000) IEEE Trans. Software Eng. , vol.26 , Issue.3 , pp. 276-288
    • Gorlatch, S.1
  • 10
    • 1242332596 scopus 로고    scopus 로고
    • Send-Receive considered harmful: Myths and realities of message passing
    • S. Gorlatch, "Send-Receive Considered Harmful: Myths and Realities of Message Passing," ACM Trans. Programming Languages and Systems, vol.26, no.1, pp. 47-56, 2004.
    • (2004) ACM Trans. Programming Languages and Systems , vol.26 , Issue.1 , pp. 47-56
    • Gorlatch, S.1
  • 15
    • 0030673395 scopus 로고    scopus 로고
    • Application restructuring and performance portability on shared virtual memory and Hardware-Coherent multiprocessors
    • D. Jiang, H. Shan, and J.P. Singh, "Application Restructuring and Performance Portability on Shared Virtual Memory and Hardware- Coherent Multiprocessors," Proc. Sixth ACM SIGPLAN Symp. Principles and Practice of Parallel Programming (PPoPP), pp. 217-229, 1997. (Pubitemid 127452555)
    • (1997) SIGPLAN Notices (ACM Special Interest Group on Programming Languages) , vol.32 , Issue.7 , pp. 217-229
    • Jiang, D.1    Shan, H.2    Singh, J.P.3
  • 16
    • 34548281165 scopus 로고    scopus 로고
    • A practical approach to performance analysis and modeling of large-scale systems
    • DOI 10.1145/1188455.1188670, Proceedings of the 2006 ACM/IEEE Conference on Supercomputing, SC'06
    • D.J. Kerbyson and A. Hoisie, "S05-A Practical Approach to Performance Analysis and Modeling of Large-Scale Systems," Proc. ACM/IEEE SC Conf. High Performance Networking and Computing, p. 206, 2006. (Pubitemid 47318737)
    • (2006) Proceedings of the 2006 ACM/IEEE Conference on Supercomputing, SC'06 , pp. 1188670
    • Kerbyson, D.J.1    Hoisie, A.2
  • 18
    • 33748875639 scopus 로고    scopus 로고
    • Optimizing MPI collective communication by orthogonal structures
    • DOI 10.1007/s10586-006-9740-9, Cluster Computing in Science and Engineering
    • M. Kühnemann, T. Rauber, and G. Rünger, "Optimizing MPI Collective Communication by Orthogonal Structures," Cluster Computing, vol.9, no.3, pp. 257-279, 2006. (Pubitemid 44419723)
    • (2006) Cluster Computing , vol.9 , Issue.3 , pp. 257-279
    • Kuhnemann, M.1    Rauber, T.2    Runger, G.3
  • 20
    • 28044435048 scopus 로고    scopus 로고
    • A performance model of non-deterministic particle transport on large-scale systems
    • DOI 10.1016/j.future.2004.11.018, PII S0167739X04002249
    • M.M. Mathis, D.J. Kerbyson, and A. Hoisie, "A Performance Model of Non-Deterministic Particle Transport on Large-Scale Systems," Future Generation Computer Systems, vol.22, no.3, pp. 324-335, 2006. (Pubitemid 41689814)
    • (2006) Future Generation Computer Systems , vol.22 , Issue.3 , pp. 324-335
    • Mathis, M.M.1    Kerbyson, D.J.2    Hoisie, A.3
  • 24
    • 77950626520 scopus 로고    scopus 로고
    • An LPAR-Customized MPI-Alltoallv for the materials science Code CASTEP
    • The Univ. of Edinburgh
    • M. Plummer and K. Refson, "An LPAR-Customized MPI-Alltoallv for the Materials Science Code CASTEP," Technical Report HPCxTR0401, EPCC, The Univ. of Edinburgh, 2004.
    • (2004) Technical Report HPCxTR0401, EPCC
    • Plummer, M.1    Refson, K.2
  • 25
    • 0038587352 scopus 로고    scopus 로고
    • Using SKaMPI for developing high-performance MOI programs with performance portability
    • R. Reussner, "Using SKaMPI for Developing High-Performance MOI Programs with Performance Portability," Future Generation Computing Systems, vol.19, no.5, pp. 749-759, 2003.
    • (2003) Future Generation Computing Systems , vol.19 , Issue.5 , pp. 749-759
    • Reussner, R.1
  • 26
    • 84957882532 scopus 로고    scopus 로고
    • SKaMPI: A detailed, accurate MPI benchmark
    • Recent Advances in Parallel Virtual Machine and Message Passing Interface
    • R. Reussner, P. Sanders, L. Prechelt, and M. Müller, "SKaMPI: A Detailed, Accurate MPI Benchmark," Proc. Recent Advances in Parallel Virtual Machine and Message Passing Interface: Fifth European PVM/MPI Users' Group Meeting, pp. 52-59, 1998. (Pubitemid 128135093)
    • (1998) LECTURE NOTES IN COMPUTER SCIENCE , Issue.1497 , pp. 52-62
    • Reussner, R.1    Sanders, P.2    Prechelt, L.3    Mueller, M.4
  • 27
    • 0036082072 scopus 로고    scopus 로고
    • SKaMPI: A comprehensive benchmark for public benchmarking of MPI
    • R. Reussner, P. Sanders, and J.L. Träff, "SKaMPI: A Comprehensive Benchmark for Public Benchmarking of MPI," Scientific Programming, vol.10, no.1, pp. 55-65, 2002. (Pubitemid 34685255)
    • (2002) Scientific Programming , vol.10 , Issue.1 , pp. 55-65
    • Reussner, R.1    Sanders, P.2    Traff, J.L.3
  • 29
    • 35048892196 scopus 로고    scopus 로고
    • Generation of simple analytical models for message passing applications
    • Euro-Par 2004 Parallel Processing
    • G. Rodríguez, R.M. Badia, and J. Labarta, "Generation of Simple Analytical Models for Message Passing Applications," Proc. Euro- Par '04 Parallel Processing Conf., pp. 183-188, 2004. (Pubitemid 39217270)
    • (2004) LECTURE NOTES IN COMPUTER SCIENCE , Issue.3149 , pp. 183-188
    • Rodriguez, G.1    Badia, R.M.2    Labarta, J.3
  • 32
    • 35248859849 scopus 로고    scopus 로고
    • Improving the performance of collective operations in MPICH
    • Recent Advances in Parallel Virtual Machine and Message Passing Interface
    • R. Thakur, W.D. Gropp, and R. Rabenseifner, "Improving the Performance of Collective Operations in MPICH," Int'l J. High Performance Computing Applications, vol.19, pp. 49-66, 2004. (Pubitemid 37240338)
    • (2003) LECTURE NOTES IN COMPUTER SCIENCE , Issue.2840 , pp. 257-267
    • Thakur, R.1    Gropp, W.D.2
  • 36
    • 0025467711 scopus 로고
    • A bridging model for parallel computation
    • L.G. Valiant, "A Bridging Model for Parallel Computation," Comm. ACM, vol.33, no.8, pp. 103-111, 1990.
    • (1990) Comm. ACM , vol.33 , Issue.8 , pp. 103-111
    • Valiant, L.G.1
  • 37
    • 23844503894 scopus 로고    scopus 로고
    • Performance portability in the physical parameterizations of the community atmospheric model
    • P. Worley and J. Drake, "Performance Portability in the Physical Parameterizations of the Community Atmospheric Model," Int'l J. High Performance Computing Applications, vol.19, no.3, pp. 187-202, 2005.
    • (2005) Int'l J. High Performance Computing Applications , vol.19 , Issue.3 , pp. 187-202
    • Worley, P.1    Drake, J.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.