



Volume 15, Issue 4, 2007, Pages 481-491

Performance modeling of communication and computation in hybrid MPI and OpenMP applications

Author keywords

Cluster; MPI; OpenMP; Performance modeling; SMP

Indexed keywords

CLUSTER ANALYSIS; COMPUTATIONAL COMPLEXITY; COMPUTATIONAL METHODS; OPTIMIZATION; PROGRAM COMPILERS; TELECOMMUNICATION SYSTEMS;

EID: 33947361066     PISSN: 1569190X     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.simpat.2006.11.014     Document Type: Article
Times cited : (26)

References (43)
  • 1
    • 2442517698
    • Parallel program performance prediction using deterministic task graph analysis
    • Adve V.S., and Vernon M.K. Parallel program performance prediction using deterministic task graph analysis. ACM Transactions On Computer Systems 22 1 (2004) 94-136
    • (2004) ACM Transactions On Computer Systems , vol.22 , Issue.1 , pp. 94-136
    • Adve, V.S.1    Vernon, M.K.2
  • 3
    • 0347133254
    • Exploiting distributed-memory and shared-memory parallelism on clusters of smps with data parallel programs
    • Benkner S., and Sipková V. Exploiting distributed-memory and shared-memory parallelism on clusters of smps with data parallel programs. International Journal of Parallel Programming 31 1 (2003) 3-19
    • (2003) International Journal of Parallel Programming , vol.31 , Issue.1 , pp. 3-19
    • Benkner, S.1    Sipková, V.2
  • 5
    • 33947407825
    • J.M. Bull, Measuring synchronisation and scheduling overheads in openmp, in: European Workshop on OpenMP (EWOMP1999), Lund, Sweden, 1999.
  • 6
    • 33947355118
    • I.J. Bush, C.J. Noble, R.J. Allan, Mixed openmp and mpi for parallel fortran applications, in: European Workshop on OpenMP (EWOMP2000), Edinburgh, UK, 2000.
  • 7
    • 33947394170
    • F. Cappello, D. Etiemble, Mpi versus mpi + openmp on ibm sp for the nas benchmarks, in: SC2000, Supercomputing 2000, November, Dallas, 2000.
  • 8
    • 33947426186
    • Edmond Chow, David Hysom. Assessing performance of hybrid mpi/openmp programs on smp clusters. Technical Report UCRL-JC-143957, Lawrence Livermore National Laboratory, May 2001.
  • 10
    • 12444315069
    • N. Drosinos, N. Koziris. Performance comparison of pure MPI vs hybrid MPI-OpenMP parallelization models on SMP clusters, in: Proceedings of the 18th International Parallel and Distributed Processing Symposium 2004 (IPDPS 2004), Santa Fe, New Mexico, April 2004, p. 15.
  • 11
    • 33947388965
    • Message Passing Interface Forum. .
  • 13
    • 33947417844
    • L. Giraud, Combining shared and distributed memory programming models on clusters of symmetric multiprocessors: some basic promising experiments. Working Note WN/PA/01/19, CERFACS, Toulouse, France, 2001.
  • 14
    • 33947433774
    • Pallas GmbH. Pallas mpi benchmarks - pmb. .
  • 15
    • 84958053214
    • Reproducible measurements of MPI performance characteristics
    • Recent Advances in Parallel Virtual Machine and Message Passing Interface. Dongarra J., Luque E., and Margalef T. (Eds), Springer Verlag 6th European PVM/MPI Users' Group Meeting, Barcelona, Spain, September 1999
    • Gropp W.D., and Lusk E. Reproducible measurements of MPI performance characteristics. In: Dongarra J., Luque E., and Margalef T. (Eds). Recent Advances in Parallel Virtual Machine and Message Passing Interface. Lecture Notes in Computer Science vol. 1697 (1999), Springer Verlag 11-18 6th European PVM/MPI Users' Group Meeting, Barcelona, Spain, September 1999
    • (1999) Lecture Notes in Computer Science , vol.1697 , pp. 11-18
    • Gropp, W.D.1    Lusk, E.2
  • 17
    • 33947382375
    • D.S. Henty, Performance of hybrid message-passing and shared-memory parallelism for discrete element modeling, in: Supercomputing 2000, 2000.
  • 18
    • 0028401457
    • The communication challenge for mpp: Intel paragon and meiko cs-2
    • Hockney R.W. The communication challenge for mpp: Intel paragon and meiko cs-2. Parallel Computing 20 3 (1994) 389-398
    • (1994) Parallel Computing , vol.20 , Issue.3 , pp. 389-398
    • Hockney, R.W.1
  • 19
    • 33947364128
    • M.D. Jones, R. Yao, Parallel osem reconstruction speed with mpi, openmp, and hybrid mpi-openmp programming models, in: IEEE Nuclear Science Symposium and Medical Imaging Conference Record, Rome, Italy, October 2004.
  • 20
    • 84876347047
    • Thilo Kielmann, Henri E. Bal, Kees Verstoep, Fast measurement of logp parameters for message passing platforms, in: IPDPS Workshops, 2000, pp. 1176-1183.
  • 21
    • 33947420584
    • Rick Kufrin, Perfsuite: an accessible, open source, performance analysis environment for linux, in: 6th International Conference on Linux Clusters (LCI- 2005), Chapel Hill, NC, April 2005.
  • 22
    • 12444290884
    • M. Kühnemann, T. Rauber, G. Rünger, A source code analyzer for performance prediction, in: Proceedings of the IPDPS-Workshop on Massively Parallel Processing (CDROM), IEEE, 2004.
  • 23
    • 33947410530
    • Piero Lanucara, Sergio Rovida, Conjugate-gradients algorithms: an mpiopenmp implementation on, in: First European Workshop on OpenMP, 1999, pp. 76-78.
  • 24
  • 25
    • 0033873170
    • Amitava Majumdar, Parallel performance study of monte carlo photon transport code on shared-, distributed-, and distributed-shared-memory architectures, in: IPDPS, 2000, p. 93.
  • 27
    • 0032137545
    • A compiler optimization algorithm for shared-memory multiprocessors
    • McKinley K.S. A compiler optimization algorithm for shared-memory multiprocessors. IEEE Transactions on Parallel and Distributed Systems 9 8 (1998) 769-787
    • (1998) IEEE Transactions on Parallel and Distributed Systems , vol.9 , Issue.8 , pp. 769-787
    • McKinley, K.S.1
  • 28
    • 33947428938
    • P. Mucci, K. London, The Mpbench Report, 1998.
  • 30
    • 0036734103
    • Effects of ordering strategies and programming paradigms on sparse matrix computations
    • Oliker L., Li X., Husbands P., and Biswas R. Effects of ordering strategies and programming paradigms on sparse matrix computations. SIAM Rev. 44 3 (2002) 373-393
    • (2002) SIAM Rev. , vol.44 , Issue.3 , pp. 373-393
    • Oliker, L.1    Li, X.2    Husbands, P.3    Biswas, R.4
  • 31
    • 33947416311
    • Open64. .
  • 32
    • 33947413241
    • OpenMP. .
  • 33
    • 33947356391
    • OpenUH. .
  • 34
    • 84957882532
    • Ralf Reussner, Peter Sanders, Lutz Prechelt, and Matthias Muller. Skampi: A detailed, accurate MPI benchmark. In PVM/MPI, 1998, pp. 52-59.
  • 35
    • 0036082072
    • Skampi: a comprehensive benchmark for public benchmarking of mpi
    • Reussner R., Sanders P., and Träff J.L. Skampi: a comprehensive benchmark for public benchmarking of mpi. Scientific Programming 10 1 (2002) 55-65
    • (2002) Scientific Programming , vol.10 , Issue.1 , pp. 55-65
    • Reussner, R.1    Sanders, P.2    Träff, J.L.3
  • 38
    • 33947376222
    • SPHINX. .
  • 39
    • 20444497314
    • Parallel-multigrid computation of unsteady incompressible viscous flows using a matrix-free implicit method and high-resolution characteristics-based scheme
    • Tai C.H., Zhao Y., and Liew K.M. Parallel-multigrid computation of unsteady incompressible viscous flows using a matrix-free implicit method and high-resolution characteristics-based scheme. Computer Methods in Applied Mechanics and Engineering 194 36-38 (2005) 3949-3983
    • (2005) Computer Methods in Applied Mechanics and Engineering , vol.194 , Issue.36-38 , pp. 3949-3983
    • Tai, C.H.1    Zhao, Y.2    Liew, K.M.3
  • 40
    • 33947381549
    • M.B. van Gijzen. Two level parallelism in a stream-function model for global ocean circulation. Technical Report TR/PA/03/09, CERFACS, Toulouse, France, 2003.
  • 41
    • 33947368794
    • Huang W. and Tafti D.K. A parallel computing framework for dynamic power balancing in adaptive mesh refinement applications. In Parallel CFD99, Williamsburg, VA, May 1999.


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.