메뉴 건너뛰기




Volumn 2, Issue , 2006, Pages 3-8

Performance modeling of communication and computation in hybrid MPI and OpenMP applications

Author keywords

[No Author keywords available]

Indexed keywords

PERFORMANCE EVALUATION; RUNTIME SYSTEM;

EID: 34047216159     PISSN: 15219097     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICPADS.2006.81     Document Type: Conference Paper
Times cited : (15)

References (38)
  • 1
    • 2442517698 scopus 로고    scopus 로고
    • Parallel program performance prediction using deterministic task graph analysis
    • V. S. Adve and M. K. Vernon. Parallel program performance prediction using deterministic task graph analysis. ACM Trans. Comput. Syst., 22(1):94-136, 2004.
    • (2004) ACM Trans. Comput. Syst , vol.22 , Issue.1 , pp. 94-136
    • Adve, V.S.1    Vernon, M.K.2
  • 3
    • 0347133254 scopus 로고    scopus 로고
    • Exploiting distributed-memory and shared-memory parallelism on clusters of smps with data parallel programs
    • S. Benkner and V. Sipková. Exploiting distributed-memory and shared-memory parallelism on clusters of smps with data parallel programs. International Journal of Parallel Programming, 31(1):3-19, 2003.
    • (2003) International Journal of Parallel Programming , vol.31 , Issue.1 , pp. 3-19
    • Benkner, S.1    Sipková, V.2
  • 5
    • 34047232776 scopus 로고    scopus 로고
    • J. Bull. Measuring synchronisation and scheduling over-heads in openmp. In European Workshop on OpenMP (EWOMP1999), Lund, Sweden, 1999.
    • J. Bull. Measuring synchronisation and scheduling over-heads in openmp. In European Workshop on OpenMP (EWOMP1999), Lund, Sweden, 1999.
  • 8
    • 33645202282 scopus 로고    scopus 로고
    • Assessing performance of hybrid mpi/openmp programs on smp clusters
    • Technical Report UCRL-JC-143957, Lawrence Livermore National Laboratory, May
    • E. Chow and D. Hysom. Assessing performance of hybrid mpi/openmp programs on smp clusters. Technical Report UCRL-JC-143957, Lawrence Livermore National Laboratory, May 2001.
    • (2001)
    • Chow, E.1    Hysom, D.2
  • 15
    • 0346882110 scopus 로고    scopus 로고
    • D. R. Helman and J. Jaacute;J. Prefix computations on symmetric multiprocessors. J. Parallel. Distrib. Comput., 61(2):265-278, 2001.
    • D. R. Helman and J. Jaacute;J. Prefix computations on symmetric multiprocessors. J. Parallel. Distrib. Comput., 61(2):265-278, 2001.
  • 16
    • 0003293945 scopus 로고    scopus 로고
    • Performance of hybrid message-passing and shared-memory parallelism for discrete element modeling
    • D. S. Henty. Performance of hybrid message-passing and shared-memory parallelism for discrete element modeling. In Supercomputing 2000, pages 50-50, 2000.
    • (2000) Supercomputing 2000 , pp. 50-50
    • Henty, D.S.1
  • 17
    • 34047215377 scopus 로고    scopus 로고
    • Parallel osem reconstruction speed with mpi, openmp, and hybrid mpi-openmp programming models
    • Rome, Italy, October
    • M. D. Jones and R. Yao. Parallel osem reconstruction speed with mpi, openmp, and hybrid mpi-openmp programming models. In IEEE Nuclear Science Symposium and Medical Imaging Conference Record, Rome, Italy, October 2004.
    • (2004) IEEE Nuclear Science Symposium and Medical Imaging Conference Record
    • Jones, M.D.1    Yao, R.2
  • 18
    • 84876347047 scopus 로고    scopus 로고
    • Fast measurement of logp parameters for message passing platforms
    • T. Kielmann, H. E. Bal, and K. Verstoep. Fast measurement of logp parameters for message passing platforms. In IPDPS Workshops, pages 1176-1183, 2000.
    • (2000) IPDPS Workshops , pp. 1176-1183
    • Kielmann, T.1    Bal, H.E.2    Verstoep, K.3
  • 19
    • 34548776288 scopus 로고    scopus 로고
    • Perfsuite: An accessible, open source, performance analysis environment for linux
    • Chapel Hill, NC, April
    • R. Kufrin. Perfsuite: An accessible, open source, performance analysis environment for linux. In 6th International Conference on Linux Clusters (LCI-2005), Chapel Hill, NC, April 2005.
    • (2005) 6th International Conference on Linux Clusters (LCI-2005)
    • Kufrin, R.1
  • 21
    • 0008458295 scopus 로고    scopus 로고
    • Conjugate-gradients algorithms: An mpi-openmp implementation on
    • P. Lanucara and S. Rovida. Conjugate-gradients algorithms: An mpi-openmp implementation on. In First European Workshop on OpenMP, pages 76-78, 1999.
    • (1999) First European Workshop on OpenMP , pp. 76-78
    • Lanucara, P.1    Rovida, S.2
  • 22
  • 23
    • 0033873170 scopus 로고    scopus 로고
    • Parallel performance study of monte carlo photon transport code on shared-, distributed-, and distributed-shared-memory architectures
    • A. Majumdar. Parallel performance study of monte carlo photon transport code on shared-, distributed-, and distributed-shared-memory architectures. In IPDPS, pages 93-, 2000.
    • (2000) IPDPS , pp. 93
    • Majumdar, A.1
  • 25
    • 0032137545 scopus 로고    scopus 로고
    • A compiler optimization algorithm for shared-memory multiprocessors
    • K. S. McKinley. A compiler optimization algorithm for shared-memory multiprocessors. IEEE Trans. Parallel Distrib. Syst., 9(8):769-787, 1998.
    • (1998) IEEE Trans. Parallel Distrib. Syst , vol.9 , Issue.8 , pp. 769-787
    • McKinley, K.S.1
  • 27
    • 0036734103 scopus 로고    scopus 로고
    • Effects of ordering strategies and programming paradigms on sparse matrix computations
    • L. Oliker, X. Li, P. Husbands, and R. Biswas. Effects of ordering strategies and programming paradigms on sparse matrix computations. SIAM Rev., 44(3):373-393, 2002.
    • (2002) SIAM Rev , vol.44 , Issue.3 , pp. 373-393
    • Oliker, L.1    Li, X.2    Husbands, P.3    Biswas, R.4
  • 28
    • 34047212337 scopus 로고    scopus 로고
    • OpenMP
    • OpenMP. http://www.openmp.org.
  • 29
    • 34047223642 scopus 로고    scopus 로고
    • OpenUH
    • OpenUH. http://www.cs.uh.edu/õpenuh.
  • 33
    • 34047213631 scopus 로고    scopus 로고
    • SPHINX
    • SPHINX, http://www.llnl.gov/casc/sphinx/sphinx.html.
  • 34
    • 20444497314 scopus 로고    scopus 로고
    • Parallel-multigrid computation of unsteady incompressible viscous flows using a matrix-free implicit method and high-resolution characteristics-based scheme
    • C. H. Tail, Y. Zhao, and K. M. Liew. Parallel-multigrid computation of unsteady incompressible viscous flows using a matrix-free implicit method and high-resolution characteristics-based scheme. Computer Methods in Applied Mechanics and Engineering, 194(36-38):3949-3983, 2005.
    • (2005) Computer Methods in Applied Mechanics and Engineering , vol.194 , Issue.36-38 , pp. 3949-3983
    • Tail, C.H.1    Zhao, Y.2    Liew, K.M.3
  • 35
    • 34047231600 scopus 로고    scopus 로고
    • M. B. van Gijzen. Two level parallelism in a stream-function model for global ocean circulation. Technical Report TR/-PA/03/09, CERFACS, Toulouse, France, 2003.
    • M. B. van Gijzen. Two level parallelism in a stream-function model for global ocean circulation. Technical Report TR/-PA/03/09, CERFACS, Toulouse, France, 2003.
  • 36
    • 34047235040 scopus 로고    scopus 로고
    • A parallel computing framework for dynamic power balancing in adaptive mesh refinement applications
    • May
    • H. W. and T. D. K. A parallel computing framework for dynamic power balancing in adaptive mesh refinement applications. In Parallel CFD99, Wiiliamsburg, VA, May 1999.
    • (1999) Parallel CFD99, Wiiliamsburg, VA
    • W., H.1    K., T.D.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.