SCOPUS 정보 검색 플랫폼

Proceedings of the International Conference on Parallel and Distributed Systems - ICPADS

Volumn 2, Issue , 2006, Pages 3-8

Performance modeling of communication and computation in hybrid MPI and OpenMP applications

(2) Adhianto, Laksono a Chapman, Barbara a

a University of Houston (United States)

Author keywords

[No Author keywords available]

Indexed keywords

PERFORMANCE EVALUATION; RUNTIME SYSTEM;

COMPUTATIONAL METHODS; COMPUTER SIMULATION; OPTIMIZATION; PARAMETER ESTIMATION;

PARALLEL PROGRAMMING;

EID: 34047216159 PISSN: 15219097 EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/ICPADS.2006.81 Document Type: Conference Paper

Times cited : (15)

References (38)

1
- 2442517698
- Parallel program performance prediction using deterministic task graph analysis
- V. S. Adve and M. K. Vernon. Parallel program performance prediction using deterministic task graph analysis. ACM Trans. Comput. Syst., 22(1):94-136, 2004.
- (2004) ACM Trans. Comput. Syst , vol.22 , Issue.1 , pp. 94-136
- Adve, V.S.¹ Vernon, M.K.²

2
- 0029193089
- Loggp: Incorporating long messages into the logp model: one step closer towards a realistic model for parallel computation
- New York, NY, USA, ACM Press
- A. Alexandrov, M. F. Ionescu, K. E. Schauser, and C. Scheiman. Loggp: incorporating long messages into the logp model: one step closer towards a realistic model for parallel computation. In SPAA '95: Proceedings of the seventh annual ACM symposium on Parallel algorithms and architectures, pages 95-105, New York, NY, USA, 1995. ACM Press.
- (1995) SPAA '95: Proceedings of the seventh annual ACM symposium on Parallel algorithms and architectures , pp. 95-105
- Alexandrov, A.¹ Ionescu, M.F.² Schauser, K.E.³ Scheiman, C.⁴

3
- 0347133254
- Exploiting distributed-memory and shared-memory parallelism on clusters of smps with data parallel programs
- S. Benkner and V. Sipková. Exploiting distributed-memory and shared-memory parallelism on clusters of smps with data parallel programs. International Journal of Parallel Programming, 31(1):3-19, 2003.
- (2003) International Journal of Parallel Programming , vol.31 , Issue.1 , pp. 3-19
- Benkner, S.¹ Sipková, V.²

4
- 0035448025
- Parallel programming with message passing and directives
- 2001
- S. W. Bova, C. P. Breshears, H. Gabb, B. Kuhn, B. Magro, R. Eigenmann, G. Gaertner, S. Salvini, and H. Scott. Parallel programming with message passing and directives. Computing in Science and Engineering, 3(5):22-37, /2001.
- Computing in Science and Engineering , vol.3 , Issue.5 , pp. 22-37
- Bova, S.W.¹ Breshears, C.P.² Gabb, H.³ Kuhn, B.⁴ Magro, B.⁵ Eigenmann, R.⁶ Gaertner, G.⁷ Salvini, S.⁸ Scott, H.⁹

5
- 34047232776
- J. Bull. Measuring synchronisation and scheduling over-heads in openmp. In European Workshop on OpenMP (EWOMP1999), Lund, Sweden, 1999.
- J. Bull. Measuring synchronisation and scheduling over-heads in openmp. In European Workshop on OpenMP (EWOMP1999), Lund, Sweden, 1999.

6
- 34047219262
- Mixed openmp and mpi for parallel fortran applications
- Edinburgh, UK
- I. J. Bush, C. J. Noble, and R. J. Allan. Mixed openmp and mpi for parallel fortran applications. In European Workshop on OpenMP (EWOMP2000), Edinburgh, UK, 2000.
- (2000) European Workshop on OpenMP (EWOMP2000)
- Bush, I.J.¹ Noble, C.J.² Allan, R.J.³

7
- 85054165140
- Mpi versus mpi+openmp on ibm sp for the nas benchmarks
- F. Cappello and D. Etiemble. Mpi versus mpi+openmp on ibm sp for the nas benchmarks. In SC2000, Supercomputing 2000, November, Dallas, 2000.
- (2000) SC2000, Supercomputing 2000, November, Dallas
- Cappello, F.¹ Etiemble, D.²

8
- 33645202282
- Assessing performance of hybrid mpi/openmp programs on smp clusters
- Technical Report UCRL-JC-143957, Lawrence Livermore National Laboratory, May
- E. Chow and D. Hysom. Assessing performance of hybrid mpi/openmp programs on smp clusters. Technical Report UCRL-JC-143957, Lawrence Livermore National Laboratory, May 2001.
- (2001)
- Chow, E.¹ Hysom, D.²

9
- 0009346826
- Logp: Towards a realistic model of parallel computation
- New York, NY, USA, ACM Press
- D. Culler, R. Karp, D. Patterson, A. Sahay, K. E. Schauser, E. Santos, R. Subramonian, and T. von Eicken. Logp: towards a realistic model of parallel computation. In PPOPP '93: Proceedings of the fourth ACM SIGPLAN symposium on Principles and practice of parallel programming, pages 1-12, New York, NY, USA, 1993. ACM Press.
- (1993) PPOPP '93: Proceedings of the fourth ACM SIGPLAN symposium on Principles and practice of parallel programming , pp. 1-12
- Culler, D.¹ Karp, R.² Patterson, D.³ Sahay, A.⁴ Schauser, K.E.⁵ Santos, E.⁶ Subramonian, R.⁷ von Eicken, T.⁸

10
- 12444315069
- Performance Comparison of Pure MPI vs Hybrid MPI-OpenMP Parallelization Models on SMP Clusters
- Santa Fe, New Mexico, Apr
- N. Drosinos and N. Koziris. Performance Comparison of Pure MPI vs Hybrid MPI-OpenMP Parallelization Models on SMP Clusters. In Proceedings of the 18th International Parallel and Distributed Processing Symposium 2004 (IPDPS 2004), page 15, Santa Fe, New Mexico, Apr. 2004.
- (2004) Proceedings of the 18th International Parallel and Distributed Processing Symposium 2004 (IPDPS 2004) , pp. 15
- Drosinos, N.¹ Koziris, N.²

11
- 34047200450
- M. P. I. Forum, http://www.mpi-forum.org.
- Forum, M.P.I.¹

12
- 0030721811
- Can shared-memory model serve as a bridging model for parallel computation?
- New York, NY, USA, ACM Press
- P. B. Gibbons, Y. Matias, and V. Ramachandran. Can shared-memory model serve as a bridging model for parallel computation? In SPAA '97: Proceedings of the ninth annual ACM symposium on Parallel algorithms and architectures, pages 72-83, New York, NY, USA, 1997. ACM Press.
- (1997) SPAA '97: Proceedings of the ninth annual ACM symposium on Parallel algorithms and architectures , pp. 72-83
- Gibbons, P.B.¹ Matias, Y.² Ramachandran, V.³

13
- 34047203084
- Working Note WN/-PA/01/19, CERFACS, Toulouse, France
- L. Giraud. Combining shared and distributed memory programming models on clusters of symmetric multiprocessors: Some basic promising experiments. Working Note WN/-PA/01/19, CERFACS, Toulouse, France, 2001.
- (2001) Combining shared and distributed memory programming models on clusters of symmetric multiprocessors: Some basic promising experiments
- Giraud, L.¹

14
- 12444295451
- P. GmbH. Pallas mpi benchmarks - pmb, http://www.pallas.de/pages/pmbd. htm.
- Pallas mpi benchmarks - pmb
- GmbH, P.¹

15
- 0346882110
- D. R. Helman and J. Jaacute;J. Prefix computations on symmetric multiprocessors. J. Parallel. Distrib. Comput., 61(2):265-278, 2001.
- D. R. Helman and J. Jaacute;J. Prefix computations on symmetric multiprocessors. J. Parallel. Distrib. Comput., 61(2):265-278, 2001.

16
- 0003293945
- Performance of hybrid message-passing and shared-memory parallelism for discrete element modeling
- D. S. Henty. Performance of hybrid message-passing and shared-memory parallelism for discrete element modeling. In Supercomputing 2000, pages 50-50, 2000.
- (2000) Supercomputing 2000 , pp. 50-50
- Henty, D.S.¹

17
- 34047215377
- Parallel osem reconstruction speed with mpi, openmp, and hybrid mpi-openmp programming models
- Rome, Italy, October
- M. D. Jones and R. Yao. Parallel osem reconstruction speed with mpi, openmp, and hybrid mpi-openmp programming models. In IEEE Nuclear Science Symposium and Medical Imaging Conference Record, Rome, Italy, October 2004.
- (2004) IEEE Nuclear Science Symposium and Medical Imaging Conference Record
- Jones, M.D.¹ Yao, R.²

18
- 84876347047
- Fast measurement of logp parameters for message passing platforms
- T. Kielmann, H. E. Bal, and K. Verstoep. Fast measurement of logp parameters for message passing platforms. In IPDPS Workshops, pages 1176-1183, 2000.
- (2000) IPDPS Workshops , pp. 1176-1183
- Kielmann, T.¹ Bal, H.E.² Verstoep, K.³

19
- 34548776288
- Perfsuite: An accessible, open source, performance analysis environment for linux
- Chapel Hill, NC, April
- R. Kufrin. Perfsuite: An accessible, open source, performance analysis environment for linux. In 6th International Conference on Linux Clusters (LCI-2005), Chapel Hill, NC, April 2005.
- (2005) 6th International Conference on Linux Clusters (LCI-2005)
- Kufrin, R.¹

20
- 12444290884
- A Source Code Analyzer for Performance Prediction
- IEEE
- M. Kühnemann, T. Rauber, and G. Rũnger. A Source Code Analyzer for Performance Prediction. In Proc. of the JPDPS-Workshop on Massively Parallel Processing (CDROM). IEEE, 2004.
- (2004) Proc. of the JPDPS-Workshop on Massively Parallel Processing (CDROM)
- Kühnemann, M.¹ Rauber, T.² Rũnger, G.³

21
- 0008458295
- Conjugate-gradients algorithms: An mpi-openmp implementation on
- P. Lanucara and S. Rovida. Conjugate-gradients algorithms: An mpi-openmp implementation on. In First European Workshop on OpenMP, pages 76-78, 1999.
- (1999) First European Workshop on OpenMP , pp. 76-78
- Lanucara, P.¹ Rovida, S.²

22
- 1642555145
- A Hybrid MPI-OpenMP Implementation of an Implicit Finite-Element Code on Parallel Architectures
- G. Mahinthakumar and F. Saied. A Hybrid MPI-OpenMP Implementation of an Implicit Finite-Element Code on Parallel Architectures. International Journal of High Performance Computing Applications, 16(4):371-393, 2002.
- (2002) International Journal of High Performance Computing Applications , vol.16 , Issue.4 , pp. 371-393
- Mahinthakumar, G.¹ Saied, F.²

23
- 0033873170
- Parallel performance study of monte carlo photon transport code on shared-, distributed-, and distributed-shared-memory architectures
- A. Majumdar. Parallel performance study of monte carlo photon transport code on shared-, distributed-, and distributed-shared-memory architectures. In IPDPS, pages 93-, 2000.
- (2000) IPDPS , pp. 93
- Majumdar, A.¹

24
- 8344269521
- Cross-architecture performance predictions for scientific applications using parameterized models
- New York, NY, USA, ACM Press
- G. Marin and J. Mellor-Crummey. Cross-architecture performance predictions for scientific applications using parameterized models. In SIGMETRICS 2004/PERFORMANCE 2004: Proceedings of the joint international conference on Measurement and modeling of computer systems, pages 2-13, New York, NY, USA, 2004. ACM Press.
- (2004) SIGMETRICS 2004/PERFORMANCE 2004: Proceedings of the joint international conference on Measurement and modeling of computer systems , pp. 2-13
- Marin, G.¹ Mellor-Crummey, J.²

25
- 0032137545
- A compiler optimization algorithm for shared-memory multiprocessors
- K. S. McKinley. A compiler optimization algorithm for shared-memory multiprocessors. IEEE Trans. Parallel Distrib. Syst., 9(8):769-787, 1998.
- (1998) IEEE Trans. Parallel Distrib. Syst , vol.9 , Issue.8 , pp. 769-787
- McKinley, K.S.¹

26
- 33845387737
- Using dynamic tracing sampling to measure long running programs
- Washington, DC, USA, IEEE Computer Society
- J. Odom, J. K. Hollingsworth, L. DeRose, K. Ekanadham, and S. Sbaraglia. Using dynamic tracing sampling to measure long running programs. In SC '05: Proceedings of the 2005 ACM/IEEE conference on Supercomputing, page 59, Washington, DC, USA, 2005. IEEE Computer Society.
- (2005) SC '05: Proceedings of the 2005 ACM/IEEE conference on Supercomputing , pp. 59
- Odom, J.¹ Hollingsworth, J.K.² DeRose, L.³ Ekanadham, K.⁴ Sbaraglia, S.⁵

27
- 0036734103
- Effects of ordering strategies and programming paradigms on sparse matrix computations
- L. Oliker, X. Li, P. Husbands, and R. Biswas. Effects of ordering strategies and programming paradigms on sparse matrix computations. SIAM Rev., 44(3):373-393, 2002.
- (2002) SIAM Rev , vol.44 , Issue.3 , pp. 373-393
- Oliker, L.¹ Li, X.² Husbands, P.³ Biswas, R.⁴

28
- 34047212337
- OpenMP
- OpenMP. http://www.openmp.org.

29
- 34047223642
- OpenUH
- OpenUH. http://www.cs.uh.edu/õpenuh.

30
- 84957882532
- Skampi: A detailed, accurate MPI benchmark
- R. Reussner, P. Sanders, L. Prechelt, and M. Muller. Skampi: A detailed, accurate MPI benchmark. In PVM/MPI, pages 52-59, 1998.
- (1998) PVM/MPI , pp. 52-59
- Reussner, R.¹ Sanders, P.² Prechelt, L.³ Muller, M.⁴

31
- 12844275862
- Locality phase prediction
- New York, NY, USA, ACM Press
- X. Shen, Y. Zhong, and C. Ding. Locality phase prediction. In ASPLOS-XI: Proceedings of the 11th international conference on Architectural support for programming languages and operating systems, pages 165-176, New York, NY, USA, 2004. ACM Press.
- (2004) ASPLOS-XI: Proceedings of the 11th international conference on Architectural support for programming languages and operating systems , pp. 165-176
- Shen, X.¹ Zhong, Y.² Ding, C.³

32
- 80053252314
- A framework for performance modeling and prediction
- Los Alamitos, CA, USA, IEEE Computer Society Press
- A. Snavely, L. Carrington, N. Wolter, J. Labarta, R. Badia, and A. Purkayastha. A framework for performance modeling and prediction. In Supercomputing '02: Proceedings of the 2002 ACM/IEEE conference on Supercomputing, pages 1-17, Los Alamitos, CA, USA, 2002. IEEE Computer Society Press.
- (2002) Supercomputing '02: Proceedings of the 2002 ACM/IEEE conference on Supercomputing , pp. 1-17
- Snavely, A.¹ Carrington, L.² Wolter, N.³ Labarta, J.⁴ Badia, R.⁵ Purkayastha, A.⁶

33
- 34047213631
- SPHINX
- SPHINX, http://www.llnl.gov/casc/sphinx/sphinx.html.

34
- 20444497314
- Parallel-multigrid computation of unsteady incompressible viscous flows using a matrix-free implicit method and high-resolution characteristics-based scheme
- C. H. Tail, Y. Zhao, and K. M. Liew. Parallel-multigrid computation of unsteady incompressible viscous flows using a matrix-free implicit method and high-resolution characteristics-based scheme. Computer Methods in Applied Mechanics and Engineering, 194(36-38):3949-3983, 2005.
- (2005) Computer Methods in Applied Mechanics and Engineering , vol.194 , Issue.36-38 , pp. 3949-3983
- Tail, C.H.¹ Zhao, Y.² Liew, K.M.³

35
- 34047231600
- M. B. van Gijzen. Two level parallelism in a stream-function model for global ocean circulation. Technical Report TR/-PA/03/09, CERFACS, Toulouse, France, 2003.
- M. B. van Gijzen. Two level parallelism in a stream-function model for global ocean circulation. Technical Report TR/-PA/03/09, CERFACS, Toulouse, France, 2003.

36
- 34047235040
- A parallel computing framework for dynamic power balancing in adaptive mesh refinement applications
- May
- H. W. and T. D. K. A parallel computing framework for dynamic power balancing in adaptive mesh refinement applications. In Parallel CFD99, Wiiliamsburg, VA, May 1999.
- (1999) Parallel CFD99, Wiiliamsburg, VA
- W., H.¹ K., T.D.²

37
- 35248816473
- Eclipse - an open source platform for the next generation of development tools
- London, UK, Springer-Verlag
- A. Weinand. Eclipse - an open source platform for the next generation of development tools. InNODe '02: Revised Papers from the International Conference NetObjectDays on Objects, Components, Architectures, Services, and Applications for a Networked World, page 3, London, UK, 2003. Springer-Verlag.
- (2003) NODe '02: Revised Papers from the International Conference NetObjectDays on Objects, Components, Architectures, Services, and Applications for a Networked World , pp. 3
- Weinand, A.¹

38
- 0030379246
- Combining loop transformations considering caches and scheduling
- IEEE Computer Society
- M. E. Wolf, D. E. Maydan, and D.-K. Chen. Combining loop transformations considering caches and scheduling. In Proceedings of the 29th annual ACM/IEEE international symposium on Microarchitecture, pages 274-286. IEEE Computer Society, 1996.
- (1996) Proceedings of the 29th annual ACM/IEEE international symposium on Microarchitecture , pp. 274-286
- Wolf, M.E.¹ Maydan, D.E.² Chen, D.-K.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.