-
1
-
-
2442517698
-
Parallel program performance prediction using deterministic task graph analysis
-
Adve V.S., and Vernon M.K. Parallel program performance prediction using deterministic task graph analysis. ACM Transactions On Computer Systems 22 1 (2004) 94-136
-
(2004)
ACM Transactions On Computer Systems
, vol.22
, Issue.1
, pp. 94-136
-
-
Adve, V.S.1
Vernon, M.K.2
-
2
-
-
0029193089
-
Loggp: incorporating long messages into the logp model: one step closer towards a realistic model for parallel computation
-
ACM Press, New York, NY, USA
-
Alexandrov A., Ionescu M.F., Schauser K.E., and Scheiman C. Loggp: incorporating long messages into the logp model: one step closer towards a realistic model for parallel computation. SPAA '95: Proceedings of the seventh annual ACM symposium on Parallel algorithms and architectures (1995), ACM Press, New York, NY, USA 95-105
-
(1995)
SPAA '95: Proceedings of the seventh annual ACM symposium on Parallel algorithms and architectures
, pp. 95-105
-
-
Alexandrov, A.1
Ionescu, M.F.2
Schauser, K.E.3
Scheiman, C.4
-
3
-
-
0347133254
-
Exploiting distributed-memory and shared-memory parallelism on clusters of smps with data parallel programs
-
Benkner S., and Sipkov'a V. Exploiting distributed-memory and shared-memory parallelism on clusters of smps with data parallel programs. International Journal of Parallel Programming 31 1 (2003) 3-19
-
(2003)
International Journal of Parallel Programming
, vol.31
, Issue.1
, pp. 3-19
-
-
Benkner, S.1
Sipkov'a, V.2
-
4
-
-
0035448025
-
Parallel programming with message passing and directives
-
Bova S.W., Breshears C.P., Gabb H., Kuhn B., Magro B., Eigenmann R., Gaertner G., Salvini S., and Scott H. Parallel programming with message passing and directives. Computing in Science and Engineering 3 5 (2001) 22-37
-
(2001)
Computing in Science and Engineering
, vol.3
, Issue.5
, pp. 22-37
-
-
Bova, S.W.1
Breshears, C.P.2
Gabb, H.3
Kuhn, B.4
Magro, B.5
Eigenmann, R.6
Gaertner, G.7
Salvini, S.8
Scott, H.9
-
5
-
-
33947407825
-
-
J.M. Bull, Measuring synchronisation and scheduling overheads in openmp, in: In European Workshop on OpenMP (EWOMP1999), Lund, Sweden, 1999.
-
-
-
-
6
-
-
33947355118
-
-
I.J. Bush, C.J. Noble, R.J. Allan, Mixed openmp and mpi for parallel fortran applications, in: In European Workshop on OpenMP (EWOMP2000), Edinburgh, UK, 2000.
-
-
-
-
7
-
-
33947394170
-
-
F. Cappello, D. Etiemble, Mpi versus mpi + openmp on ibm sp for the nas benchmarks, in: In SC2000, Supercomputing 2000, November, Dallas, 2000.
-
-
-
-
8
-
-
33947426186
-
-
Edmond Chow, David Hysom. Assessing performance of hybrid mpi/openmp programs on smp clusters. Technical Report UCRL-JC-143957, Lawrence Livermore National Laboratory, May 2001.
-
-
-
-
9
-
-
0009346826
-
Logp: towards a realistic model of parallel computation
-
ACM Press, New York, NY, USA
-
Culler D., Karp R., Patterson D., Sahay A., Schauser K.E., Santos E., Subramonian R., and Eicken T.v. Logp: towards a realistic model of parallel computation. PPOPP '93: Proceedings of the fourth ACM SIGPLAN symposium on Principles and practice of parallel programming (1993), ACM Press, New York, NY, USA 1-12
-
(1993)
PPOPP '93: Proceedings of the fourth ACM SIGPLAN symposium on Principles and practice of parallel programming
, pp. 1-12
-
-
Culler, D.1
Karp, R.2
Patterson, D.3
Sahay, A.4
Schauser, K.E.5
Santos, E.6
Subramonian, R.7
Eicken, T.v.8
-
10
-
-
12444315069
-
-
N. Drosinos, N. Koziris. Performance comparison of pure MPI vs hybrid MPI-OpenMP parallelization models on SMP clusters, in: Proceedings of the 18th International Parallel and Distributed Processing Symposium 2004 (IPDPS 2004), Santa Fe, New Mexico, April 2004, p. 15.
-
-
-
-
11
-
-
33947388965
-
-
Message Passing Interface Forum. .
-
-
-
-
12
-
-
0030721811
-
Can sharedmemory model serve as a bridging model for parallel computation?
-
ACM Press, New York, NY, USA
-
Gibbons P.B., Matias Y., and Ramachandran V. Can sharedmemory model serve as a bridging model for parallel computation?. SPAA '97: Proceedings of the ninth annual ACM symposium on Parallel algorithms and architectures (1997), ACM Press, New York, NY, USA 72-83
-
(1997)
SPAA '97: Proceedings of the ninth annual ACM symposium on Parallel algorithms and architectures
, pp. 72-83
-
-
Gibbons, P.B.1
Matias, Y.2
Ramachandran, V.3
-
13
-
-
33947417844
-
-
L. Giraud, Combining shared and distributed memory programming models on clusters of symmetric multiprocessors: some basic promising experiments. Working Note WN/PA/01/19, CERFACS, Toulouse, France, 2001.
-
-
-
-
14
-
-
33947433774
-
-
Pallas GmbH. Pallas mpi benchmarks - pmb. .
-
-
-
-
15
-
-
84958053214
-
Reproducible measurements of MPI performance characteristics
-
Recent Advances in Parallel Virtual Machine and Message Passing Interface. Dongarra J., Luque E., and Margalef T. (Eds), Springer Verlag 6th European PVM/MPI Users' Group Meeting, Barcelona, Spain, September 1999
-
Gropp W.D., and Lusk E. Reproducible measurements of MPI performance characteristics. In: Dongarra J., Luque E., and Margalef T. (Eds). Recent Advances in Parallel Virtual Machine and Message Passing Interface. Lecture Notes in Computer Science vol. 1697 (1999), Springer Verlag 11-18 6th European PVM/MPI Users' Group Meeting, Barcelona, Spain, September 1999
-
(1999)
Lecture Notes in Computer Science
, vol.1697
, pp. 11-18
-
-
Gropp, W.D.1
Lusk, E.2
-
17
-
-
33947382375
-
-
D.S. Henty, Performance of hybrid message-passing and shared-memory parallelism for discrete element modeling, in: Supercomputing 2000, 2000.
-
-
-
-
18
-
-
0028401457
-
The communication challenge for mpp: Intel paragon and meiko cs-2
-
Hockney R.W. The communication challenge for mpp: Intel paragon and meiko cs-2. Parallel Computation 20 3 (1994) 389-398
-
(1994)
Parallel Computation
, vol.20
, Issue.3
, pp. 389-398
-
-
Hockney, R.W.1
-
19
-
-
33947364128
-
-
M.D. Jones, R. Yao, Parallel osem reconstruction speed with mpi, openmp, and hybrid mpi-openmp programming models, in: In IEEE Nuclear Science Symposium and Medical Imaging Conference Record, Rome, Italy, October 2004.
-
-
-
-
20
-
-
84876347047
-
-
Thilo Kielmann, Henri E. Bal, Kees Verstoep, Fast measurement of logp parameters for message passing platforms, in: IPDPS Workshops, 2000, pp. 1176-1183.
-
-
-
-
21
-
-
33947420584
-
-
Rick Kufrin, Perfsuite: an accessible, open source, performance analysis environment for linux, in: 6th International Conference on Linux Clusters (LCI- 2005), Chapel Hill, NC, April 2005.
-
-
-
-
22
-
-
12444290884
-
-
M. Kühnemann, T. Rauber, G. Rünger, A source code analyzer for performance prediction, in: Proceedings of the IPDPS-Workshop on Massively Parallel Processing (CDROM), IEEE, 2004.
-
-
-
-
23
-
-
33947410530
-
-
Piero Lanucara, Sergio Rovida, Conjugate-gradients algorithms: an mpiopenmp implementation on, in: First European Workshop on OpenMP, 1999, pp. 76-78.
-
-
-
-
25
-
-
0033873170
-
-
Amitava Majumdar, Parallel performance study of monte carlo photon transport code on shared-, distributed-, and distributed-shared-memory architectures, in: IPDPS, 2000, p. 93.
-
-
-
-
26
-
-
8344269521
-
Cross-architecture performance predictions for scientific applications using parameterized models
-
ACM Press, New York, NY, USA
-
Marin G., and Mellor-Crummey J. Cross-architecture performance predictions for scientific applications using parameterized models. SIGMETRICS 2004/PERFORMANCE 2004: Proceedings of the joint international conference on Measurement and modeling of computer systems (2004), ACM Press, New York, NY, USA 2-13
-
(2004)
SIGMETRICS 2004/PERFORMANCE 2004: Proceedings of the joint international conference on Measurement and modeling of computer systems
, pp. 2-13
-
-
Marin, G.1
Mellor-Crummey, J.2
-
27
-
-
0032137545
-
A compiler optimization algorithm for shared-memory multiprocessors
-
McKinley K.S. A compiler optimization algorithm for shared-memory multiprocessors. IEEE Transactions on Parallel and Distributed Systems 9 8 (1998) 769-787
-
(1998)
IEEE Transactions on Parallel and Distributed Systems
, vol.9
, Issue.8
, pp. 769-787
-
-
McKinley, K.S.1
-
28
-
-
33947428938
-
-
P. Mucci, K. London, The Mpbench Report, 1998.
-
-
-
-
29
-
-
33845387737
-
Using dynamic tracing sampling to measure long running programs
-
IEEE Computer Society, Washington, DC, USA
-
Odom J., Hollingsworth J.K., DeRose L., Ekanadham K., and Sbaraglia S. Using dynamic tracing sampling to measure long running programs. SC '05: Proceedings of the 2005 ACM/IEEE conference on Supercomputing (2005), IEEE Computer Society, Washington, DC, USA 59
-
(2005)
SC '05: Proceedings of the 2005 ACM/IEEE conference on Supercomputing
, pp. 59
-
-
Odom, J.1
Hollingsworth, J.K.2
DeRose, L.3
Ekanadham, K.4
Sbaraglia, S.5
-
30
-
-
0036734103
-
Effects of ordering strategies and programming paradigms on sparse matrix computations
-
Oliker L., Li X., Husbands P., and Biswas R. Effects of ordering strategies and programming paradigms on sparse matrix computations. SIAM Rev. 44 3 (2002) 373-393
-
(2002)
SIAM Rev.
, vol.44
, Issue.3
, pp. 373-393
-
-
Oliker, L.1
Li, X.2
Husbands, P.3
Biswas, R.4
-
31
-
-
33947416311
-
-
Open64. .
-
-
-
-
32
-
-
33947413241
-
-
OpenMP. .
-
-
-
-
33
-
-
33947356391
-
-
OpenUH. .
-
-
-
-
34
-
-
84957882532
-
-
Ralf Reussner, Peter Sanders, Lutz Prechelt, and Matthias Muller. Skampi: A detailed, accurate MPI benchmark. In PVM/MPI, 1998, pp. 52-59.
-
-
-
-
35
-
-
0036082072
-
Skampi: a comprehensive benchmark for public benchmarking of mpi
-
Reussner R., Sanders P., and Träff J.L. Skampi: a comprehensive benchmark for public benchmarking of mpi. Scientific Programming 10 1 (2002) 55-65
-
(2002)
Scientific Programming
, vol.10
, Issue.1
, pp. 55-65
-
-
Reussner, R.1
Sanders, P.2
Träff, J.L.3
-
36
-
-
12844275862
-
Locality phase prediction
-
ACM Press, New York, NY, USA
-
Shen X., Zhong Y., and Ding C. Locality phase prediction. ASPLOS-XI: Proceedings of the 11th international conference on Architectural support for programming languages and operating systems (2004), ACM Press, New York, NY, USA 165-176
-
(2004)
ASPLOS-XI: Proceedings of the 11th international conference on Architectural support for programming languages and operating systems
, pp. 165-176
-
-
Shen, X.1
Zhong, Y.2
Ding, C.3
-
37
-
-
80053252314
-
A framework for performance modeling and prediction
-
IEE Computer Society Press, Los Alamitos, CA, USA
-
Snavely A., Carrington L., Wolter N., Labarta J., Badia R., and Purkayastha A. A framework for performance modeling and prediction. Supercomputing'02: Proceedings of the 2002 ACM/IEEE conference on Supercomputing (2002), IEE Computer Society Press, Los Alamitos, CA, USA 1-17
-
(2002)
Supercomputing'02: Proceedings of the 2002 ACM/IEEE conference on Supercomputing
, pp. 1-17
-
-
Snavely, A.1
Carrington, L.2
Wolter, N.3
Labarta, J.4
Badia, R.5
Purkayastha, A.6
-
38
-
-
33947376222
-
-
SPHINX. .
-
-
-
-
39
-
-
20444497314
-
Parallel-multigrid computation of unsteady incompressible viscous flows using a matrix-free implicit method and high-resolution characteristics-based scheme
-
Tai1 C.H., Zhao Y., and Liew K.M. Parallel-multigrid computation of unsteady incompressible viscous flows using a matrix-free implicit method and high-resolution characteristics-based scheme. Computer Methods in Applied Mechanics and Engineering 194 36-38 (2005) 3949-3983
-
(2005)
Computer Methods in Applied Mechanics and Engineering
, vol.194
, Issue.36-38
, pp. 3949-3983
-
-
Tai1, C.H.1
Zhao, Y.2
Liew, K.M.3
-
40
-
-
33947381549
-
-
M.B. van Gijzen. Two level parallelism in a stream-function model for global ocean circulation. Technical Report TR/PA/03/09, CERFACS, Toulouse, France, 2003.
-
-
-
-
41
-
-
33947368794
-
-
Huang W. and Tafti D.K.A parallel computing framework for dynamic power balancing in adaptive mesh refinement applications. In Parallel CFD99, Wiiliamsburg, VA, May 1999.
-
-
-
-
42
-
-
35248816473
-
Eclipse - an open source platform for the next generation of development tools
-
Springer-Verlag, London, UK
-
Weinand A. Eclipse - an open source platform for the next generation of development tools. NODe '02: Revised Papers from the International Conference NetObjectDays on Objects, Components, Architectures, Services, and Applications for a Networked World (2003), Springer-Verlag, London, UK 3
-
(2003)
NODe '02: Revised Papers from the International Conference NetObjectDays on Objects, Components, Architectures, Services, and Applications for a Networked World
, pp. 3
-
-
Weinand, A.1
|