-
1
-
-
2442517698
-
Parallel program performance prediction using deterministic task graph analysis
-
V. S. Adve and M. K. Vernon. Parallel program performance prediction using deterministic task graph analysis. ACM Trans. Comput. Syst., 22(1):94-136, 2004.
-
(2004)
ACM Trans. Comput. Syst
, vol.22
, Issue.1
, pp. 94-136
-
-
Adve, V.S.1
Vernon, M.K.2
-
2
-
-
0029193089
-
Loggp: Incorporating long messages into the logp model: one step closer towards a realistic model for parallel computation
-
New York, NY, USA, ACM Press
-
A. Alexandrov, M. F. Ionescu, K. E. Schauser, and C. Scheiman. Loggp: incorporating long messages into the logp model: one step closer towards a realistic model for parallel computation. In SPAA '95: Proceedings of the seventh annual ACM symposium on Parallel algorithms and architectures, pages 95-105, New York, NY, USA, 1995. ACM Press.
-
(1995)
SPAA '95: Proceedings of the seventh annual ACM symposium on Parallel algorithms and architectures
, pp. 95-105
-
-
Alexandrov, A.1
Ionescu, M.F.2
Schauser, K.E.3
Scheiman, C.4
-
3
-
-
0347133254
-
Exploiting distributed-memory and shared-memory parallelism on clusters of smps with data parallel programs
-
S. Benkner and V. Sipková. Exploiting distributed-memory and shared-memory parallelism on clusters of smps with data parallel programs. International Journal of Parallel Programming, 31(1):3-19, 2003.
-
(2003)
International Journal of Parallel Programming
, vol.31
, Issue.1
, pp. 3-19
-
-
Benkner, S.1
Sipková, V.2
-
4
-
-
0035448025
-
Parallel programming with message passing and directives
-
2001
-
S. W. Bova, C. P. Breshears, H. Gabb, B. Kuhn, B. Magro, R. Eigenmann, G. Gaertner, S. Salvini, and H. Scott. Parallel programming with message passing and directives. Computing in Science and Engineering, 3(5):22-37, /2001.
-
Computing in Science and Engineering
, vol.3
, Issue.5
, pp. 22-37
-
-
Bova, S.W.1
Breshears, C.P.2
Gabb, H.3
Kuhn, B.4
Magro, B.5
Eigenmann, R.6
Gaertner, G.7
Salvini, S.8
Scott, H.9
-
5
-
-
34047232776
-
-
J. Bull. Measuring synchronisation and scheduling over-heads in openmp. In European Workshop on OpenMP (EWOMP1999), Lund, Sweden, 1999.
-
J. Bull. Measuring synchronisation and scheduling over-heads in openmp. In European Workshop on OpenMP (EWOMP1999), Lund, Sweden, 1999.
-
-
-
-
8
-
-
33645202282
-
Assessing performance of hybrid mpi/openmp programs on smp clusters
-
Technical Report UCRL-JC-143957, Lawrence Livermore National Laboratory, May
-
E. Chow and D. Hysom. Assessing performance of hybrid mpi/openmp programs on smp clusters. Technical Report UCRL-JC-143957, Lawrence Livermore National Laboratory, May 2001.
-
(2001)
-
-
Chow, E.1
Hysom, D.2
-
9
-
-
0009346826
-
Logp: Towards a realistic model of parallel computation
-
New York, NY, USA, ACM Press
-
D. Culler, R. Karp, D. Patterson, A. Sahay, K. E. Schauser, E. Santos, R. Subramonian, and T. von Eicken. Logp: towards a realistic model of parallel computation. In PPOPP '93: Proceedings of the fourth ACM SIGPLAN symposium on Principles and practice of parallel programming, pages 1-12, New York, NY, USA, 1993. ACM Press.
-
(1993)
PPOPP '93: Proceedings of the fourth ACM SIGPLAN symposium on Principles and practice of parallel programming
, pp. 1-12
-
-
Culler, D.1
Karp, R.2
Patterson, D.3
Sahay, A.4
Schauser, K.E.5
Santos, E.6
Subramonian, R.7
von Eicken, T.8
-
12
-
-
0030721811
-
Can shared-memory model serve as a bridging model for parallel computation?
-
New York, NY, USA, ACM Press
-
P. B. Gibbons, Y. Matias, and V. Ramachandran. Can shared-memory model serve as a bridging model for parallel computation? In SPAA '97: Proceedings of the ninth annual ACM symposium on Parallel algorithms and architectures, pages 72-83, New York, NY, USA, 1997. ACM Press.
-
(1997)
SPAA '97: Proceedings of the ninth annual ACM symposium on Parallel algorithms and architectures
, pp. 72-83
-
-
Gibbons, P.B.1
Matias, Y.2
Ramachandran, V.3
-
15
-
-
0346882110
-
-
D. R. Helman and J. Jaacute;J. Prefix computations on symmetric multiprocessors. J. Parallel. Distrib. Comput., 61(2):265-278, 2001.
-
D. R. Helman and J. Jaacute;J. Prefix computations on symmetric multiprocessors. J. Parallel. Distrib. Comput., 61(2):265-278, 2001.
-
-
-
-
16
-
-
0003293945
-
Performance of hybrid message-passing and shared-memory parallelism for discrete element modeling
-
D. S. Henty. Performance of hybrid message-passing and shared-memory parallelism for discrete element modeling. In Supercomputing 2000, pages 50-50, 2000.
-
(2000)
Supercomputing 2000
, pp. 50-50
-
-
Henty, D.S.1
-
17
-
-
34047215377
-
Parallel osem reconstruction speed with mpi, openmp, and hybrid mpi-openmp programming models
-
Rome, Italy, October
-
M. D. Jones and R. Yao. Parallel osem reconstruction speed with mpi, openmp, and hybrid mpi-openmp programming models. In IEEE Nuclear Science Symposium and Medical Imaging Conference Record, Rome, Italy, October 2004.
-
(2004)
IEEE Nuclear Science Symposium and Medical Imaging Conference Record
-
-
Jones, M.D.1
Yao, R.2
-
18
-
-
84876347047
-
Fast measurement of logp parameters for message passing platforms
-
T. Kielmann, H. E. Bal, and K. Verstoep. Fast measurement of logp parameters for message passing platforms. In IPDPS Workshops, pages 1176-1183, 2000.
-
(2000)
IPDPS Workshops
, pp. 1176-1183
-
-
Kielmann, T.1
Bal, H.E.2
Verstoep, K.3
-
19
-
-
34548776288
-
Perfsuite: An accessible, open source, performance analysis environment for linux
-
Chapel Hill, NC, April
-
R. Kufrin. Perfsuite: An accessible, open source, performance analysis environment for linux. In 6th International Conference on Linux Clusters (LCI-2005), Chapel Hill, NC, April 2005.
-
(2005)
6th International Conference on Linux Clusters (LCI-2005)
-
-
Kufrin, R.1
-
21
-
-
0008458295
-
Conjugate-gradients algorithms: An mpi-openmp implementation on
-
P. Lanucara and S. Rovida. Conjugate-gradients algorithms: An mpi-openmp implementation on. In First European Workshop on OpenMP, pages 76-78, 1999.
-
(1999)
First European Workshop on OpenMP
, pp. 76-78
-
-
Lanucara, P.1
Rovida, S.2
-
23
-
-
0033873170
-
Parallel performance study of monte carlo photon transport code on shared-, distributed-, and distributed-shared-memory architectures
-
A. Majumdar. Parallel performance study of monte carlo photon transport code on shared-, distributed-, and distributed-shared-memory architectures. In IPDPS, pages 93-, 2000.
-
(2000)
IPDPS
, pp. 93
-
-
Majumdar, A.1
-
24
-
-
8344269521
-
Cross-architecture performance predictions for scientific applications using parameterized models
-
New York, NY, USA, ACM Press
-
G. Marin and J. Mellor-Crummey. Cross-architecture performance predictions for scientific applications using parameterized models. In SIGMETRICS 2004/PERFORMANCE 2004: Proceedings of the joint international conference on Measurement and modeling of computer systems, pages 2-13, New York, NY, USA, 2004. ACM Press.
-
(2004)
SIGMETRICS 2004/PERFORMANCE 2004: Proceedings of the joint international conference on Measurement and modeling of computer systems
, pp. 2-13
-
-
Marin, G.1
Mellor-Crummey, J.2
-
25
-
-
0032137545
-
A compiler optimization algorithm for shared-memory multiprocessors
-
K. S. McKinley. A compiler optimization algorithm for shared-memory multiprocessors. IEEE Trans. Parallel Distrib. Syst., 9(8):769-787, 1998.
-
(1998)
IEEE Trans. Parallel Distrib. Syst
, vol.9
, Issue.8
, pp. 769-787
-
-
McKinley, K.S.1
-
26
-
-
33845387737
-
Using dynamic tracing sampling to measure long running programs
-
Washington, DC, USA, IEEE Computer Society
-
J. Odom, J. K. Hollingsworth, L. DeRose, K. Ekanadham, and S. Sbaraglia. Using dynamic tracing sampling to measure long running programs. In SC '05: Proceedings of the 2005 ACM/IEEE conference on Supercomputing, page 59, Washington, DC, USA, 2005. IEEE Computer Society.
-
(2005)
SC '05: Proceedings of the 2005 ACM/IEEE conference on Supercomputing
, pp. 59
-
-
Odom, J.1
Hollingsworth, J.K.2
DeRose, L.3
Ekanadham, K.4
Sbaraglia, S.5
-
27
-
-
0036734103
-
Effects of ordering strategies and programming paradigms on sparse matrix computations
-
L. Oliker, X. Li, P. Husbands, and R. Biswas. Effects of ordering strategies and programming paradigms on sparse matrix computations. SIAM Rev., 44(3):373-393, 2002.
-
(2002)
SIAM Rev
, vol.44
, Issue.3
, pp. 373-393
-
-
Oliker, L.1
Li, X.2
Husbands, P.3
Biswas, R.4
-
28
-
-
34047212337
-
-
OpenMP
-
OpenMP. http://www.openmp.org.
-
-
-
-
29
-
-
34047223642
-
-
OpenUH
-
OpenUH. http://www.cs.uh.edu/õpenuh.
-
-
-
-
30
-
-
84957882532
-
Skampi: A detailed, accurate MPI benchmark
-
R. Reussner, P. Sanders, L. Prechelt, and M. Muller. Skampi: A detailed, accurate MPI benchmark. In PVM/MPI, pages 52-59, 1998.
-
(1998)
PVM/MPI
, pp. 52-59
-
-
Reussner, R.1
Sanders, P.2
Prechelt, L.3
Muller, M.4
-
31
-
-
12844275862
-
Locality phase prediction
-
New York, NY, USA, ACM Press
-
X. Shen, Y. Zhong, and C. Ding. Locality phase prediction. In ASPLOS-XI: Proceedings of the 11th international conference on Architectural support for programming languages and operating systems, pages 165-176, New York, NY, USA, 2004. ACM Press.
-
(2004)
ASPLOS-XI: Proceedings of the 11th international conference on Architectural support for programming languages and operating systems
, pp. 165-176
-
-
Shen, X.1
Zhong, Y.2
Ding, C.3
-
32
-
-
80053252314
-
A framework for performance modeling and prediction
-
Los Alamitos, CA, USA, IEEE Computer Society Press
-
A. Snavely, L. Carrington, N. Wolter, J. Labarta, R. Badia, and A. Purkayastha. A framework for performance modeling and prediction. In Supercomputing '02: Proceedings of the 2002 ACM/IEEE conference on Supercomputing, pages 1-17, Los Alamitos, CA, USA, 2002. IEEE Computer Society Press.
-
(2002)
Supercomputing '02: Proceedings of the 2002 ACM/IEEE conference on Supercomputing
, pp. 1-17
-
-
Snavely, A.1
Carrington, L.2
Wolter, N.3
Labarta, J.4
Badia, R.5
Purkayastha, A.6
-
33
-
-
34047213631
-
-
SPHINX
-
SPHINX, http://www.llnl.gov/casc/sphinx/sphinx.html.
-
-
-
-
34
-
-
20444497314
-
Parallel-multigrid computation of unsteady incompressible viscous flows using a matrix-free implicit method and high-resolution characteristics-based scheme
-
C. H. Tail, Y. Zhao, and K. M. Liew. Parallel-multigrid computation of unsteady incompressible viscous flows using a matrix-free implicit method and high-resolution characteristics-based scheme. Computer Methods in Applied Mechanics and Engineering, 194(36-38):3949-3983, 2005.
-
(2005)
Computer Methods in Applied Mechanics and Engineering
, vol.194
, Issue.36-38
, pp. 3949-3983
-
-
Tail, C.H.1
Zhao, Y.2
Liew, K.M.3
-
35
-
-
34047231600
-
-
M. B. van Gijzen. Two level parallelism in a stream-function model for global ocean circulation. Technical Report TR/-PA/03/09, CERFACS, Toulouse, France, 2003.
-
M. B. van Gijzen. Two level parallelism in a stream-function model for global ocean circulation. Technical Report TR/-PA/03/09, CERFACS, Toulouse, France, 2003.
-
-
-
-
36
-
-
34047235040
-
A parallel computing framework for dynamic power balancing in adaptive mesh refinement applications
-
May
-
H. W. and T. D. K. A parallel computing framework for dynamic power balancing in adaptive mesh refinement applications. In Parallel CFD99, Wiiliamsburg, VA, May 1999.
-
(1999)
Parallel CFD99, Wiiliamsburg, VA
-
-
W., H.1
K., T.D.2
-
37
-
-
35248816473
-
Eclipse - an open source platform for the next generation of development tools
-
London, UK, Springer-Verlag
-
A. Weinand. Eclipse - an open source platform for the next generation of development tools. InNODe '02: Revised Papers from the International Conference NetObjectDays on Objects, Components, Architectures, Services, and Applications for a Networked World, page 3, London, UK, 2003. Springer-Verlag.
-
(2003)
NODe '02: Revised Papers from the International Conference NetObjectDays on Objects, Components, Architectures, Services, and Applications for a Networked World
, pp. 3
-
-
Weinand, A.1
|