-
2
-
-
0028734038
-
Building a high-performance collective communication library
-
M. Barnett, S. Gupta, D.G. Payne, L. Schuler, R. van de Geijn, and J. Watts, "Building a High-Performance Collective Communication Library," Proc. Conf. Supercomputing '94, pp. 107-116, 1994.
-
(1994)
Proc. Conf. Supercomputing '94
, pp. 107-116
-
-
Barnett, M.1
Gupta, S.2
Payne, D.G.3
Schuler, L.4
Van De Geijn, R.5
Watts, J.6
-
3
-
-
20444467676
-
On optimizing collective communication
-
E.W. Chan, M.F. Heimlich, A. Purkayastha, and R.A. van de Geijn, "On Optimizing Collective Communication," Proc. IEEE Int'l Conf. Cluster Computing (CLUSTER), 2004.
-
(2004)
Proc. IEEE Int'l Conf. Cluster Computing (CLUSTER)
-
-
Chan, E.W.1
Heimlich, M.F.2
Purkayastha, A.3
Van De Geijn, R.A.4
-
4
-
-
0030287932
-
A practical model of parallel computation
-
D.E. Culler, R.M. Karp, D. Patterson, A. Sahay, E.E. Santos, K.E. Schauser, R. Subramonian, and T. von Eicken, "LogP: A Practical Model of Parallel Computation," Comm. ACM, vol.39, no.11, pp. 78-85, 1996. (Pubitemid 126428083)
-
(1996)
Communications of the ACM
, vol.39
, Issue.11
, pp. 78-85
-
-
Culler, D.E.1
Karp, R.M.2
Patterson, D.3
Sahay, A.4
Santos, E.E.5
Schauser, K.E.6
Subramonian, R.7
Von Eicken, T.8
-
5
-
-
37549003336
-
MapReduce: Simplified data processing on large clusters
-
J. Dean and S. Ghemawat, "MapReduce: Simplified Data Processing on Large Clusters," Comm. ACM, vol.51, no.1, pp. 107-113, 2008.
-
(2008)
Comm. ACM
, vol.51
, Issue.1
, pp. 107-113
-
-
Dean, J.1
Ghemawat, S.2
-
7
-
-
70349746305
-
PRO: A model for the design and analysis of efficient and scalable parallel algorithms
-
A.H. Gebremedhin, M. Essaïdi, I.G. Lassous, J. Gustedt, and J.A. Telle, "PRO: A Model for the Design and Analysis of Efficient and Scalable Parallel Algorithms," Nordic J. Computing, vol.13, pp. 215- 239, 2006.
-
(2006)
Nordic J. Computing
, vol.13
, pp. 215-239
-
-
Gebremedhin, A.H.1
Essaïdi, M.2
Lassous, I.G.3
Gustedt, J.4
Telle, J.A.5
-
8
-
-
84957015019
-
Validation of dimemas communication model for mpi collective operations
-
S. Girona, J. Labarta, and R.M. Badia, "Validation of Dimemas Communication Model for MPI Collective Operations," Proc. Recent Advances in Parallel Virtual Machine and Message Passing Interface: Seventh European PVM/MPI Users' Group Meeting, pp. 39- 46, 2000.
-
(2000)
Proc. Recent Advances in Parallel Virtual Machine and Message Passing Interface: Seventh European PVM/MPI Users' Group Meeting
, pp. 39-46
-
-
Girona, S.1
Labarta, J.2
Badia, R.M.3
-
9
-
-
0001064241
-
Toward formally-based design of message passing programs
-
Mar.
-
S. Gorlatch, "Toward Formally-Based Design of Message Passing Programs," IEEE Trans. Software Eng., vol.26, no.3, pp. 276-288, Mar. 2000.
-
(2000)
IEEE Trans. Software Eng.
, vol.26
, Issue.3
, pp. 276-288
-
-
Gorlatch, S.1
-
10
-
-
1242332596
-
Send-Receive considered harmful: Myths and realities of message passing
-
S. Gorlatch, "Send-Receive Considered Harmful: Myths and Realities of Message Passing," ACM Trans. Programming Languages and Systems, vol.26, no.1, pp. 47-56, 2004.
-
(2004)
ACM Trans. Programming Languages and Systems
, vol.26
, Issue.1
, pp. 47-56
-
-
Gorlatch, S.1
-
11
-
-
0003710739
-
-
MIT Press
-
W. Gropp, S. Huss-Lederman, A. Lumsdaine, E. Lusk, B. Nitzberg, W. Saphir, and M. Snir, MPI-The Complete Reference, vol.2. MIT Press, 1998.
-
(1998)
MPI-The Complete Reference
, vol.2
-
-
Gropp, W.1
Huss-Lederman, S.2
Lumsdaine, A.3
Lusk, E.4
Nitzberg, B.5
Saphir, W.6
Snir, M.7
-
12
-
-
56449092705
-
Self- Consistent MPI IO performance requirements and expectations
-
W.D. Gropp, D. Kimpe, R. Ross, R. Thakur, and J.L. Träff, "Self- Consistent MPI IO Performance Requirements and Expectations," Proc. Recent Advances in Parallel Virtual Machine and Message Passing Interface: 15th European PVM/MPI Users' Group Meeting, pp. 167-176, 2008.
-
(2008)
Proc. Recent Advances in Parallel Virtual Machine and Message Passing Interface: 15th European PVM/MPI Users' Group Meeting
, pp. 167-176
-
-
Gropp, W.D.1
Kimpe, D.2
Ross, R.3
Thakur, R.4
Träff, J.L.5
-
13
-
-
34250877184
-
Exchanging multiple messages via MPI
-
The Univ. of Edinburgh
-
J. Hein, S. Booth, and M. Bull, "Exchanging Multiple Messages via MPI," Technical Report HPCxTR0308, EPCC, The Univ. of Edinburgh, 2003.
-
(2003)
Technical Report HPCxTR0308, EPCC
-
-
Hein, J.1
Booth, S.2
Bull, M.3
-
14
-
-
38149121511
-
Netgauge: A network performance measurement framework
-
T. Hoefler, T. Mehlan, A. Lumsdaine, and W. Rehm, "Netgauge: A Network Performance Measurement Framework," Proc. High Performance Computing and Comm. (HPPC), pp. 659-671, 2007.
-
(2007)
Proc. High Performance Computing and Comm. (HPPC)
, pp. 659-671
-
-
Hoefler, T.1
Mehlan, T.2
Lumsdaine, A.3
Rehm, W.4
-
15
-
-
0030673395
-
Application restructuring and performance portability on shared virtual memory and Hardware-Coherent multiprocessors
-
D. Jiang, H. Shan, and J.P. Singh, "Application Restructuring and Performance Portability on Shared Virtual Memory and Hardware- Coherent Multiprocessors," Proc. Sixth ACM SIGPLAN Symp. Principles and Practice of Parallel Programming (PPoPP), pp. 217-229, 1997. (Pubitemid 127452555)
-
(1997)
SIGPLAN Notices (ACM Special Interest Group on Programming Languages)
, vol.32
, Issue.7
, pp. 217-229
-
-
Jiang, D.1
Shan, H.2
Singh, J.P.3
-
16
-
-
34548281165
-
A practical approach to performance analysis and modeling of large-scale systems
-
DOI 10.1145/1188455.1188670, Proceedings of the 2006 ACM/IEEE Conference on Supercomputing, SC'06
-
D.J. Kerbyson and A. Hoisie, "S05-A Practical Approach to Performance Analysis and Modeling of Large-Scale Systems," Proc. ACM/IEEE SC Conf. High Performance Networking and Computing, p. 206, 2006. (Pubitemid 47318737)
-
(2006)
Proceedings of the 2006 ACM/IEEE Conference on Supercomputing, SC'06
, pp. 1188670
-
-
Kerbyson, D.J.1
Hoisie, A.2
-
17
-
-
84876347047
-
Fast measurement of logp parameters for message passing platforms
-
T. Kielmann, H.E. Bal, and K. Verstoep, "Fast Measurement of LogP Parameters for Message Passing Platforms," Proc. Int'l Parallel and Distributed Processing Symp. (IPDPS '00), pp. 1176-1183, 2000.
-
(2000)
Proc. Int'l Parallel and Distributed Processing Symp. (IPDPS '00)
, pp. 1176-1183
-
-
Kielmann, T.1
Bal, H.E.2
Verstoep, K.3
-
18
-
-
33748875639
-
Optimizing MPI collective communication by orthogonal structures
-
DOI 10.1007/s10586-006-9740-9, Cluster Computing in Science and Engineering
-
M. Kühnemann, T. Rauber, and G. Rünger, "Optimizing MPI Collective Communication by Orthogonal Structures," Cluster Computing, vol.9, no.3, pp. 257-279, 2006. (Pubitemid 44419723)
-
(2006)
Cluster Computing
, vol.9
, Issue.3
, pp. 257-279
-
-
Kuhnemann, M.1
Rauber, T.2
Runger, G.3
-
20
-
-
28044435048
-
A performance model of non-deterministic particle transport on large-scale systems
-
DOI 10.1016/j.future.2004.11.018, PII S0167739X04002249
-
M.M. Mathis, D.J. Kerbyson, and A. Hoisie, "A Performance Model of Non-Deterministic Particle Transport on Large-Scale Systems," Future Generation Computer Systems, vol.22, no.3, pp. 324-335, 2006. (Pubitemid 41689814)
-
(2006)
Future Generation Computer Systems
, vol.22
, Issue.3
, pp. 324-335
-
-
Mathis, M.M.1
Kerbyson, D.J.2
Hoisie, A.3
-
21
-
-
36049024176
-
Computational quality of service for scientific cca applications: Composition, substitution, and reconfiguration
-
Feb.
-
L.C. McInnes, J. Ray, R. Armstrong, T.L. Dahlgren, A. Malony, B. Norris, S. Shende, J.P. Kenny, and J. Steensland, "Computational Quality of Service for Scientific CCA Applications: Composition, Substitution, and Reconfiguration," Technical Report ANL/MCSP1326- 0206, Argonne Nat'l Laboratory, Feb. 2006.
-
(2006)
Technical Report ANL/MCSP1326- 0206, Argonne Nat'l Laboratory
-
-
McInnes, L.C.1
Ray, J.2
Armstrong, R.3
Dahlgren, T.L.4
Malony, A.5
Norris, B.6
Shende, S.7
Kenny, J.P.8
Steensland, J.9
-
23
-
-
38449101453
-
Computational quality of service in parallel CFD
-
May
-
B. Norris, L. McInnes, and I. Veljkovic, "Computational Quality of Service in Parallel CFD," Proc. 17th Int'l Conf. Parallel Computational Fluid Dynamics, pp. 24-27, May 2006.
-
(2006)
Proc. 17th Int'l Conf. Parallel Computational Fluid Dynamics
, pp. 24-27
-
-
Norris, B.1
McInnes, L.2
Veljkovic, I.3
-
24
-
-
77950626520
-
An LPAR-Customized MPI-Alltoallv for the materials science Code CASTEP
-
The Univ. of Edinburgh
-
M. Plummer and K. Refson, "An LPAR-Customized MPI-Alltoallv for the Materials Science Code CASTEP," Technical Report HPCxTR0401, EPCC, The Univ. of Edinburgh, 2004.
-
(2004)
Technical Report HPCxTR0401, EPCC
-
-
Plummer, M.1
Refson, K.2
-
25
-
-
0038587352
-
Using SKaMPI for developing high-performance MOI programs with performance portability
-
R. Reussner, "Using SKaMPI for Developing High-Performance MOI Programs with Performance Portability," Future Generation Computing Systems, vol.19, no.5, pp. 749-759, 2003.
-
(2003)
Future Generation Computing Systems
, vol.19
, Issue.5
, pp. 749-759
-
-
Reussner, R.1
-
26
-
-
84957882532
-
SKaMPI: A detailed, accurate MPI benchmark
-
Recent Advances in Parallel Virtual Machine and Message Passing Interface
-
R. Reussner, P. Sanders, L. Prechelt, and M. Müller, "SKaMPI: A Detailed, Accurate MPI Benchmark," Proc. Recent Advances in Parallel Virtual Machine and Message Passing Interface: Fifth European PVM/MPI Users' Group Meeting, pp. 52-59, 1998. (Pubitemid 128135093)
-
(1998)
LECTURE NOTES IN COMPUTER SCIENCE
, Issue.1497
, pp. 52-62
-
-
Reussner, R.1
Sanders, P.2
Prechelt, L.3
Mueller, M.4
-
27
-
-
0036082072
-
SKaMPI: A comprehensive benchmark for public benchmarking of MPI
-
R. Reussner, P. Sanders, and J.L. Träff, "SKaMPI: A Comprehensive Benchmark for Public Benchmarking of MPI," Scientific Programming, vol.10, no.1, pp. 55-65, 2002. (Pubitemid 34685255)
-
(2002)
Scientific Programming
, vol.10
, Issue.1
, pp. 55-65
-
-
Reussner, R.1
Sanders, P.2
Traff, J.L.3
-
29
-
-
35048892196
-
Generation of simple analytical models for message passing applications
-
Euro-Par 2004 Parallel Processing
-
G. Rodríguez, R.M. Badia, and J. Labarta, "Generation of Simple Analytical Models for Message Passing Applications," Proc. Euro- Par '04 Parallel Processing Conf., pp. 183-188, 2004. (Pubitemid 39217270)
-
(2004)
LECTURE NOTES IN COMPUTER SCIENCE
, Issue.3149
, pp. 183-188
-
-
Rodriguez, G.1
Badia, R.M.2
Labarta, J.3
-
30
-
-
34748894221
-
X10: Concurrent programming for modern architectures
-
DOI 10.1145/1229428.1229483, Proceedings of the 2007 ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPoPP'07
-
V.A. Saraswat, V. Sarkar, and C. von Praun, "X10: Concurrent Programming for Modern Architectures," Proc. ACM SIGPLAN Symp. Principles and Practice of Parallel Programming (PPoPP), p. 271, 2007. (Pubitemid 47479115)
-
(2007)
Proceedings of the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPOPP
, pp. 271
-
-
Saraswat, V.A.1
Sarkar, V.2
Von Praun, C.3
-
31
-
-
0003710740
-
-
second ed. MIT Press
-
M. Snir, S. Otto, S. Huss-Lederman, D. Walker, and J. Dongarra, MPI-The Complete Reference, second ed., vol.1. MIT Press, 1998.
-
(1998)
MPI-The Complete Reference
, vol.1
-
-
Snir, M.1
Otto, S.2
Huss-Lederman, S.3
Walker, D.4
Dongarra, J.5
-
32
-
-
35248859849
-
Improving the performance of collective operations in MPICH
-
Recent Advances in Parallel Virtual Machine and Message Passing Interface
-
R. Thakur, W.D. Gropp, and R. Rabenseifner, "Improving the Performance of Collective Operations in MPICH," Int'l J. High Performance Computing Applications, vol.19, pp. 49-66, 2004. (Pubitemid 37240338)
-
(2003)
LECTURE NOTES IN COMPUTER SCIENCE
, Issue.2840
, pp. 257-267
-
-
Thakur, R.1
Gropp, W.D.2
-
35
-
-
38449083545
-
Self-Consistent MPI performance requirements
-
J.L. Träff, W. Gropp, and R. Thakur, "Self-Consistent MPI Performance Requirements," Proc. Recent Advances in Parallel Virtual Machine and Message Passing Interface: 14th European PVM/MPI Users' Group Meeting, pp. 36-45, 2007.
-
(2007)
Proc. Recent Advances in Parallel Virtual Machine and Message Passing Interface: 14th European PVM/MPI Users' Group Meeting
, pp. 36-45
-
-
Träff, J.L.1
Gropp, W.2
Thakur, R.3
-
36
-
-
0025467711
-
A bridging model for parallel computation
-
L.G. Valiant, "A Bridging Model for Parallel Computation," Comm. ACM, vol.33, no.8, pp. 103-111, 1990.
-
(1990)
Comm. ACM
, vol.33
, Issue.8
, pp. 103-111
-
-
Valiant, L.G.1
-
37
-
-
23844503894
-
Performance portability in the physical parameterizations of the community atmospheric model
-
P. Worley and J. Drake, "Performance Portability in the Physical Parameterizations of the Community Atmospheric Model," Int'l J. High Performance Computing Applications, vol.19, no.3, pp. 187-202, 2005.
-
(2005)
Int'l J. High Performance Computing Applications
, vol.19
, Issue.3
, pp. 187-202
-
-
Worley, P.1
Drake, J.2
-
41
-
-
34447552672
-
Parallel languages and compilers: Perspective
-
K. Yelick, P. Hilfinger, S. Graham, D. Bonachea, J. Su, A. Kamil, P.C.K. Datta, and T. Wen, "Parallel Languages and Compilers: Perspective from the Titanium Experience," Int'l J. High Performance Computing Applications, vol.21, pp. 266-290, 2007.
-
(2007)
Int'l J. High Performance Computing Applications
, vol.21
, pp. 266-290
-
-
Yelick, K.1
Hilfinger, P.2
Graham, S.3
Bonachea, D.4
Su, J.5
Kamil, A.6
Datta, P.C.K.7
Wen, T.8
|