-
1
-
-
84976742719
-
"Algorithm 539: Basic Linear Algebra Subprograms for FORTRAN Usage [F1]"
-
C. L. Lawson, R. J. Hanson, F. T. Krogh, and D. R. Kincaid, "Algorithm 539: Basic Linear Algebra Subprograms for FORTRAN Usage [F1]," ACM Trans. Math. Software 5, No. 3, 324-325 (1979).
-
(1979)
ACM Trans. Math. Software
, vol.5
, Issue.3
, pp. 324-325
-
-
Lawson, C.L.1
Hanson, R.J.2
Krogh, F.T.3
Kincaid, D.R.4
-
2
-
-
0343462141
-
"Automated Empirical Optimization of Software and the ATLAS Project"
-
R. C. Whaley, A. Petitet, and J. J. Dongarra, "Automated Empirical Optimization of Software and the ATLAS Project," Parallel Computing 27, No. 1/2, 3-35 (2001).
-
(2001)
Parallel Computing
, vol.27
, Issue.1-2
, pp. 3-35
-
-
Whaley, R.C.1
Petitet, A.2
Dongarra, J.J.3
-
3
-
-
0000793139
-
"Cramming More Components onto Integrated Circuits"
-
G. E. Moore, "Cramming More Components onto Integrated Circuits," Electronics 38, No. 8, 114-117 (1965).
-
(1965)
Electronics
, vol.38
, Issue.8
, pp. 114-117
-
-
Moore, G.E.1
-
5
-
-
0022874874
-
"Advanced Compiler Optimizations for Supercomputers"
-
D. A. Padua and M. J. Wolfe, "Advanced Compiler Optimizations for Supercomputers," Source Commun. ACM 29, No. 12, 1184-1201 (1986).
-
(1986)
Source Commun. ACM
, vol.29
, Issue.12
, pp. 1184-1201
-
-
Padua, D.A.1
Wolfe, M.J.2
-
6
-
-
33646107115
-
"Automatic Blocking of QR and LU Factorizations for Locality"
-
Q. Yi, K. Kennedy, H. You, K. Seymour, and J. Dongarra, "Automatic Blocking of QR and LU Factorizations for Locality," Proceedings of the ACM SIGPLAN Workshop on Memory System Performance, 2004, pp. 12-22.
-
(2004)
Proceedings of the ACM SIGPLAN Workshop on Memory System Performance
, pp. 12-22
-
-
Yi, Q.1
Kennedy, K.2
You, H.3
Seymour, K.4
Dongarra, J.5
-
7
-
-
0003929457
-
"Automatic Blocking of Nested Loops"
-
Department of Computer Science, University of Tennessee, Knoxville, TN 37996
-
R. Schreiber and J. Dongarra, "Automatic Blocking of Nested Loops," Technical Report CS-90-108, Department of Computer Science, University of Tennessee, Knoxville, TN 37996, 1990.
-
Technical Report CS-90-108
, pp. 1990
-
-
Schreiber, R.1
Dongarra, J.2
-
8
-
-
0030190854
-
"Improving Data Locality with Loop Transformations"
-
K. S. McKinley, S. Carr, and C.-W. Tseng, "Improving Data Locality with Loop Transformations," ACM Trans. Program. Lang. & Syst. 18, No. 4, 424-453 (1996).
-
(1996)
ACM Trans. Program. Lang. & Syst.
, vol.18
, Issue.4
, pp. 424-453
-
-
McKinley, K.S.1
Carr, S.2
Tseng, C.-W.3
-
10
-
-
20744452904
-
"Self-Adapting Linear Algebra Algorithms and Software"
-
J. Demmel, J. Dongarra, V. Eijkhout, E. Fuentes, A. Petitet, R. Vuduc, R. C. Whaley, and K. Yelick, "Self-Adapting Linear Algebra Algorithms and Software," Proc. IEEE 93, No. 2, 293-312 (2005).
-
(2005)
Proc. IEEE
, vol.93
, Issue.2
, pp. 293-312
-
-
Demmel, J.1
Dongarra, J.2
Eijkhout, V.3
Fuentes, E.4
Petitet, A.5
Vuduc, R.6
Whaley, R.C.7
Yelick, K.8
-
11
-
-
0030661485
-
"Optimizing Matrix Multiply Using PHiPAC: A Portable, High-Performance, ANSI C Coding Methodology"
-
J. Bilmes, K. Asanovic, C.-W. Chin, and J. Demmel, "Optimizing Matrix Multiply Using PHiPAC: A Portable, High-Performance, ANSI C Coding Methodology," Proceedings of the International Conference on Supercomputing, 1997, pp. 340-347.
-
(1997)
Proceedings of the International Conference on Supercomputing
, pp. 340-347
-
-
Bilmes, J.1
Asanovic, K.2
Chin, C.-W.3
Demmel, J.4
-
12
-
-
0031636309
-
"FFTW: An Adaptive Software Architecture for the FFT"
-
M. Frigo and S. G. Johnson, "FFTW: An Adaptive Software Architecture for the FFT," Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, 1998, pp. 1381-1384.
-
(1998)
Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing
, pp. 1381-1384
-
-
Frigo, M.1
Johnson, S.G.2
-
13
-
-
10744232785
-
"A Comparison of Empirical and Model-Driven Optimization"
-
K. Yotov, X. Li, G. Ren, M. Cibulskis, G. DeJong, M. Garzaran, D. Padua, K. Pingali, P. Stodghill, and P. Wu, "A Comparison of Empirical and Model-Driven Optimization," Proceedings of the ACM SIGPLAN Conference on Programming Language Design and Implementation, 2003, pp. 63-76.
-
(2003)
Proceedings of the ACM SIGPLAN Conference on Programming Language Design and Implementation
, pp. 63-76
-
-
Yotov, K.1
Li, X.2
Ren, G.3
Cibulskis, M.4
DeJong, G.5
Garzaran, M.6
Padua, D.7
Pingali, K.8
Stodghill, P.9
Wu, P.10
-
14
-
-
0000238336
-
"A Simplex Method for Function Minimization"
-
J. A. Nelder and R. Mead, "A Simplex Method for Function Minimization," The Computer J. 7, No. 4, 308-313 (1965).
-
(1965)
The Computer J.
, vol.7
, Issue.4
, pp. 308-313
-
-
Nelder, J.A.1
Mead, R.2
-
16
-
-
33646082955
-
"Classification and Utilization of Abstractions for Optimization"
-
D. Quinlan, M. Schordan, Q. Yi, and A. Saebjornsen, "Classification and Utilization of Abstractions for Optimization," Proceedings of the 1st International Symposium on Leveraging Applications of Formal Methods, 2004, pp. 2-9.
-
(2004)
Proceedings of the 1st International Symposium on Leveraging Applications of Formal Methods
, pp. 2-9
-
-
Quinlan, D.1
Schordan, M.2
Yi, Q.3
Saebjornsen, A.4
-
17
-
-
0004493166
-
"On the Approximability of Minimizing Nonzero Variables or Unsatisfied Relations in Linear Systems"
-
E. Amaldi and V. Kann, "On the Approximability of Minimizing Nonzero Variables or Unsatisfied Relations in Linear Systems," Theoret. Computer Sci. 209, 237-260 (1998).
-
(1998)
Theoret. Computer Sci.
, vol.209
, pp. 237-260
-
-
Amaldi, E.1
Kann, V.2
-
21
-
-
0024018137
-
"A Polynomial Approximation Scheme for Machine Scheduling on Uniform Processors: Using the Dual Approach"
-
D. S. Hochbaum and D. B. Shmoys, "A Polynomial Approximation Scheme for Machine Scheduling on Uniform Processors: Using the Dual Approach," SIAM J. Computing 17, No. 3, 539-551 (1988).
-
(1988)
SIAM J. Computing
, vol.17
, Issue.3
, pp. 539-551
-
-
Hochbaum, D.S.1
Shmoys, D.B.2
-
23
-
-
0000438412
-
"Approximation Algorithms for Scheduling Unrelated Parallel Machines"
-
J. Lenstra, D. Shmoys, and E. Tardos, "Approximation Algorithms for Scheduling Unrelated Parallel Machines," Math. Program. 46, No. 3, 259-271 (1990).
-
(1990)
Math. Program.
, vol.46
, Issue.3
, pp. 259-271
-
-
Lenstra, J.1
Shmoys, D.2
Tardos, E.3
-
24
-
-
0038368778
-
"Deploying Parallel Numerical Library Routines to Cluster Computing in a Self-Adapting Fashion"
-
Imperial College Press, London
-
K. J. Roche and J. J. Dongarra, "Deploying Parallel Numerical Library Routines to Cluster Computing in a Self-Adapting Fashion," Parallel Computing: Advances and Current Issues, Imperial College Press, London, 2002.
-
(2002)
Parallel Computing: Advances and Current Issues
-
-
Roche, K.J.1
Dongarra, J.J.2
-
25
-
-
0003706460
-
-
Third Edition, Society for Industrial and Applied Mathematics, Philadelphia
-
E. Anderson, Z. Bai, C. Bischof, S. L. Blackford, J. W. Demmel, J. J. Dongarra, J. Du Croz, A. Greenbaum, S. Hammarling, A. McKenney, and D. C. Sorensen, LAPACK User's Guide, Third Edition, Society for Industrial and Applied Mathematics, Philadelphia, 1999.
-
(1999)
LAPACK User's Guide
-
-
Anderson, E.1
Bai, Z.2
Bischof, C.3
Blackford, S.L.4
Demmel, J.W.5
Dongarra, J.J.6
Du Croz, J.7
Greenbaum, A.8
Hammarling, S.9
McKenney, A.10
Sorensen, D.C.11
-
26
-
-
0003615167
-
-
Society for Industrial and Applied Mathematics, Philadelphia
-
L. S. Blackford, J. Choi, A. Cleary, E. F. D'Azevedo, J. W. Demmel, I. S. Dhillon, J. J. Dongarra, S. Hammarling, G. Henry, A. Petitet, K. Stanley, D. W. Walker, and R. C. Whaley, ScaLAPACK Users' Guide, Society for Industrial and Applied Mathematics, Philadelphia, 1997.
-
(1997)
ScaLAPACK Users' Guide
-
-
Blackford, L.S.1
Choi, J.2
Cleary, A.3
D'Azevedo, E.F.4
Demmel, J.W.5
Dhillon, I.S.6
Dongarra, J.J.7
Hammarling, S.8
Henry, G.9
Petitet, A.10
Stanley, K.11
Walker, D.W.12
Whaley, R.C.13
-
27
-
-
0001439335
-
"MPI: A Message-Passing Interface Standard"
-
Message Passing Interface Forum
-
Message Passing Interface Forum, "MPI: A Message-Passing Interface Standard," Intl. J. Supercomputer Appl. & High Perform. Computing 8, No. 3/4, 159-416 (1994).
-
(1994)
Intl. J. Supercomputer Appl. & High Perform. Computing
, vol.8
, Issue.3-4
, pp. 159-416
-
-
-
28
-
-
33646105586
-
-
Message Passing Interface Forum, MPI: A Message-Passing Interface Standard Version 1.1, see
-
Message Passing Interface Forum, MPI: A Message-Passing Interface Standard Version 1.1, 1995; see http://www.mpi-forum.org/docs/docs.html.
-
(1995)
-
-
-
29
-
-
33646113109
-
-
Message Passing Interface Forum, MPI-2: Extensions to the Message-Passing Interface, see
-
Message Passing Interface Forum, MPI-2: Extensions to the Message-Passing Interface, 1997; see http://www.mpi-forum.org/docs/ mpi2-report.pdf.
-
(1997)
-
-
-
30
-
-
33646119829
-
-
MPICH; see
-
MPICH; see http://www.mcs.anl.gov/mpi/mpich/.
-
-
-
-
31
-
-
33646116020
-
-
LAM/MPI Parallel Computing; see
-
LAM/MPI Parallel Computing; see http://www.lam-mpi.org/.
-
-
-
-
33
-
-
0030244536
-
"Design and Implementation of the ScaLAPACK LU, QR, and Cholesky Factorization Routines"
-
J. Choi, J. J. Dongarra, L. S. Ostrouchov, A. P. Petitet, D. W. Walker, and R. C. Whaley, "Design and Implementation of the ScaLAPACK LU, QR, and Cholesky Factorization Routines," Sci. Program. 5, No. 3, 173-184 (1996).
-
(1996)
Sci. Program.
, vol.5
, Issue.3
, pp. 173-184
-
-
Choi, J.1
Dongarra, J.J.2
Ostrouchov, L.S.3
Petitet, A.P.4
Walker, D.W.5
Whaley, R.C.6
-
34
-
-
33646092244
-
-
TOP500 Supercomputer Sites; see and http://www.netlib.org/benchmark/top500.html
-
TOP500 Supercomputer Sites; see http://www.top500.org and http://www.netlib.org/benchmark/top500.html.
-
-
-
-
35
-
-
0042674307
-
"The LINPACK Benchmark: Past, Present, and Future"
-
J. J. Dongarra, P. Luszczek, and A. Petitet, "The LINPACK Benchmark: Past, Present, and Future," Concurrency & Computation: Pract. & Exper. 15, No. 9, 803-820 (2003).
-
(2003)
Concurrency & Computation: Pract. & Exper.
, vol.15
, Issue.9
, pp. 803-820
-
-
Dongarra, J.J.1
Luszczek, P.2
Petitet, A.3
-
36
-
-
12444275589
-
"A Proposed Standard for Numerical Metadata"
-
Technical Report ICL-UT-03-02, Innovative Computing Laboratory, University of Tennessee, Knoxville, TN 37996
-
V. Eijkhout and E. Fuentes, "A Proposed Standard for Numerical Metadata," Technical Report ICL-UT-03-02, Innovative Computing Laboratory, University of Tennessee, Knoxville, TN 37996, 2003.
-
(2003)
-
-
Eijkhout, V.1
Fuentes, E.2
-
37
-
-
33646100261
-
-
Matrix Market; see
-
Matrix Market; see http://math.nist.gov/MatrixMarket.
-
-
-
-
38
-
-
84937397986
-
"Parallel Multilevel Algorithms for Multi-Constraint Graph Partitioning"
-
K. Schloegel, G. Karypis, and V. Kumar, "Parallel Multilevel Algorithms for Multi-Constraint Graph Partitioning," Proceedings of the 6th International Euro-Par Conference, 2000, pp. 296-310.
-
(2000)
Proceedings of the 6th International Euro-Par Conference
, pp. 296-310
-
-
Schloegel, K.1
Karypis, G.2
Kumar, V.3
-
39
-
-
33646101226
-
-
The ParMETIS/METIS package; see
-
The ParMETIS/METIS package; see http://glaros.dtc.umn.edu/gkhome/views/ metis/.
-
-
-
-
40
-
-
33646106745
-
"Automatic Determination of Matrix Blocks"
-
Technical Report UT-CS-01-458, Department of Computer Science, University of Tennessee, Knoxville, TN 37996
-
V. Eijkhout, "Automatic Determination of Matrix Blocks," Technical Report UT-CS-01-458, Department of Computer Science, University of Tennessee, Knoxville, TN 37996, 2001.
-
-
-
Eijkhout, V.1
-
41
-
-
33646110228
-
"Extending the MPI Specification for Process Fault Tolerance on High Performance Computing Systems"
-
G. E. Fagg, E. Gabriel, G. Bosilca, T. Angskun, Z. Chen, J. Pjesivac-Grbovic, K. London, and J. J. Dongarra, "Extending the MPI Specification for Process Fault Tolerance on High Performance Computing Systems," Proceedings of the International Supercomputer Conference, 2004.
-
(2004)
Proceedings of the International Supercomputer Conference
-
-
Fagg, G.E.1
Gabriel, E.2
Bosilca, G.3
Angskun, T.4
Chen, Z.5
Pjesivac-Grbovic, J.6
London, K.7
Dongarra, J.J.8
-
42
-
-
33646103394
-
"A Fault-Tolerant Communication Library for Grid Environments"
-
see
-
E. Gabriel, G. E. Fagg, A. Bukovsky, T. Angskun, and J. J. Dongarra, "A Fault-Tolerant Communication Library for Grid Environments," Proceedings of the 17th Annual ACM International Conference on Supercomputing (ICS'03), International Workshop on Grid Computing, 2003; see http://icl.cs.utk.edu/news_pub/submissions/FTMPI-SF-gabriel.pdf.
-
(2003)
Proceedings of the 17th Annual ACM International Conference on Supercomputing (ICS'03), International Workshop on Grid Computing
-
-
Gabriel, E.1
Fagg, G.E.2
Bukovsky, A.3
Angskun, T.4
Dongarra, J.J.5
-
43
-
-
0031570636
-
"Fault-Tolerant Matrix Operations for Networks of Workstations Using Diskless Checkpointing"
-
J. S. Plank, Y. Kim, and J. J. Dongarra, "Fault-Tolerant Matrix Operations for Networks of Workstations Using Diskless Checkpointing," J. Parallel & Distr. Computing 43, No. 2, 125-138 (1997).
-
(1997)
J. Parallel & Distr. Computing
, vol.43
, Issue.2
, pp. 125-138
-
-
Plank, J.S.1
Kim, Y.2
Dongarra, J.J.3
-
44
-
-
31844452364
-
"Recovery Patterns for Iterative Methods in a Parallel Unstable Environment"
-
Technical Report UT-CS-04-538, Computer Science Department, University of Tennessee, Knoxville, TN 37996
-
G. Bosilca, Z. Chen, J. Dongarra, and J. Langou, "Recovery Patterns for Iterative Methods in a Parallel Unstable Environment," Technical Report UT-CS-04-538, Computer Science Department, University of Tennessee, Knoxville, TN 37996, 2004.
-
-
-
Bosilca, G.1
Chen, Z.2
Dongarra, J.3
Langou, J.4
-
45
-
-
31844451082
-
"Building Fault Survivable MPI Programs with FT-MPI Using Diskless Checkpointing"
-
Z. Chen, G. E. Fagg, E. Gabriel, J. Langou, T. Angskun, G. Bosilca, and J. Dongarra, "Building Fault Survivable MPI Programs with FT-MPI Using Diskless Checkpointing," Proceedings of the ACM SIG-PLAN Symposium on Principles and Practice of Parallel Programming, 2005, pp. 213-223.
-
(2005)
Proceedings of the ACM SIG-PLAN Symposium on Principles and Practice of Parallel Programming
, pp. 213-223
-
-
Chen, Z.1
Fagg, G.E.2
Gabriel, E.3
Langou, J.4
Angskun, T.5
Bosilca, G.6
Dongarra, J.7
-
46
-
-
0037447584
-
"A Bandwidth Latency Tradeoff for Broadcast and Reduction"
-
P. Sanders and J. F. Sibeyn, "A Bandwidth Latency Tradeoff for Broadcast and Reduction," Info. Process. Lett. 86, No. 1, 33-38 (2003).
-
(2003)
Info. Process. Lett.
, vol.86
, Issue.1
, pp. 33-38
-
-
Sanders, P.1
Sibeyn, J.F.2
-
47
-
-
33646079822
-
"Development of Naturally Fault Tolerant Algorithms for Computing on 100,000 Processors"
-
see
-
C. Engelmann and G. A. Geist, "Development of Naturally Fault Tolerant Algorithms for Computing on 100,000 Processors," see http://www.csm.ornl.gov/~geist/Lyon2002-geist.pdf.
-
-
-
Engelmann, C.1
Geist, G.A.2
-
49
-
-
85095842731
-
"Automatically Tuned Collective Communications"
-
S. S. Vadhiyar, G. E. Fagg, and J. Dongarra, "Automatically Tuned Collective Communications," Proceedings of the ACM/IEEE Conference on Supercomputing, 2000, p. 3.
-
(2000)
Proceedings of the ACM/IEEE Conference on Supercomputing
, pp. 3
-
-
Vadhiyar, S.S.1
Fagg, G.E.2
Dongarra, J.3
-
50
-
-
1542396075
-
"Towards an Accurate Model for Collective Communications"
-
S. S. Vadhiyar, G. E. Fagg, and J. J. Dongarra, "Towards an Accurate Model for Collective Communications," Intl. J. High Perform. Computing Appl. 18, No. 1, 159-167 (2004).
-
(2004)
Intl. J. High Perform. Computing Appl.
, vol.18
, Issue.1
, pp. 159-167
-
-
Vadhiyar, S.S.1
Fagg, G.E.2
Dongarra, J.J.3
-
51
-
-
0028401457
-
"The Communication Challenge for MPP: Intel Paragon and Meiko CS-2"
-
(March)
-
R. W. Hockney, "The Communication Challenge for MPP: Intel Paragon and Meiko CS-2," Parallel Computing 20, No. 3, 389-398 (March 1994).
-
(1994)
Parallel Computing
, vol.20
, Issue.3
, pp. 389-398
-
-
Hockney, R.W.1
-
52
-
-
0009346826
-
"LogP: Towards a Realistic Model of Parallel Computation"
-
D. Culler, R. Karp, D. Patterson, A. Sahay, K. E. Schauser, E. Santos, R. Subramonian, and T. von Eicken, "LogP: Towards a Realistic Model of Parallel Computation," Proceedings of the 4th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 1993, pp. 1-12.
-
(1993)
Proceedings of the 4th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming
, pp. 1-12
-
-
Culler, D.1
Karp, R.2
Patterson, D.3
Sahay, A.4
Schauser, K.E.5
Santos, E.6
Subramonian, R.7
von Eicken, T.8
-
53
-
-
0029193089
-
"LogGP: Incorporating Long Messages into the LogP Model - One Step Closer Towards a Realistic Model for Parallel Computation"
-
A. Alexandrov, M. F. Ionescu, K. E. Schauser, and C. J. Scheiman, "LogGP: Incorporating Long Messages into the LogP Model - One Step Closer Towards a Realistic Model for Parallel Computation," Proceedings of the 7th Annual ACM Symposium on Parallel Algorithms and Architectures, 1995, pp. 95-105.
-
(1995)
Proceedings of the 7th Annual ACM Symposium on Parallel Algorithms and Architectures
, pp. 95-105
-
-
Alexandrov, A.1
Ionescu, M.F.2
Schauser, K.E.3
Scheiman, C.J.4
-
54
-
-
84876347047
-
"Fast Measurement of LogP Parameters for Message Passing Platforms"
-
T. Kielmann, H. E. Bal, and K. Verstoep, "Fast Measurement of LogP Parameters for Message Passing Platforms," Proceedings of the 15th IPDPS Workshops on Parallel and Distributed Processing, 2000, pp. 1176-1183.
-
(2000)
Proceedings of the 15th IPDPS Workshops on Parallel and Distributed Processing
, pp. 1176-1183
-
-
Kielmann, T.1
Bal, H.E.2
Verstoep, K.3
-
55
-
-
3643067761
-
"Assessing Fast Network Interfaces"
-
D. E. Culler, L. T. Liu, R. P. Martin, and C. O. Yoshikawa, "Assessing Fast Network Interfaces," IEEE Micro 16, No. 1 35-43 (1996).
-
(1996)
IEEE Micro
, vol.16
, Issue.1
, pp. 35-43
-
-
Culler, D.E.1
Liu, L.T.2
Martin, R.P.3
Yoshikawa, C.O.4
-
57
-
-
34548696258
-
"More Efficient Reduction Algorithms for Non-Power-of-Two Number of Processors in Message-Passing Parallel Systems"
-
R. Rabenseifner and J. L. Träff, "More Efficient Reduction Algorithms for Non-Power-of-Two Number of Processors in Message-Passing Parallel Systems," Proceedings of the 11th European PVM/MPI Users' Group Meeting, 2004, pp. 36-46.
-
(2004)
Proceedings of the 11th European PVM/MPI Users' Group Meeting
, pp. 36-46
-
-
Rabenseifner, R.1
Träff, J.L.2
-
58
-
-
18844428650
-
"MagPIe: MPI's Collective Communication Operations for Clustered Wide Area Systems"
-
T. Kielmann, R. F. H. Hofman, H. E. Bal, A. Plaat, and R. A. F. Bhoedjang, "MagPIe: MPI's Collective Communication Operations for Clustered Wide Area Systems," Proceedings of the 7th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 1999, pp. 131-140.
-
(1999)
Proceedings of the 7th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming
, pp. 131-140
-
-
Kielmann, T.1
Hofman, R.F.H.2
Bal, H.E.3
Plaat, A.4
Bhoedjang, R.A.F.5
-
59
-
-
84947248378
-
"An Evaluation of Current High-Performance Networks"
-
C. Bell, D. Bonachea, Y. Cote, J. Duell, P. Hargrove, P. Husbands, C. Iancu, M. Welcome, and K. Yelick, "An Evaluation of Current High-Performance Networks," Proceedings of the 17th International Symposium on Parallel and Distributed Processing, 2003, p. 28.
-
(2003)
Proceedings of the 17th International Symposium on Parallel and Distributed Processing
, pp. 28
-
-
Bell, C.1
Bonachea, D.2
Cote, Y.3
Duell, J.4
Hargrove, P.5
Husbands, P.6
Iancu, C.7
Welcome, M.8
Yelick, K.9
-
60
-
-
0141732229
-
"Efficient Implementation of Reduce-Scatter in MPI"
-
M. Bernaschi, G. Iannello, and M. Lauria, "Efficient Implementation of Reduce-Scatter in MPI," J. Syst. Arch. 49, No. 3, 89-108 (2003).
-
(2003)
J. Syst. Arch.
, vol.49
, Issue.3
, pp. 89-108
-
-
Bernaschi, M.1
Iannello, G.2
Lauria, M.3
-
61
-
-
0028734038
-
"Building a High-Performance Collective Communication Library"
-
M. Barnett, L. Shuler, S. Gupta, D. G. Payne, R. van de Geijn, and J. Watts, "Building a High-Performance Collective Communication Library," Proceedings of the ACM/IEEE Conference on Supercomputing, 1994, pp. 107-116.
-
(1994)
Proceedings of the ACM/IEEE Conference on Supercomputing
, pp. 107-116
-
-
Barnett, M.1
Shuler, L.2
Gupta, S.3
Payne, D.G.4
van de Geijn, R.5
Watts, J.6
-
62
-
-
33746274942
-
"Performance Analysis of MPI Collective Operations"
-
J. Pjesivac-Grbovic, T. Angskun, G. Bosilca, G. E. Fagg, E. Gabriel, and J. J. Dongarra, "Performance Analysis of MPI Collective Operations," Proceedings of the 4th International Workshop on Performance Modeling, Evaluation, and Optimization of Parallel and Distributed Systems, 2005, p. 272a.
-
(2005)
Proceedings of the 4th International Workshop on Performance Modeling, Evaluation, and Optimization of Parallel and Distributed Systems
-
-
Pjesivac-Grbovic, J.1
Angskun, T.2
Bosilca, G.3
Fagg, G.E.4
Gabriel, E.5
Dongarra, J.J.6
-
63
-
-
0035480335
-
"HARNESS and Fault Tolerant MPI"
-
G. E. Fagg, A. Bukovsky, and J. J. Dongara, "HARNESS and Fault Tolerant MPI," J. Parallel Computing 27, No. 11, 1479-1495 (2001).
-
(2001)
J. Parallel Computing
, vol.27
, Issue.11
, pp. 1479-1495
-
-
Fagg, G.E.1
Bukovsky, A.2
Dongara, J.J.3
|