-
1
-
-
0025467711
-
A bridging model for parallel computation
-
L. G. Valiant, "A bridging model for parallel computation," Commun. ACM, vol. 33, no. 8, pp. 103-111, 1990.
-
(1990)
Commun. ACM
, vol.33
, Issue.8
, pp. 103-111
-
-
Valiant, L.G.1
-
2
-
-
35248859849
-
Improving the performance of collective operations in mpich
-
Recent Advances in Parallel Virtual Machine and Message Passing Interface. Springer Verlag 257267 10th European PVM/MPI Users Group Meeting, Springer Verlag, 2003
-
R. Thakur, "Improving the performance of collective operations in mpich," in Recent Advances in Parallel Virtual Machine and Message Passing Interface. Number 2840 in LNCS, Springer Verlag (2003) 257267 10th European PVM/MPI Users Group Meeting, pp. 257-267, Springer Verlag, 2003.
-
(2003)
LNCS
, Issue.2840
, pp. 257-267
-
-
Thakur, R.1
-
3
-
-
1242332596
-
Send-receive considered harmful: Myths and realities of message passing
-
Jan.
-
S. Gorlatch, "Send-receive considered harmful: Myths and realities of message passing," ACM Trans. Program. Lang. Syst., vol. 26, pp. 47-56, Jan. 2004.
-
(2004)
ACM Trans. Program. Lang. Syst.
, vol.26
, pp. 47-56
-
-
Gorlatch, S.1
-
5
-
-
79951761626
-
The Scalable Process Topology Interface of MPI 2.2
-
Aug.
-
T. Hoefler, R. Rabenseifner, H. Ritzdorf, B. R. de Supinski, R. Thakur, and J. L. Traeff, "The Scalable Process Topology Interface of MPI 2.2," Concurrency and Computation: Practice and Experience, vol. 23, pp. 293-310, Aug. 2010.
-
(2010)
Concurrency and Computation: Practice and Experience
, vol.23
, pp. 293-310
-
-
Hoefler, T.1
Rabenseifner, R.2
Ritzdorf, H.3
De Supinski, B.R.4
Thakur, R.5
Traeff, J.L.6
-
7
-
-
56449130431
-
Sparse Non-Blocking Collectives in Quantum Mechanical Calculations
-
Recent Advances in Parallel Virtual Machine and Message Passing Interface, 15th European PVM/MPI Users' Group Meeting, Springer, Sep.
-
T. Hoefler, F. Lorenzen, and A. Lumsdaine, "Sparse Non-Blocking Collectives in Quantum Mechanical Calculations," in Recent Advances in Parallel Virtual Machine and Message Passing Interface, 15th European PVM/MPI Users' Group Meeting, vol. LNCS 5205, pp. 55-63, Springer, Sep. 2008.
-
(2008)
LNCS
, vol.5205
, pp. 55-63
-
-
Hoefler, T.1
Lorenzen, F.2
Lumsdaine, A.3
-
8
-
-
34548691020
-
MPI collective algorithm selection and quadtree encoding
-
DOI 10.1016/j.parco.2007.06.005, PII S0167819107000804
-
J. Pješivac-Grbović, G. Bosilca, G. E. Fagg, T. Angskun, and J. J. Dongarra, "Mpi collective algorithm selection and quadtree encoding," Parallel Comput., vol. 33, pp. 613-623, Sept. 2007. (Pubitemid 47418299)
-
(2007)
Parallel Computing
, vol.33
, Issue.9
, pp. 613-623
-
-
Pjesivac-Grbovic, J.1
Bosilca, G.2
Fagg, G.E.3
Angskun, T.4
Dongarra, J.J.5
-
9
-
-
0000729827
-
Designing broadcasting algorithms in the postal model for message-passing systems
-
A. Bar-Noy and S. Kipnis, "Designing broadcasting algorithms in the postal model for message-passing systems," Math. Syst. Theory, vol. 27, no. 5, pp. 431-452, 1994.
-
(1994)
Math. Syst. Theory
, vol.27
, Issue.5
, pp. 431-452
-
-
Bar-Noy, A.1
Kipnis, S.2
-
10
-
-
85031726860
-
Optimal broadcast and summation in the LogP model
-
R. M. Karp, A. Sahay, E. E. Santos, and K. E. Schauser, "Optimal broadcast and summation in the LogP model," in Proc. of Symposium on Parallel Algorithms and Architectures, pp. 142-153, 1993.
-
(1993)
Proc. of Symposium on Parallel Algorithms and Architectures
, pp. 142-153
-
-
Karp, R.M.1
Sahay, A.2
Santos, E.E.3
Schauser, K.E.4
-
11
-
-
71549164097
-
Two-tree algorithms for full bandwidth broadcast, reduction and scan
-
December
-
P. Sanders, J. Speck, and J. L. Träff, "Two-tree algorithms for full bandwidth broadcast, reduction and scan," Parallel Comput., vol. 35, pp. 581-594, December 2009.
-
(2009)
Parallel Comput.
, vol.35
, pp. 581-594
-
-
Sanders, P.1
Speck, J.2
Träff, J.L.3
-
12
-
-
33750234379
-
High performance RDMA protocols in HPC
-
Proceedings, 13th European PVM/MPI Users' Group Meeting, (Bonn, Germany), Springer-Verlag, September
-
"High performance RDMA protocols in HPC," in Proceedings, 13th European PVM/MPI Users' Group Meeting, Lecture Notes in Computer Science, (Bonn, Germany), Springer-Verlag, September 2006.
-
(2006)
Lecture Notes in Computer Science
-
-
-
13
-
-
0002076006
-
An upper bound for the chromatic number of a graph and its application to timetabling problems
-
D. J. A. Welsh and M. B. Powell, "An upper bound for the chromatic number of a graph and its application to timetabling problems," The Computer Journal, vol. 10, no. 1, pp. 85-86, 1967.
-
(1967)
The Computer Journal
, vol.10
, Issue.1
, pp. 85-86
-
-
Welsh, D.J.A.1
Powell, M.B.2
-
14
-
-
0008669884
-
LogGP: Incorporating long messages into the LogP model for parallel computation
-
DOI 10.1006/jpdc.1997.1346, PII S0743731597913460
-
A. Alexandrov, M. F. Ionescu, K. E. Schauser, and C. Scheiman, "LogGP: Incorporating long messages into the LogP model," J. of Par. and Distr. Comp., vol. 44, no. 1, pp. 71-79, 1995. (Pubitemid 127340829)
-
(1997)
Journal of Parallel and Distributed Computing
, vol.44
, Issue.1
, pp. 71-79
-
-
Alexandrov, A.1
Ionescu, M.F.2
Schauser, K.E.3
Scheiman, C.4
-
15
-
-
21044437801
-
Overview of the bluegene/l system architecture
-
A. Gara, M. A. Blumrich, D. Chen, G. L.-T. Chiu, M. E. G. P. Coteus, R. A. Haring, P. Heidelberger, D. Hoenicke, G. V. Kopcsay, T. A. Liebsch, M. Ohmacht, B. D. Steinmacher-Burow, T. Takken, and P. Vranas, "Overview of the bluegene/l system architecture," IBM Journal of Research and Development, vol. 49, no. 2, pp. 195-213, 2005.
-
(2005)
IBM Journal of Research and Development
, vol.49
, Issue.2
, pp. 195-213
-
-
Gara, A.1
Blumrich, M.A.2
Chen, D.3
Chiu, G.L.-T.4
Coteus, M.E.G.P.5
Haring, R.A.6
Heidelberger, P.7
Hoenicke, D.8
Kopcsay, G.V.9
Liebsch, T.A.10
Ohmacht, M.11
Steinmacher-Burow, B.D.12
Takken, T.13
Vranas, P.14
-
18
-
-
77958112922
-
The gemini system interconnect
-
IEEE Computer Society
-
R. Alverson, D. Roweth, and L. Kaplan, "The gemini system interconnect," in Proceedings of the 2010 18th IEEE Symposium on High Performance Interconnects, HOTI '10, (Washington, DC, USA), pp. 83-87, IEEE Computer Society, 2010.
-
(2010)
Proceedings of the 2010 18th IEEE Symposium on High Performance Interconnects, HOTI '10, (Washington, DC, USA)
, pp. 83-87
-
-
Alverson, R.1
Roweth, D.2
Kaplan, L.3
-
23
-
-
39749134275
-
A time-split nonhydrostatic atmospheric model for weather research and forecasting applications
-
Mar.
-
W. C. Skamarock and J. B. Klemp, "A time-split nonhydrostatic atmospheric model for weather research and forecasting applications," J. Comput. Phys., vol. 227, pp. 3465-3485, Mar. 2008.
-
(2008)
J. Comput. Phys.
, vol.227
, pp. 3465-3485
-
-
Skamarock, W.C.1
Klemp, J.B.2
-
24
-
-
0000331979
-
Lattice boltzmann method for 3-d flows with curved boundary
-
July
-
R. Mei, W. Shyy, D. Yu, and L.-S. Luo, "Lattice boltzmann method for 3-d flows with curved boundary," J. Comput. Phys., vol. 161, pp. 680-699, July 2000.
-
(2000)
J. Comput. Phys.
, vol.161
, pp. 680-699
-
-
Mei, R.1
Shyy, W.2
Yu, D.3
Luo, L.-S.4
-
25
-
-
84973786808
-
Studying Quarks and Gluons On Mimd Parallel Computers
-
C. Bernard, M. C. Ogilvie, T. A. DeGrand, C. E. DeTar, S. A. Gottlieb, A. Krasnitz, R. Sugar, and D. Toussaint, "Studying Quarks and Gluons On Mimd Parallel Computers," International Journal of High Performance Computing Applications, vol. 5, no. 4, pp. 61-70, 1991.
-
(1991)
International Journal of High Performance Computing Applications
, vol.5
, Issue.4
, pp. 61-70
-
-
Bernard, C.1
Ogilvie, M.C.2
DeGrand, T.A.3
DeTar, C.E.4
Gottlieb, S.A.5
Krasnitz, A.6
Sugar, R.7
Toussaint, D.8
-
26
-
-
0013269731
-
University of Florida Sparse Matrix Collection
-
T. A. Davis, "University of Florida Sparse Matrix Collection," NA Digest, vol. 92, 1994.
-
(1994)
NA Digest
, vol.92
-
-
Davis, T.A.1
-
27
-
-
0036505103
-
Parallel static and dynamic multi-constraint graph partitioning
-
DOI 10.1002/cpe.605
-
K. Schloegel, G. Karypis, and V. Kumar, "Parallel static and dynamic multi-constraint graph partitioning," Concurrency and Computation: Practice and Experience, vol. 14, no. 3, pp. 219-240, 2002. (Pubitemid 34460007)
-
(2002)
Concurrency Computation Practice and Experience
, vol.14
, Issue.3
, pp. 219-240
-
-
Schloegel, K.1
Karypis, G.2
Kumar, V.3
-
28
-
-
0037249228
-
Parallel algebraic multigrid methods on distributed memory computers
-
Feb.
-
G. Haase, M. Kuhn, and S. Reitzinger, "Parallel algebraic multigrid methods on distributed memory computers," SIAM J. Sci. Comput., vol. 24, pp. 410-427, Feb. 2002.
-
(2002)
SIAM J. Sci. Comput.
, vol.24
, pp. 410-427
-
-
Haase, G.1
Kuhn, M.2
Reitzinger, S.3
-
29
-
-
84883516917
-
Efficient algorithms for all-to-all communications in multi-port message-passing systems
-
J. Bruck, C. T. Ho, S. Kipnis, and D. Weathersby, "Efficient algorithms for all-to-all communications in multi-port message-passing systems," in 6th ACM Symp. on Par. Alg. and Arch., pp. 298-309, 1994.
-
(1994)
6th ACM Symp. on Par. Alg. and Arch.
, pp. 298-309
-
-
Bruck, J.1
Ho, C.T.2
Kipnis, S.3
Weathersby, D.4
-
30
-
-
0242308158
-
Communication characteristics of large-scale scientific applications for contemporary cluster architectures
-
DOI 10.1016/S0743-7315(03)00104-7
-
J. S. Vetter and F. Mueller, "Communication characteristics of large-scale scientific applications for contemporary cluster architectures," J. Parallel Distrib. Comput., vol. 63, pp. 853-865, Sept. 2003. (Pubitemid 37364491)
-
(2003)
Journal of Parallel and Distributed Computing
, vol.63
, Issue.9
, pp. 853-865
-
-
Vetter, J.S.1
Mueller, F.2
-
31
-
-
75449107210
-
Communication requirements and interconnect optimization for high-end scientific applications
-
S. Kamil, L. Oliker, A. Pinar, and J. Shalf, "Communication requirements and interconnect optimization for high-end scientific applications," IEEE Trans. Parallel Distrib. Syst., vol. 21, no. 2, pp. 188-202, 2010.
-
(2010)
IEEE Trans. Parallel Distrib. Syst.
, vol.21
, Issue.2
, pp. 188-202
-
-
Kamil, S.1
Oliker, L.2
Pinar, A.3
Shalf, J.4
-
32
-
-
0029717350
-
Automatic optimization of communication in compiling out-of-core stencil codes
-
ACM
-
R. Bordawekar, A. Choudhary, and J. Ramanujam, "Automatic optimization of communication in compiling out-of-core stencil codes," in Proceedings of the 10th international conference on Supercomputing, ICS '96, (New York, NY, USA), pp. 366-373, ACM, 1996.
-
(1996)
Proceedings of the 10th International Conference on Supercomputing, ICS '96, (New York, NY, USA)
, pp. 366-373
-
-
Bordawekar, R.1
Choudhary, A.2
Ramanujam, J.3
-
33
-
-
34548752231
-
Towards optimal multi-level tiling for stencil computations
-
march
-
L. Renganarayana, M. Harthikote-Matha, R. Dewri, and S. Rajopadhye, "Towards optimal multi-level tiling for stencil computations," in Parallel and Distributed Processing Symposium, 2007. IPDPS 2007. IEEE International, pp. 1-10, march 2007.
-
(2007)
Parallel and Distributed Processing Symposium, 2007. IPDPS 2007. IEEE International
, pp. 1-10
-
-
Renganarayana, L.1
Harthikote-Matha, M.2
Dewri, R.3
Rajopadhye, S.4
-
34
-
-
35448944792
-
Effective automatic parallelization of stencil computations
-
ACM
-
S. Krishnamoorthy, M. Baskaran, U. Bondhugula, J. Ramanujam, A. Rountev, and P. Sadayappan, "Effective automatic parallelization of stencil computations," in Proceedings of the 2007 ACM SIGPLAN conference on Programming language design and implementation, PLDI '07, (New York, NY, USA), pp. 235-244, ACM, 2007.
-
(2007)
Proceedings of the 2007 ACM SIGPLAN Conference on Programming Language Design and Implementation, PLDI '07, (New York, NY, USA)
, pp. 235-244
-
-
Krishnamoorthy, S.1
Baskaran, M.2
Bondhugula, U.3
Ramanujam, J.4
Rountev, A.5
Sadayappan, P.6
-
35
-
-
84877705001
-
-
tech. rep., Ohio State University, OSU-CISRC-12/09
-
S. Potluri, P. Lai, K. Tomko, Y. Cui, M. Tatineni, K. Schulz, W. Barth, A. Majumdar, and D. Panda, "Optimizing a Stencil-Based Application for Earthquake Modeling on Modern InfiniBand Clusters," tech. rep., Ohio State University, 2009. OSU-CISRC-12/09.
-
(2009)
Optimizing a Stencil-Based Application for Earthquake Modeling on Modern InfiniBand Clusters
-
-
Potluri, S.1
Lai, P.2
Tomko, K.3
Cui, Y.4
Tatineni, M.5
Schulz, K.6
Barth, W.7
Majumdar, A.8
Panda, D.9
-
36
-
-
84871158565
-
Towards performance portability through runtime adaptation for high-performance computing applications
-
Nov.
-
E. Gabriel, S. Feki, K. Benkert, and M. M. Resch, "Towards performance portability through runtime adaptation for high-performance computing applications," Concurr. Comput. : Pract. Exper., vol. 22, pp. 2230-2246, Nov. 2010.
-
(2010)
Concurr. Comput.: Pract. Exper.
, vol.22
, pp. 2230-2246
-
-
Gabriel, E.1
Feki, S.2
Benkert, K.3
Resch, M.M.4
-
37
-
-
77953986067
-
Optimization of applications with non-blocking neighborhood collectives via multisends on the blue gene/p supercomputer
-
april
-
S. Kumar, P. Heidelberger, D. Chen, and M. Hines, "Optimization of applications with non-blocking neighborhood collectives via multisends on the blue gene/p supercomputer," in Parallel Distributed Processing (IPDPS), 2010 IEEE International Symposium on, pp. 1 -11, april 2010.
-
(2010)
Parallel Distributed Processing (IPDPS), 2010 IEEE International Symposium on
, pp. 1-11
-
-
Kumar, S.1
Heidelberger, P.2
Chen, D.3
Hines, M.4
-
38
-
-
0001483604
-
Communication optimizations for irregular scientific computations on distributed memory architectures
-
Sept.
-
R. Das, M. Uysal, J. Saltz, and Y.-S. Hwang, "Communication optimizations for irregular scientific computations on distributed memory architectures," J. Parallel Distrib. Comput., vol. 22, pp. 462-478, Sept. 1994.
-
(1994)
J. Parallel Distrib. Comput.
, vol.22
, pp. 462-478
-
-
Das, R.1
Uysal, M.2
Saltz, J.3
Hwang, Y.-S.4
-
39
-
-
34248373234
-
Star-mpi: Self tuned adaptive routines for mpi collective operations
-
ACM
-
A. Faraj, X. Yuan, and D. Lowenthal, "Star-mpi: self tuned adaptive routines for mpi collective operations," in Proceedings of the 20th annual international conference on Supercomputing, ICS '06, (New York, NY, USA), pp. 199-208, ACM, 2006.
-
(2006)
Proceedings of the 20th Annual International Conference on Supercomputing, ICS '06, (New York, NY, USA)
, pp. 199-208
-
-
Faraj, A.1
Yuan, X.2
Lowenthal, D.3
|