-
2
-
-
77958110333
-
The PERCS high-performance interconnect
-
IEEE Computer Society
-
B. Arimilli, R. Arimilli, V. Chung, S. Clark, W. Denzel, B. Drerup, T. Hoefler, J. Joyner, J. Lewis, J. Li, N. Ni, and R. Rajamony. The PERCS high-performance interconnect. In Proceedings of the IEEE Symposium on High Performance Interconnects (HOTI'10), pages 75-82. IEEE Computer Society, 2010.
-
(2010)
Proceedings of the IEEE Symposium on High Performance Interconnects (HOTI'10)
, pp. 75-82
-
-
Arimilli, B.1
Arimilli, R.2
Chung, V.3
Clark, S.4
Denzel, W.5
Drerup, B.6
Hoefler, T.7
Joyner, J.8
Lewis, J.9
Li, J.10
Ni, N.11
Rajamony, R.12
-
4
-
-
84863638735
-
Performance modeling and comparative analysis of the MILC lattice QCD application su3_rmd
-
IEEE Computer Society
-
G. Bauer, S. Gottlieb, and T. Hoefler. Performance modeling and comparative analysis of the MILC lattice QCD application su3_rmd. In Proceedings of the IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGRID'12), pages 652-659. IEEE Computer Society, 2012.
-
(2012)
Proceedings of the IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGRID'12)
, pp. 652-659
-
-
Bauer, G.1
Gottlieb, S.2
Hoefler, T.3
-
6
-
-
84947248378
-
An evaluation of current high-performance networks
-
C. Bell, D. Bonachea, Y. Cote, J. Duell, P. Hargrove, P. Husbands, C. Iancu, M. Welcome, and K. Yelick. An evaluation of current high-performance networks. In Proceedings of the IEEE International Parallel and Distributed Processing Symposium (IPDPS'03). IEEE Computer Society, 2003.
-
(2003)
Proceedings of the IEEE International Parallel and Distributed Processing Symposium (IPDPS'03). IEEE Computer Society
-
-
Bell, C.1
Bonachea, D.2
Cote, Y.3
Duell, J.4
Hargrove, P.5
Husbands, P.6
Iancu, C.7
Welcome, M.8
Yelick, K.9
-
7
-
-
33847103649
-
Optimizing bandwidth limited problems using one-sided communication and overlap
-
IEEE Computer Society
-
C. Bell, D. Bonachea, R. Nishtala, and K. Yelick. Optimizing bandwidth limited problems using one-sided communication and overlap. In Proceedings of the International Conference on Parallel and Distributed Processing (IPDPS'06), pages 1-10. IEEE Computer Society, 2006.
-
(2006)
Proceedings of the International Conference on Parallel and Distributed Processing (IPDPS'06)
, pp. 1-10
-
-
Bell, C.1
Bonachea, D.2
Nishtala, R.3
Yelick, K.4
-
8
-
-
84973786808
-
Studying quarks and gluons on MIMD parallel computers
-
C. Bernard, M. C. Ogilvie, T. A. DeGrand, C. E. DeTar, S. A. Gottlieb, A. Krasnitz, R. Sugar, and D. Toussaint. Studying quarks and gluons on MIMD parallel computers. International Journal of High Performance Computing Applications, 5(4):61-70, 1991.
-
(1991)
International Journal of High Performance Computing Applications
, vol.5
, Issue.4
, pp. 61-70
-
-
Bernard, C.1
Ogilvie, M.C.2
DeGrand, T.A.3
DeTar, C.E.4
Gottlieb, S.A.5
Krasnitz, A.6
Sugar, R.7
Toussaint, D.8
-
9
-
-
84877718746
-
Cray Cascade: A scalable HPC system based on a Dragonfly network
-
IEEE Computer Society
-
G. Faanes, A. Bataineh, D. Roweth, T. Court, E. Froese, B. Alverson, T. Johnson, J. Kopnick, M. Higgins, and J. Reinhard. Cray Cascade: A scalable HPC system based on a Dragonfly network. In Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis (SC'12), pages 103:1-103:9. IEEE Computer Society, 2012.
-
(2012)
Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis (SC'12)
, pp. 103:1-103:9
-
-
Faanes, G.1
Bataineh, A.2
Roweth, D.3
Court, T.4
Froese, E.5
Alverson, B.6
Johnson, T.7
Kopnick, J.8
Higgins, M.9
Reinhard, J.10
-
11
-
-
0027649341
-
Isoefficiency: Measuring the scalability of parallel algorithms and architectures
-
A. Y. Grama, A. Gupta, and V. Kumar. Isoefficiency: measuring the scalability of parallel algorithms and architectures. Parallel and Distributed Technology: Systems and Technology, 1(3):12-21, 1993.
-
(1993)
Parallel and Distributed Technology: Systems and Technology
, vol.1
, Issue.3
, pp. 12-21
-
-
Grama, A.Y.1
Gupta, A.2
Kumar, V.3
-
12
-
-
84867646537
-
Leveraging MPI's one-sided communication interface for shared-memory programming
-
Springer
-
T. Hoefler, J. Dinan, D. Buntinas, P. Balaji, B. Barrett, R. Brightwell, W. Gropp, V. Kale, and R. Thakur. Leveraging MPI's one-sided communication interface for shared-memory programming. In Recent Advances in the Message Passing Interface (EuroMPI'12), volume LNCS 7490, pages 132-141. Springer, 2012.
-
(2012)
Recent Advances in the Message Passing Interface (EuroMPI'12), Volume LNCS 7490
, pp. 132-141
-
-
Hoefler, T.1
Dinan, J.2
Buntinas, D.3
Balaji, P.4
Barrett, B.5
Brightwell, R.6
Gropp, W.7
Kale, V.8
Thakur, R.9
-
14
-
-
78650818849
-
Characterizing the influence of system noise on large-scale applications by simulation
-
IEEE Computer Society
-
T. Hoefler, T. Schneider, and A. Lumsdaine. Characterizing the influence of system noise on large-scale applications by simulation. In Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis (SC'10), pages 1-11. IEEE Computer Society, 2010.
-
(2010)
Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis (SC'10)
, pp. 1-11
-
-
Hoefler, T.1
Schneider, T.2
Lumsdaine, A.3
-
17
-
-
4544268140
-
High performance MPI-2 one-sided communication over InfiniBand
-
IEEE Computer Society
-
W. Jiang, J. Liu, H.-W. Jin, D. K. Panda, W. Gropp, and R. Thakur. High performance MPI-2 one-sided communication over InfiniBand. In Proceedings of the IEEE International Symposium on Cluster Computing and the Grid (CCGRID'04), pages 531-538. IEEE Computer Society, 2004.
-
(2004)
Proceedings of the IEEE International Symposium on Cluster Computing and the Grid (CCGRID'04)
, pp. 531-538
-
-
Jiang, W.1
Liu, J.2
Jin, H.-W.3
Panda, D.K.4
Gropp, W.5
Thakur, R.6
-
19
-
-
85031726860
-
Optimal broadcast and summation in the LogP model
-
ACM
-
R. M. Karp, A. Sahay, E. E. Santos, and K. E. Schauser. Optimal broadcast and summation in the LogP model. In Proceedings of the ACM Symposium on Parallel Algorithms and Architectures (SPAA'93), pages 142-153. ACM, 1993.
-
(1993)
Proceedings of the ACM Symposium on Parallel Algorithms and Architectures (SPAA'93)
, pp. 142-153
-
-
Karp, R.M.1
Sahay, A.2
Santos, E.E.3
Schauser, K.E.4
-
20
-
-
84866873171
-
PAMI: A parallel active message interface for the Blue Gene/Q supercomputer
-
IEEE Computer Society
-
S. Kumar, A. Mamidala, D. A. Faraj, B. Smith, M. Blocksome, B. Cernohous, D. Miller, J. Parker, J. Ratterman, P. Heidelberger, D. Chen, and B. D. Steinmacher-Burrow. PAMI: A parallel active message interface for the Blue Gene/Q supercomputer. In Proceedings of the IEEE International Parallel and Distributed Processing Symposium (IPDPS'12), pages 763-773. IEEE Computer Society, 2012.
-
(2012)
Proceedings of the IEEE International Parallel and Distributed Processing Symposium (IPDPS'12)
, pp. 763-773
-
-
Kumar, S.1
Mamidala, A.2
Faraj, D.A.3
Smith, B.4
Blocksome, M.5
Cernohous, B.6
Miller, D.7
Parker, J.8
Ratterman, J.9
Heidelberger, P.10
Chen, D.11
Steinmacher-Burrow, B.D.12
-
21
-
-
77950627571
-
Self-consistent MPI performance guidelines
-
J. Larsson Traff, W. D. Gropp, and R. Thakur. Self-consistent MPI performance guidelines. IEEE Transactions on Parallel and Distributed Systems, 21(5):698-709, 2010.
-
(2010)
IEEE Transactions on Parallel and Distributed Systems
, vol.21
, Issue.5
, pp. 698-709
-
-
Larsson Traff, J.1
Gropp, W.D.2
Thakur, R.3
-
22
-
-
77955144409
-
A new vision for Coarray Fortran
-
J. Mellor-Crummey, L. Adhianto, W. N. Scherer III, and G. Jin. A new vision for Coarray Fortran. In Proceedings of the Conference on Partitioned Global Address Space Programming Models (PGAS'09), pages 5:1-5:9. ACM, 2009.
-
(2009)
Proceedings of the Conference on Partitioned Global Address Space Programming Models (PGAS'09) ACM
, pp. 5:1-5:9
-
-
Mellor-Crummey, J.1
Adhianto, L.2
Scherer III, W.N.3
Jin, G.4
-
23
-
-
84976771728
-
Scalable reader-writer synchronization for shared-memory multiprocessors
-
J. M. Mellor-Crummey and M. L. Scott. Scalable reader-writer synchronization for shared-memory multiprocessors. SIGPLAN Notices, 26(7):106-113, 1991.
-
(1991)
SIGPLAN Notices
, vol.26
, Issue.7
, pp. 106-113
-
-
Mellor-Crummey, J.M.1
Scott, M.L.2
-
24
-
-
0026137159
-
Synchronization without contention
-
J. M. Mellor-Crummey and M. L. Scott. Synchronization without contention. SIGPLAN Notices, 26(4):269-278, 1991.
-
(1991)
SIGPLAN Notices
, vol.26
, Issue.4
, pp. 269-278
-
-
Mellor-Crummey, J.M.1
Scott, M.L.2
-
25
-
-
23844539932
-
A scalable implementation of a finite-volume dynamical core in the community atmosphere model
-
A. A. Mirin and W. B. Sawyer. A scalable implementation of a finite-volume dynamical core in the community atmosphere model. International Journal of High Performance Computing Applications, 19(3):203-212, 2005.
-
(2005)
International Journal of High Performance Computing Applications
, vol.19
, Issue.3
, pp. 203-212
-
-
Mirin, A.A.1
Sawyer, W.B.2
-
28
-
-
70449905663
-
Scaling communication-intensive applications on BlueGene/P using one-sided communication and overlap
-
IEEE Computer Society
-
R. Nishtala, P. H. Hargrove, D. O. Bonachea, and K. A. Yelick. Scaling communication-intensive applications on BlueGene/P using one-sided communication and overlap. In Proceedings of the IEEE International Parallel and Distributed Processing Symposium (IPDPS'09), pages 1-12. IEEE Computer Society, 2009.
-
(2009)
Proceedings of the IEEE International Parallel and Distributed Processing Symposium (IPDPS'09)
, pp. 1-12
-
-
Nishtala, R.1
Hargrove, P.H.2
Bonachea, D.O.3
Yelick, K.A.4
-
29
-
-
84899680132
-
-
OpenFabrics Alliance (OFA). OpenFabrics Enterprise Distribution (OFED)
-
OpenFabrics Alliance (OFA). OpenFabrics Enterprise Distribution (OFED) www.openfabrics.org.
-
-
-
-
30
-
-
84877019178
-
The case of the missing supercomputer performance: Achieving optimal performance on the 8,192 processors of ASCI Q
-
F. Petrini, D. J. Kerbyson, and S. Pakin. The case of the missing supercomputer performance: Achieving optimal performance on the 8,192 processors of ASCI Q. In Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis (SC'03). ACM, 2003.
-
(2003)
Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis (SC'03). ACM
-
-
Petrini, F.1
Kerbyson, D.J.2
Pakin, S.3
-
31
-
-
77954729562
-
Quantifying performance benefits of overlap using MPI-2 in a seismic modeling application
-
ACM
-
S. Potluri, P. Lai, K. Tomko, S. Sur, Y. Cui, M. Tatineni, K. W. Schulz, W. L. Barth, A. Majumdar, and D. K. Panda. Quantifying performance benefits of overlap using MPI-2 in a seismic modeling application. In Proceedings of the ACM International Conference on Supercomputing (ICS'10), pages 17-25. ACM, 2010.
-
(2010)
Proceedings of the ACM International Conference on Supercomputing (ICS'10)
, pp. 17-25
-
-
Potluri, S.1
Lai, P.2
Tomko, K.3
Sur, S.4
Cui, Y.5
Tatineni, M.6
Schulz, K.W.7
Barth, W.L.8
Majumdar, A.9
Panda, D.K.10
-
32
-
-
70350441805
-
Processing MPI datatypes outside MPI
-
Springer
-
R. Ross, R. Latham, W. Gropp, E. Lusk, and R. Thakur. Processing MPI datatypes outside MPI. In Recent Advances in Parallel Virtual Machine and Message Passing Interface (EuroPVM/MPI'09), volume LNCS 5759, pages 42-53. Springer, 2009.
-
(2009)
Recent Advances in Parallel Virtual Machine and Message Passing Interface (EuroPVM/MPI'09), Volume LNCS 5759
, pp. 42-53
-
-
Ross, R.1
Latham, R.2
Gropp, W.3
Lusk, E.4
Thakur, R.5
-
33
-
-
70349740809
-
Natively supporting true one-sided communication in MPI on multi-core systems with InfiniBand
-
IEEE Computer Society
-
G. Santhanaraman, P. Balaji, K. Gopalakrishnan, R. Thakur, W. Gropp, and D. K. Panda. Natively supporting true one-sided communication in MPI on multi-core systems with InfiniBand. In Proceedings of the IEEE/ACM International Symposium on Cluster Computing and the Grid (CCGRID'09), pages 380-387. IEEE Computer Society, 2009.
-
(2009)
Proceedings of the IEEE/ACM International Symposium on Cluster Computing and the Grid (CCGRID'09)
, pp. 380-387
-
-
Santhanaraman, G.1
Balaji, P.2
Gopalakrishnan, K.3
Thakur, R.4
Gropp, W.5
Panda, D.K.6
-
34
-
-
84899672565
-
Accelerating applications at scale using one-sided communication
-
H. Shan, B. Austin, N. Wright, E. Strohmaier, J. Shalf, and K. Yelick. Accelerating applications at scale using one-sided communication. In Proceedings of the Conference on Partitioned Global Address Space Programming Models (PGAS'12), 2012.
-
(2012)
Proceedings of the Conference on Partitioned Global Address Space Programming Models (PGAS'12)
-
-
Shan, H.1
Austin, B.2
Wright, N.3
Strohmaier, E.4
Shalf, J.5
Yelick, K.6
-
35
-
-
84899690113
-
InfiniBand architecture specification volume 1, release 1.2
-
InfiniBand Trade Association
-
The InfiniBand Trade Association. InfiniBand Architecture Specification Volume 1, Release 1.2. InfiniBand Trade Association, 2004.
-
(2004)
InfiniBand Trade Association
-
-
-
37
-
-
79959600862
-
Active Pebbles: Parallel programming for data-driven applications
-
ACM
-
J. Willcock, T. Hoefler, N. Edmonds, and A. Lumsdaine. Active Pebbles: Parallel programming for data-driven applications. In Proceedings of the ACM International Conference on Supercomputing (ICS'11), pages 235-245. ACM, 2011.
-
(2011)
Proceedings of the ACM International Conference on Supercomputing (ICS'11)
, pp. 235-245
-
-
Willcock, J.1
Hoefler, T.2
Edmonds, N.3
Lumsdaine, A.4
-
39
-
-
33750234379
-
High performance RDMA protocols in HPC
-
Springer
-
T. S. Woodall, G. M. Shipman, G. Bosilca, and A. B. Maccabe. High performance RDMA protocols in HPC. In Recent Advances in Parallel Virtual Machine and Message Passing Interface (EuroPVM/MPI'06), volume LNCS 4192, pages 76-85. Springer, 2006.
-
(2006)
Recent Advances in Parallel Virtual Machine and Message Passing Interface (EuroPVM/MPI'06), Volume LNCS 4192
, pp. 76-85
-
-
Woodall, T.S.1
Shipman, G.M.2
Bosilca, G.3
Maccabe, A.B.4
-
40
-
-
83155193225
-
Optimizing the Barnes-Hut algorithm in UPC
-
ACM
-
J. Zhang, B. Behzad, and M. Snir. Optimizing the Barnes-Hut algorithm in UPC. In Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis (SC'11), pages 75:1-75:11. ACM, 2011.
-
(2011)
Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis (SC'11)
, pp. 75:1-75:11
-
-
Zhang, J.1
Behzad, B.2
Snir, M.3