-
1
-
-
20444508120
-
Application-bypass reduction for large-scale clusters
-
IEEE Computer Society
-
Wagner, A., Buntinas, D., Panda, D.K., Brightwell, R.: Application-bypass reduction for large-scale clusters. In: 2003 IEEE International Conference on Cluster Computing (CLUSTER 2003), IEEE Computer Society (2003) 404-411
-
(2003)
2003 IEEE International Conference on Cluster Computing (CLUSTER 2003)
, pp. 404-411
-
-
Wagner, A.1
Buntinas, D.2
Panda, D.K.3
Brightwell, R.4
-
2
-
-
32844473627
-
Improving application performance on HPC systems with process synchronization
-
Terry, P., Shan, A., Huttunen, P.: Improving application performance on HPC systems with process synchronization. Linux J. 2004(127) (2004) 3
-
(2004)
Linux J
, vol.2004
, Issue.127
, pp. 3
-
-
Terry, P.1
Shan, A.2
Huttunen, P.3
-
3
-
-
33745195144
-
Hunting the overlap
-
IEEE Computer Society
-
Iancu, C., Husbands, P., Hargrove, P.: Hunting the overlap. In: PACT '05: Proceedings of the 14th International Conference on Parallel Architectures and Compilation Techniques (PACT'05), Washington, DC, USA, IEEE Computer Society (2005) 279-290
-
(2005)
PACT '05: Proceedings of the 14th International Conference on Parallel Architectures and Compilation Techniques (PACT'05), Washington, DC, USA
, pp. 279-290
-
-
Iancu, C.1
Husbands, P.2
Hargrove, P.3
-
5
-
-
84948981514
-
COMB: A portable benchmark suite for assessing MPI overlap
-
IEEE Computer Society
-
Lawry, W., Wilson, C., Maccabe, A.B., Brightwell, R.: COMB: A portable benchmark suite for assessing MPI overlap. In: CLUSTER, IEEE Computer Society (2002) 472-475
-
(2002)
CLUSTER
, pp. 472-475
-
-
Lawry, W.1
Wilson, C.2
Maccabe, A.B.3
Brightwell, R.4
-
7
-
-
8344267505
-
An analysis of the impact of MPI overlap and independent progress
-
ACM Press
-
Brightwell, R., Underwood, K.D.: An analysis of the impact of MPI overlap and independent progress. In: ICS '04: Proceedings of the 18th Annual International Conference on Supercomputing, New York, NY, USA, ACM Press (2004) 298-305
-
(2004)
ICS '04: Proceedings of the 18th Annual International Conference on Supercomputing, New York, NY, USA
, pp. 298-305
-
-
Brightwell, R.1
Underwood, K.D.2
-
9
-
-
0032659850
-
Tiling on systems with communication/computation overlap
-
Calland, P.Y., Dongarra, J., Robert, Y.: Tiling on systems with communication/computation overlap. Concurrency - Practice and Experience 11(3) (1999) 139-153
-
(1999)
Concurrency - Practice and Experience
, vol.11
, Issue.3
, pp. 139-153
-
-
Calland, P.Y.1
Dongarra, J.2
Robert, Y.3
-
10
-
-
23044530213
-
Optimizing metacomputing with communication-computation overlap
-
Baude, F., Caromel, D., Furmento, N., Sagnol, D.: Optimizing metacomputing with communication-computation overlap. In: PaCT '01: Proceedings of the 6th International Conference on Parallel Computing Technologies, London, UK, Springer-Verlag (2001) 190-204
-
PaCT '01: Proceedings of the 6th International Conference on Parallel Computing Technologies, London, UK, Springer-Verlag (2001)
, pp. 190-204
-
-
Baude, F.1
Caromel, D.2
Furmento, N.3
Sagnol, D.4
-
11
-
-
33845393854
-
Transformations to parallel codes for communication-computation overlap
-
IEEE Computer Society
-
Danalis, A., Kim, K.Y., Pollock, L., Swany, M.: Transformations to parallel codes for communication-computation overlap. In: SC '05: Proceedings of the 2005 ACM/IEEE conference on Supercomputing, Washington, DC, USA, IEEE Computer Society (2005) 58
-
(2005)
SC '05: Proceedings of the 2005 ACM/IEEE Conference on Supercomputing, Washington, DC, USA
, pp. 58
-
-
Danalis, A.1
Kim, K.Y.2
Pollock, L.3
Swany, M.4
-
13
-
-
0035004646
-
Redistribution strategies for portable parallel FFT: A case study
-
DOI 10.1002/cpe.564
-
Dubey, A., Tessera, D.: Redistribution strategies for portable parallel FFT: a case study. Concurrency and Computation: Practice and Experience 13(3) (2001) 209-220 (Pubitemid 32432110)
-
(2001)
Concurrency and Computation: Practice and Experience
, vol.13
, Issue.3
, pp. 209-220
-
-
Dubey, A.1
Tessera, D.2
-
14
-
-
19744376706
-
Analyzing the impact of overlap, offload, and independent progress for message passing interface applications
-
DOI 10.1177/1094342005054257
-
Brightwell, R., Riesen, R., Underwood, K.D.: Analyzing the impact of overlap, offload, and independent progress for message passing interface applications. Int. J. High Perform. Comput. Appl. 19(2) (2005) 103-117 (Pubitemid 40743298)
-
(2005)
International Journal of High Performance Computing Applications
, vol.19
, Issue.2 SPEC. ISS.
, pp. 103-117
-
-
Brightwell, R.1
Riesen, R.2
Underwood, K.D.3
-
15
-
-
84947212732
-
A Framework for Collective Personalized Communication
-
Kale, L.V., Kumar, S., Varadarajan, K.: A Framework for Collective Personalized Communication. In: Proceedings of IPDPS'03, Nice, France (2003)
-
Proceedings of IPDPS'03, Nice, France (2003)
-
-
Kale, L.V.1
Kumar, S.2
Varadarajan, K.3
-
16
-
-
84877019178
-
The case of the missing supercomputer performance: Achieving optimal performance on the 8,192 processors of ASCI Q
-
CD-Rom, ACM
-
Petrini, F., Kerbyson, D.J., Pakin, S.: The case of the missing supercomputer performance: Achieving optimal performance on the 8,192 processors of ASCI Q. In: Proceedings of the ACM/IEEE SC2003 Conference on High Performance Networking and Computing, 15-21 November 2003, Phoenix, AZ, USA, CD-Rom, ACM (2003) 55
-
(2003)
Proceedings of the ACM/IEEE SC2003 Conference on High Performance Networking and Computing, 15-21 November 2003, Phoenix, AZ, USA
, pp. 55
-
-
Petrini, F.1
Kerbyson, D.J.2
Pakin, S.3
-
17
-
-
79960011716
-
The impact of noise on the scaling of collectives: A theoretical approach
-
Agarwal, S., Garg, R., Vishnoi, N.: The impact of noise on the scaling of collectives: A theoretical approach. In: 12th Annual IEEE International Conference on High Performance Computing, Goa, India (2005)
-
12th Annual IEEE International Conference on High Performance Computing, Goa, India (2005)
-
-
Agarwal, S.1
Garg, R.2
Vishnoi, N.3
-
18
-
-
84877078118
-
Improving the scalability of parallel jobs by adding parallel awareness to the operating system
-
Jones, T., Dawson, S., Neely, R., Tuel Jr., W.G., Brenner, L., Fier, J., Blackmore, R., Caffrey, P., Maskell, B., Tomlinson, P., Roberts, M.: Improving the scalability of parallel jobs by adding parallel awareness to the operating system. In: Proceedings of the ACM/IEEE SC2003 Conference on High Performance Networking and Computing. (2003) 10
-
Proceedings of the ACM/IEEE SC2003 Conference on High Performance Networking and Computing. (2003)
, pp. 10
-
-
Jones, T.1
Dawson, S.2
Neely, R.3
Tuel Jr., W.G.4
Brenner, L.5
Fier, J.6
Blackmore, R.7
Caffrey, P.8
Maskell, B.9
Tomlinson, P.10
Roberts, M.11
-
19
-
-
1242332596
-
Send-receive considered harmful: Myths and realities of message passing
-
Gorlatch, S.: Send-receive considered harmful: Myths and realities of message passing. ACM Trans. Program. Lang. Syst. 26(1) (2004) 47-56
-
(2004)
ACM Trans. Program. Lang. Syst.
, vol.26
, Issue.1
, pp. 47-56
-
-
Gorlatch, S.1
-
20
-
-
33746274942
-
Performance Analysis of MPI Collective Operations
-
Pjesivac-Grbovic, J., Angskun, T., Bosilca, G., Fagg, G.E., Gabriel, E., Dongarra, J.J.: Performance Analysis of MPI Collective Operations. In: Proceedings of the 19th International Parallel and Distributed Processing Symposium, 4th International Workshop on Performance Modeling, Evaluation, and Optimization of Parallel and Distributed Systems (PMEO-PDS 05), Denver, CO (2005)
-
Proceedings of the 19th International Parallel and Distributed Processing Symposium, 4th International Workshop on Performance Modeling, Evaluation, and Optimization of Parallel and Distributed Systems (PMEO-PDS 05), Denver, CO (2005)
-
-
Pjesivac-Grbovic, J.1
Angskun, T.2
Bosilca, G.3
Fagg, G.E.4
Gabriel, E.5
Dongarra, J.J.6
-
21
-
-
33745217824
-
A practical Approach to the Rating of Barrier Algorithms using the LogP Model and Open MPI
-
Hoefler, T., Cerquetti, L., Mehlan, T., Mietke, F., Rehm, W.: A practical Approach to the Rating of Barrier Algorithms using the LogP Model and Open MPI. In: Proceedings of the 2005 International Conference on Parallel Processing Workshops (ICPP'05). (2005) 562-569
-
Proceedings of the 2005 International Conference on Parallel Processing Workshops (ICPP'05). (2005)
, pp. 562-569
-
-
Hoefler, T.1
Cerquetti, L.2
Mehlan, T.3
Mietke, F.4
Rehm, W.5
-
22
-
-
0009346826
-
LogP: Towards a realistic model of parallel computation
-
Culler, D., Karp, R., Patterson, D., Sahay, A., Schauser, K.E., Santos, E., Subramonian, R., von Eicken, T.: LogP: Towards a realistic model of parallel computation. In: Principles and Practice of Parallel Programming. (1993) 1-12
-
(1993)
Principles and Practice of Parallel Programming
, pp. 1-12
-
-
Culler, D.1
Karp, R.2
Patterson, D.3
Sahay, A.4
Schauser, K.E.5
Santos, E.6
Subramonian, R.7
Von Eicken, T.8
-
23
-
-
0008669884
-
LogGP: Incorporating Long Messages into the LogP Model
-
Alexandrov, A., Ionescu, M.F., Schauser, K.E., Scheiman, C.: LogGP: Incorporating Long Messages into the LogP Model. Journal of Parallel and Distributed Computing 44(1) (1995) 71-79
-
(1995)
Journal of Parallel and Distributed Computing
, vol.44
, Issue.1
, pp. 71-79
-
-
Alexandrov, A.1
Ionescu, M.F.2
Schauser, K.E.3
Scheiman, C.4
-
24
-
-
84966663968
-
Communication characteristics of large-scale scientific applications for contemporary cluster architectures
-
IEEE Computer Society
-
Vetter, J.S., Mueller, F.: Communication characteristics of large-scale scientific applications for contemporary cluster architectures. In: IPDPS '02: Proceedings of the 16th International Parallel and Distributed Processing Symposium, Washington, DC, USA, IEEE Computer Society (2002) 96
-
(2002)
IPDPS '02: Proceedings of the 16th International Parallel and Distributed Processing Symposium, Washington, DC, USA
, pp. 96
-
-
Vetter, J.S.1
Mueller, F.2
-
25
-
-
50649103668
-
Implications of application usage characteristics for collective communication offload
-
Brightwell, R., Goudy, S., Rodrigues, A., Underwood, K.: Implications of application usage characteristics for collective communication offload. International Journal of High-Performance Computing and Networking 4(2) (2006)
-
(2006)
International Journal of High-Performance Computing and Networking
, vol.4
, Issue.2
-
-
Brightwell, R.1
Goudy, S.2
Rodrigues, A.3
Underwood, K.4
-
27
-
-
84883886447
-
Low overhead Ethernet communication for Open MPI on Linux clusters
-
Submitted to
-
Hoefler, T., Reinhardt, M., Mehlan, T., Mietke, F., Rehm, W.: Low overhead Ethernet communication for Open MPI on Linux clusters. In: Submitted to EuroPVM'06. (2006)
-
EuroPVM'06. (2006)
-
-
Hoefler, T.1
Reinhardt, M.2
Mehlan, T.3
Mietke, F.4
Rehm, W.5
-
28
-
-
84944408245
-
EMP: Zero-copy OS-bypass NIC-driven Gigabit Ethernet message passing
-
ACM Press
-
Shivam, P., Wyckoff, P., Panda, D.: EMP: Zero-copy OS-bypass NIC-driven Gigabit Ethernet message passing. In: Supercomputing '01: Proceedings of the 2001 ACM/IEEE Conference on Supercomputing (CDROM), New York, NY, USA, ACM Press (2001) 57-57
-
(2001)
Supercomputing '01: Proceedings of the 2001 ACM/IEEE Conference on Supercomputing (CDROM), New York, NY, USA
, pp. 57-57
-
-
Shivam, P.1
Wyckoff, P.2
Panda, D.3
-
29
-
-
84883861192
-
Optimizing a Conjugate Gradient Solver with Non-Blocking Collective Operations
-
Accepted for publication at the
-
Hoefler, T., Gottschling, P., Rehm, W., Lumsdaine, A.: Optimizing a Conjugate Gradient Solver with Non-Blocking Collective Operations. (2006) Accepted for publication at the ParSim 2006 Workshop.
-
(2006)
ParSim 2006 Workshop
-
-
Hoefler, T.1
Gottschling, P.2
Rehm, W.3
Lumsdaine, A.4
-
30
-
-
84883855072
-
-
LibNBC
-
LibNBC: http://www.unixer.de/NBC (2006)
-
(2006)
-
-
-
31
-
-
51049107155
-
-
Technical report, Open Systems Lab, Indiana University
-
Hoefler, T., Lumsdaine, A.: Design, Implementation, and Usage of LibNBC. Technical report, Open Systems Lab, Indiana University (2006)
-
(2006)
Design, Implementation, and Usage of LibNBC
-
-
Hoefler, T.1
Lumsdaine, A.2
-
32
-
-
33750226072
-
Adding Low-Cost Hardware Barrier Support to Small Commodity Clusters
-
Hoefler, T., Mehlan, T., Mietke, F., Rehm, W.: Adding Low-Cost Hardware Barrier Support to Small Commodity Clusters. In: 19th International Conference on Architecture and Computing Systems - ARCS'06. (2006) 343-350
-
19th International Conference on Architecture and Computing Systems - ARCS'06. (2006)
, pp. 343-350
-
-
Hoefler, T.1
Mehlan, T.2
Mietke, F.3
Rehm, W.4
-
33
-
-
33745201924
-
The Component Architecture of Open MPI: Enabling Third-Party Collective Algorithms
-
Squyres, J.M., Lumsdaine, A.: The Component Architecture of Open MPI: Enabling Third-Party Collective Algorithms. In: Proceedings, 18th ACM International Conference on Supercomputing, Workshop on Component Models and Systems for Grid Applications, St. Malo, France (2004)
-
Proceedings, 18th ACM International Conference on Supercomputing, Workshop on Component Models and Systems for Grid Applications, St. Malo, France (2004)
-
-
Squyres, J.M.1
Lumsdaine, A.2
-
34
-
-
33750253089
-
-
Hoefler, T., Squyres, J., Bosilca, G., Fagg, G., Lumsdaine, A., Rehm, W.: Non-Blocking Collective Operations for MPI-2. (2006)
-
(2006)
Non-Blocking Collective Operations for MPI-2
-
-
Hoefler, T.1
Squyres, J.2
Bosilca, G.3
Fagg, G.4
Lumsdaine, A.5
Rehm, W.6