메뉴 건너뛰기




Volumn 33, Issue 9, 2007, Pages 624-633

Optimizing a conjugate gradient solver with non-blocking collective operations

Author keywords

Collective operations; Communication; Computation overlap; Message passing interface (MPI); Non blocking collective operations; Poisson solver

Indexed keywords

COMMUNICATION SYSTEMS; INTERFACES (COMPUTER); MESSAGE PASSING; OPTIMIZATION; PARALLEL PROCESSING SYSTEMS; POISSON DISTRIBUTION;

EID: 34548698431     PISSN: 01678191     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.parco.2007.06.006     Document Type: Article
Times cited : (49)

References (27)
  • 5
    • 34548702103 scopus 로고    scopus 로고
    • Edgar Gabriel, Graham E. Fagg, George Bosilca, Thara Angskun, Jack J. Dongarra, Jeffrey M. Squyres, Vishal Sahay, Prabhanjan Kambadur, Brian Barrett, Andrew Lumsdaine, Ralph H. Castain, David J. Daniel, Richard L. Graham, Timothy S. Woodall, Open MPI: goals, concept, and design of a next generation MPI implementation, in: Proceedings, 11th European PVM/MPI Users' Group Meeting, Budapest, Hungary, September 2004.
  • 6
    • 1242332596 scopus 로고    scopus 로고
    • Send-receive considered harmful: myths and realities of message passing
    • Gorlatch S. Send-receive considered harmful: myths and realities of message passing. ACM Trans. Program. Lang. Syst. 26 1 (2004) 47-56
    • (2004) ACM Trans. Program. Lang. Syst. , vol.26 , Issue.1 , pp. 47-56
    • Gorlatch, S.1
  • 7
    • 34548670158 scopus 로고    scopus 로고
    • Peter Gottschling, Wolfgang E. Nagel, An efficient parallel linear solver with a cascadic conjugate gradient method, in: EuroPar 2000, LNCS, 1900, 2000.
  • 9
    • 0000135303 scopus 로고
    • Methods of conjugate gradients for solving linear systems
    • Hestenes M.R., and Stiefel E. Methods of conjugate gradients for solving linear systems. J. Res. Natl. Bur. Stand. 49 (1952) 409-436
    • (1952) J. Res. Natl. Bur. Stand. , vol.49 , pp. 409-436
    • Hestenes, M.R.1    Stiefel, E.2
  • 10
    • 34548699764 scopus 로고    scopus 로고
    • T.Hoefler, T. Mehlan, F. Mietke, W, Rehm. Adding low-cost hardware barrier support to small commodity clusters, in: Proceedings of 19th International Conference on Architecture and Computing Systems - ARCS'06, vol. 3, 2006, pp. 343-250.
  • 11
    • 34548667779 scopus 로고    scopus 로고
    • T. Hoefler, J. Squyres, G. Bosilca, G. Fagg, A. Lumsdaine, W. Rehm. Non-Blocking Collective Operations for MPI-2. Technical report, Open Systems Lab, Indiana University, 08 2006.
  • 12
    • 84883859962 scopus 로고    scopus 로고
    • T. Hoefler, J. Squyres, W. Rehm, A. Lumsdaine, A case for non-blocking collective operations, in: Frontiers of High Performance Computing and Networking - ISPA 2006 Workshops, vol. 4331/2006, Springer, Berlin, Heidelberg, 2006, 12, pp. 155-164.
  • 13
    • 33847106529 scopus 로고    scopus 로고
    • Torsten Hoefler, Torsten Mehlan, Frank Mietke, Wolfgang Rehm, Fast barrier synchronization for InfiniBand, in: Proceedings, 20th International Parallel and Distributed Processing Symposium IPDPS 2006 (CAC 06), April 2006.
  • 14
    • 84947212732 scopus 로고    scopus 로고
    • L.V. Kale, Sameer Kumar, Krishnan Vardarajan, A framework for collective personalized communication, in: Proceedings of IPDPS'03, Nice, France, April 2003.
  • 15
    • 0031599954 scopus 로고    scopus 로고
    • Arkady Kanevsky, Anthony Skjellum, Anna Rounbehler, MPI/RT - an emerging standard for high-performance real-time systems, in: HICSS, vol. 3, 1998, pp. 157-166.
  • 16
    • 34548675031 scopus 로고    scopus 로고
    • G. Liu, T.S. Abdelrahman, Computation-communication overlap on network-of-workstation multiprocessors, in: Proceedings of the International Conference on Parallel and Distributed Processing Techniques and Applications, July 1998, pp. 1635-1642.
  • 17
    • 34548661593 scopus 로고    scopus 로고
    • J. Liu, A. Mamidala, D. Panda, Fast and Scalable MPI-Level Broadcast using InfiniBand's Hardware Multicast Support, Technical report, OSU-CISRC-10/03-TR57, 2003.
  • 18
    • 34548665142 scopus 로고    scopus 로고
    • Message Passing Interface Forum. MPI: A Message Passing Interface Standard. 1995.
  • 19
    • 34548677380 scopus 로고    scopus 로고
    • Message Passing Interface Forum. MPI-2: Extensions to the Message-Passing Interface. Technical Report, University of Tennessee, Knoxville, 1997.
  • 20
    • 34548702308 scopus 로고    scopus 로고
    • Message Passing Interface Forum. MPI-2 Journal of Development, July (1997).
  • 21
    • 34548703932 scopus 로고    scopus 로고
    • MPICH2 Developers. , 2006.
  • 22
    • 84877019178 scopus 로고    scopus 로고
    • Fabrizio Petrini, Darren J. Kerbyson, Scott Pakin, The case of the missing supercomputer performance: achieving optimal performance on the 8, 192 processors of ASCI Q, in: Proceedings of the ACM/IEEE SC2003 Conference on High Performance Networking and Computing, 15-21 November 2003, ACM, Phoenix, AZ, USA, CD-Rom, 2003, pp. 55.
  • 23
    • 0000048673 scopus 로고
    • GMRES: a generalized minimum residual algorithm for solving nonsymmetric linear systems
    • Saad Y., and Schultz M.H. GMRES: a generalized minimum residual algorithm for solving nonsymmetric linear systems. SIAM J. Sci. Statist. Comput. 7 3 (1986) 856-869
    • (1986) SIAM J. Sci. Statist. Comput. , vol.7 , Issue.3 , pp. 856-869
    • Saad, Y.1    Schultz, M.H.2
  • 24
    • 0002716979 scopus 로고
    • CGS, a fast Lanczos-type solver for nonsymmetric linear systems
    • Sonnefeld P. CGS, a fast Lanczos-type solver for nonsymmetric linear systems. SIAM J. Sci. Statist. Comput. 10 (1989) 36-52
    • (1989) SIAM J. Sci. Statist. Comput. , vol.10 , pp. 36-52
    • Sonnefeld, P.1
  • 25
    • 33845432364 scopus 로고    scopus 로고
    • Vinod Tipparaju, Jarek Nieplocha. Optimizing all-to-all collective communication by exploiting concurrency in modern networks, in: SC '05: Proceedings of the 2005 ACM/IEEE Conference on Supercomputing, IEEE Computer Society, Washington, DC, USA, 2005, pp. 46.
  • 27
    • 0000005482 scopus 로고
    • Bi-CGSTAB: a fast and smoothly converging variant of Bi-CG for the solution of nonsymmetric linear systems
    • van der Vorst H. Bi-CGSTAB: a fast and smoothly converging variant of Bi-CG for the solution of nonsymmetric linear systems. SIAM J. Sci. Statist. Comput. 13 (1992) 631-644
    • (1992) SIAM J. Sci. Statist. Comput. , vol.13 , pp. 631-644
    • van der Vorst, H.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.