Volume , Issue , 2012, Pages

Optimization principles for collective neighborhood communications

Author keywords

[No Author keywords available]

Indexed keywords

COLLECTIVE COMMUNICATIONS; COLLECTIVE OPERATIONS; NEIGHBORHOOD COMMUNICATION; OPTIMIZATION HEURISTICS; OPTIMIZATION PRINCIPLE; OPTIMIZED IMPLEMENTATION; PERFORMANCE IMPROVEMENTS; SCIENTIFIC APPLICATIONS;

EID: 84877693951     PISSN: 21674329     EISSN: 21674337     Source Type: Conference Proceeding    
DOI: 10.1109/SC.2012.86     Document Type: Conference Paper
Times cited : (38)

References (39)
  • 1
    • 0025467711
    • A bridging model for parallel computation
    • L. G. Valiant, "A bridging model for parallel computation," Commun. ACM, vol. 33, no. 8, pp. 103-111, 1990.
    • (1990) Commun. ACM , vol.33 , Issue.8 , pp. 103-111
    • Valiant, L.G.1
  • 2
    • 35248859849
    • Improving the performance of collective operations in MPICH
    • Recent Advances in Parallel Virtual Machine and Message Passing Interface, 10th European PVM/MPI Users Group Meeting, pp. 257-267, Springer Verlag, 2003
    • R. Thakur, "Improving the performance of collective operations in MPICH," in Recent Advances in Parallel Virtual Machine and Message Passing Interface, 10th European PVM/MPI Users Group Meeting, LNCS 2840, pp. 257-267, Springer Verlag, 2003.
    • (2003) LNCS , Issue.2840 , pp. 257-267
    • Thakur, R.1
  • 3
    • 1242332596
    • Send-receive considered harmful: Myths and realities of message passing
    • Jan.
    • S. Gorlatch, "Send-receive considered harmful: Myths and realities of message passing," ACM Trans. Program. Lang. Syst., vol. 26, pp. 47-56, Jan. 2004.
    • (2004) ACM Trans. Program. Lang. Syst. , vol.26 , pp. 47-56
    • Gorlatch, S.1
  • 7
    • 56449130431
    • Sparse Non-Blocking Collectives in Quantum Mechanical Calculations
    • Recent Advances in Parallel Virtual Machine and Message Passing Interface, 15th European PVM/MPI Users' Group Meeting, Springer, Sep.
    • T. Hoefler, F. Lorenzen, and A. Lumsdaine, "Sparse Non-Blocking Collectives in Quantum Mechanical Calculations," in Recent Advances in Parallel Virtual Machine and Message Passing Interface, 15th European PVM/MPI Users' Group Meeting, vol. LNCS 5205, pp. 55-63, Springer, Sep. 2008.
    • (2008) LNCS , vol.5205 , pp. 55-63
    • Hoefler, T.1    Lorenzen, F.2    Lumsdaine, A.3
  • 8
    • 34548691020
    • MPI collective algorithm selection and quadtree encoding
    • DOI 10.1016/j.parco.2007.06.005, PII S0167819107000804
    • J. Pješivac-Grbović, G. Bosilca, G. E. Fagg, T. Angskun, and J. J. Dongarra, "MPI collective algorithm selection and quadtree encoding," Parallel Comput., vol. 33, pp. 613-623, Sept. 2007. (Pubitemid 47418299)
    • (2007) Parallel Computing , vol.33 , Issue.9 , pp. 613-623
    • Pjesivac-Grbovic, J.1    Bosilca, G.2    Fagg, G.E.3    Angskun, T.4    Dongarra, J.J.5
  • 9
    • 0000729827
    • Designing broadcasting algorithms in the postal model for message-passing systems
    • A. Bar-Noy and S. Kipnis, "Designing broadcasting algorithms in the postal model for message-passing systems," Math. Syst. Theory, vol. 27, no. 5, pp. 431-452, 1994.
    • (1994) Math. Syst. Theory , vol.27 , Issue.5 , pp. 431-452
    • Bar-Noy, A.1    Kipnis, S.2
  • 11
    • 71549164097
    • Two-tree algorithms for full bandwidth broadcast, reduction and scan
    • December
    • P. Sanders, J. Speck, and J. L. Träff, "Two-tree algorithms for full bandwidth broadcast, reduction and scan," Parallel Comput., vol. 35, pp. 581-594, December 2009.
    • (2009) Parallel Comput. , vol.35 , pp. 581-594
    • Sanders, P.1    Speck, J.2    Träff, J.L.3
  • 12
    • 33750234379
    • High performance RDMA protocols in HPC
    • Proceedings, 13th European PVM/MPI Users' Group Meeting, (Bonn, Germany), Springer-Verlag, September
    • "High performance RDMA protocols in HPC," in Proceedings, 13th European PVM/MPI Users' Group Meeting, Lecture Notes in Computer Science, (Bonn, Germany), Springer-Verlag, September 2006.
    • (2006) Lecture Notes in Computer Science
  • 13
    • 0002076006
    • An upper bound for the chromatic number of a graph and its application to timetabling problems
    • D. J. A. Welsh and M. B. Powell, "An upper bound for the chromatic number of a graph and its application to timetabling problems," The Computer Journal, vol. 10, no. 1, pp. 85-86, 1967.
    • (1967) The Computer Journal , vol.10 , Issue.1 , pp. 85-86
    • Welsh, D.J.A.1    Powell, M.B.2
  • 23
    • 39749134275
    • A time-split nonhydrostatic atmospheric model for weather research and forecasting applications
    • Mar.
    • W. C. Skamarock and J. B. Klemp, "A time-split nonhydrostatic atmospheric model for weather research and forecasting applications," J. Comput. Phys., vol. 227, pp. 3465-3485, Mar. 2008.
    • (2008) J. Comput. Phys. , vol.227 , pp. 3465-3485
    • Skamarock, W.C.1    Klemp, J.B.2
  • 24
    • 0000331979
    • Lattice Boltzmann method for 3-D flows with curved boundary
    • July
    • R. Mei, W. Shyy, D. Yu, and L.-S. Luo, "Lattice Boltzmann method for 3-D flows with curved boundary," J. Comput. Phys., vol. 161, pp. 680-699, July 2000.
    • (2000) J. Comput. Phys. , vol.161 , pp. 680-699
    • Mei, R.1    Shyy, W.2    Yu, D.3    Luo, L.-S.4
  • 26
    • 0013269731
    • University of Florida Sparse Matrix Collection
    • T. A. Davis, "University of Florida Sparse Matrix Collection," NA Digest, vol. 92, 1994.
    • (1994) NA Digest , vol.92
    • Davis, T.A.1
  • 27
    • 0036505103
    • Parallel static and dynamic multi-constraint graph partitioning
    • DOI 10.1002/cpe.605
    • K. Schloegel, G. Karypis, and V. Kumar, "Parallel static and dynamic multi-constraint graph partitioning," Concurrency and Computation: Practice and Experience, vol. 14, no. 3, pp. 219-240, 2002. (Pubitemid 34460007)
    • (2002) Concurrency Computation Practice and Experience , vol.14 , Issue.3 , pp. 219-240
    • Schloegel, K.1    Karypis, G.2    Kumar, V.3
  • 28
    • 0037249228
    • Parallel algebraic multigrid methods on distributed memory computers
    • Feb.
    • G. Haase, M. Kuhn, and S. Reitzinger, "Parallel algebraic multigrid methods on distributed memory computers," SIAM J. Sci. Comput., vol. 24, pp. 410-427, Feb. 2002.
    • (2002) SIAM J. Sci. Comput. , vol.24 , pp. 410-427
    • Haase, G.1    Kuhn, M.2    Reitzinger, S.3
  • 29
    • 84883516917
    • Efficient algorithms for all-to-all communications in multi-port message-passing systems
    • J. Bruck, C. T. Ho, S. Kipnis, and D. Weathersby, "Efficient algorithms for all-to-all communications in multi-port message-passing systems," in 6th ACM Symp. on Par. Alg. and Arch., pp. 298-309, 1994.
    • (1994) 6th ACM Symp. on Par. Alg. and Arch. , pp. 298-309
    • Bruck, J.1    Ho, C.T.2    Kipnis, S.3    Weathersby, D.4
  • 30
    • 0242308158
    • Communication characteristics of large-scale scientific applications for contemporary cluster architectures
    • DOI 10.1016/S0743-7315(03)00104-7
    • J. S. Vetter and F. Mueller, "Communication characteristics of large-scale scientific applications for contemporary cluster architectures," J. Parallel Distrib. Comput., vol. 63, pp. 853-865, Sept. 2003. (Pubitemid 37364491)
    • (2003) Journal of Parallel and Distributed Computing , vol.63 , Issue.9 , pp. 853-865
    • Vetter, J.S.1    Mueller, F.2
  • 31
    • 75449107210
    • Communication requirements and interconnect optimization for high-end scientific applications
    • S. Kamil, L. Oliker, A. Pinar, and J. Shalf, "Communication requirements and interconnect optimization for high-end scientific applications," IEEE Trans. Parallel Distrib. Syst., vol. 21, no. 2, pp. 188-202, 2010.
    • (2010) IEEE Trans. Parallel Distrib. Syst. , vol.21 , Issue.2 , pp. 188-202
    • Kamil, S.1    Oliker, L.2    Pinar, A.3    Shalf, J.4
  • 36
    • 84871158565
    • Towards performance portability through runtime adaptation for high-performance computing applications
    • Nov.
    • E. Gabriel, S. Feki, K. Benkert, and M. M. Resch, "Towards performance portability through runtime adaptation for high-performance computing applications," Concurr. Comput. : Pract. Exper., vol. 22, pp. 2230-2246, Nov. 2010.
    • (2010) Concurr. Comput.: Pract. Exper. , vol.22 , pp. 2230-2246
    • Gabriel, E.1    Feki, S.2    Benkert, K.3    Resch, M.M.4
  • 38
    • 0001483604
    • Communication optimizations for irregular scientific computations on distributed memory architectures
    • Sept.
    • R. Das, M. Uysal, J. Saltz, and Y.-S. Hwang, "Communication optimizations for irregular scientific computations on distributed memory architectures," J. Parallel Distrib. Comput., vol. 22, pp. 462-478, Sept. 1994.
    • (1994) J. Parallel Distrib. Comput. , vol.22 , pp. 462-478
    • Das, R.1    Uysal, M.2    Saltz, J.3    Hwang, Y.-S.4


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.