



2008, Pages 99-109

Performance without pain = productivity: data layout and collective communication in UPC

Author keywords

Blue Gene; Collective communication; Parallel programming; PGAS; Programming productivity; UPC

Indexed keywords

BLUE GENE; CHOLESKY FACTORIZATIONS; COLLECTIVE COMMUNICATIONS; DENSE MATRICES; MACHINE RESOURCES; MULTIDIMENSIONAL FOURIER TRANSFORM; PARALLEL LANGUAGES; PARTITIONED GLOBAL ADDRESS SPACE; PGAS; PRODUCTIVITY DATA; RUNTIME SYSTEMS; UNIFIED PARALLEL C; UPC; UPC CODE

EID: 70350625706     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited: 21

References (48)
  • 1
    • 7444229864
    • The Cascade high productivity language
    • The Cascade high productivity language, HIPS, 00: 52-60, 2004.
    • (2004) HIPS , vol.0 , pp. 52-60
  • 11
    • 79959386767
    • The Berkeley UPC Compiler
    • The Berkeley UPC Compiler, 2002. http://upc.lbl.gov.
    • (2002)
  • 12
    • 79959485086
    • BLAS Home Page
    • BLAS Home Page, http://www.netlib.org/blas/.
  • 23
    • 54249097779
    • ESSL User Guide. http://www-03.ibm.com/systems/p/software/essl.html.
    • ESSL User Guide.
  • 24
    • 27144559253
    • ScaLAPACK: A linear algebra library for message-passing computers
    • Minneapolis, MN, (electronic), Philadelphia, PA, USA, 1997. Society for Industrial and Applied Mathematics
    • L. S. Blackford et al. ScaLAPACK: a linear algebra library for message-passing computers. In Proceedings of the Eighth SIAM Conference on Parallel Processing for Scientific Computing (Minneapolis, MN, 1997), page 15 (electronic), Philadelphia, PA, USA, 1997. Society for Industrial and Applied Mathematics.
    • (1997) Proceedings of the Eighth SIAM Conference on Parallel Processing for Scientific Computing , pp. 15
    • Blackford, L.S.
  • 25
    • 20744449792
    • The design and implementation of FFTW3
    • DOI 10.1109/JPROC.2004.840301, Program Generation, Optimization and Platform Adaptation
    • M. Frigo and S. G. Johnson. The design and implementation of FFTW3. Proceedings of the IEEE, 93(2): 216-231, 2005. Special issue on "Program Generation, Optimization, and Platform Adaptation". (Pubitemid 40851223)
    • (2005) Proceedings of the IEEE , vol.93 , Issue.2 , pp. 216-231
    • Frigo, M.; Johnson, S.G.
  • 31
    • 0004235292
    • Using MATLAB
    • The MathWorks. Using MATLAB, 1997.
    • (1997) Using MATLAB
  • 32
    • 79959416586
    • Message Passing Interface
    • Message Passing Interface. http://www.mpi-forum.org/docs/docs.html.
  • 34
    • 0002081678
    • Co-array Fortran for parallel programming
    • R. W. Numrich and J. Reid. Co-array Fortran for parallel programming. ACM Fortran Forum, 17(2): 1-31, 1998.
    • (1998) ACM Fortran Forum , vol.17 , Issue.2 , pp. 1-31
    • Numrich, R.W.; Reid, J.
  • 35
    • 0002081678
    • Co-array Fortran for parallel programming
    • R. W. Numrich and J. Reid. Co-array Fortran for parallel programming. SIGPLAN Fortran Forum, 17(2): 1-31, 1998.
    • (1998) SIGPLAN Fortran Forum , vol.17 , Issue.2 , pp. 1-31
    • Numrich, R.W.; Reid, J.
  • 38
    • 33847138695
    • Efficient RDMA-based multi-port collectives on multi-rail QsNetII clusters
    • Proceedings of the 20th International Parallel and Distributed Processing Symposium (IPDPS 2006)
    • Y. Qian and A. Afsahi. Efficient RDMA-based multi-port collectives on multi-rail QsNetII clusters. In The 6th Workshop on Communication Architecture for Clusters (CAC 2006), in Proceedings of the 20th International Parallel and Distributed Processing Symposium (IPDPS 2006), 2006.
    • (2006) The 6th Workshop on Communication Architecture for Clusters (CAC 2006)
    • Qian, Y.; Afsahi, A.
  • 39
    • 79959485085
    • A specification of the extensions to the collective operations of Unified Parallel C
    • Michigan Technological University, Department of Computer Science
    • Z. Ryne and S. Seidel. A specification of the extensions to the collective operations of Unified Parallel C. Technical Report 05-08, Michigan Technological University, Department of Computer Science, 2005.
    • (2005) Technical Report 05-08
    • Ryne, Z.; Seidel, S.
  • 42
    • 4344655318
    • Performance modeling for self adapting collective communications for MPI
    • S. S. Vadhiyar, G. E. Fagg, and J. J. Dongarra. Performance modeling for self adapting collective communications for MPI. In LACSI Symposium, 2001.
    • (2001) LACSI Symposium
    • Vadhiyar, S.S.; Fagg, G.E.; Dongarra, J.J.
  • 45
    • 0343462141
    • Automated empirical optimizations of software and the ATLAS project
    • DOI 10.1016/S0167-8191(00)00087-9
    • R. C. Whaley, A. Petitet, and J. J. Dongarra. Automated empirical optimizations of software and the ATLAS project. Parallel Computing, 27(1-2): 3-35, 2001. (Pubitemid 32264775)
    • (2001) Parallel Computing , vol.27 , Issue.1-2 , pp. 3-35
    • Whaley, R.C.; Petitet, A.; Dongarra, J.J.


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.