메뉴 건너뛰기




Volumn 2006, Issue , 2006, Pages

Optimizing bandwidth limited problems using one-sided communication and overlap

Author keywords

[No Author keywords available]

Indexed keywords

ALGORITHMS; BANDWIDTH; COMPUTER PROGRAMMING LANGUAGES; FREQUENCY ALLOCATION; OPTIMIZATION; SEMANTICS;

EID: 33847103649     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/IPDPS.2006.1639320     Document Type: Conference Paper
Times cited : (97)

References (38)
  • 1
    • 0028757636 scopus 로고
    • A high performance parallel algorithm for 1-d fft
    • R. C. Agarwal, F. G. Gustavson, and M. Zubair. A high performance parallel algorithm for 1-d fft. In SC, pages 34-40, 1994.
    • (1994) SC , pp. 34-40
    • Agarwal, R.C.1    Gustavson, F.G.2    Zubair, M.3
  • 3
    • 25844504635 scopus 로고    scopus 로고
    • QSNETII: Defining high-performance network design
    • J. Beecroft , et al. QSNETII: Defining high-performance network design. IEEE Micro, 25(4):34-47, 2005.
    • (2005) IEEE Micro , vol.25 , Issue.4 , pp. 34-47
    • Beecroft, J.1
  • 5
    • 85033334323 scopus 로고    scopus 로고
    • C. Bell, D. Bonachea, R. Nishtala, and K. Yelick. Optimizing bandwidth limited problems using one-sided communication and overlap. Technical Report LBNL-59207, Berkeley National Lab, 2005.
    • C. Bell, D. Bonachea, R. Nishtala, and K. Yelick. Optimizing bandwidth limited problems using one-sided communication and overlap. Technical Report LBNL-59207, Berkeley National Lab, 2005.
  • 7
    • 85033341619 scopus 로고    scopus 로고
    • The Berkeley UPC Compiler
    • The Berkeley UPC Compiler, 2002. http://upc.lbl.gov.
    • (2002)
  • 8
    • 33847094060 scopus 로고    scopus 로고
    • GASNet specification
    • Technical Report CSD-02-1207, University of California, Berkeley, October
    • D. Bonachea. GASNet specification. Technical Report CSD-02-1207, University of California, Berkeley, October 2002.
    • (2002)
    • Bonachea, D.1
  • 9
    • 33746759468 scopus 로고    scopus 로고
    • Proposal for extending the UPC memory copy library functions and supporting extensions to GASNet, v1.0
    • Technical Report LBNL-56495, Berkeley National Lab, October 2004
    • D. Bonachea. Proposal for extending the UPC memory copy library functions and supporting extensions to GASNet, v1.0. Technical Report LBNL-56495, Berkeley National Lab, October 2004.
    • Bonachea, D.1
  • 16
    • 33845393854 scopus 로고    scopus 로고
    • Transformations to parallel codes for communication-computation overlap
    • November
    • A. Danalis, K.-Y. Kim, L. Pollock, and M. Swany. Transformations to parallel codes for communication-computation overlap. In Supercomputing 2005, November 2005.
    • (2005) Supercomputing 2005
    • Danalis, A.1    Kim, K.-Y.2    Pollock, L.3    Swany, M.4
  • 18
    • 0031997862 scopus 로고    scopus 로고
    • A method for exploiting communication/computation overlap in hypercubes
    • L. Díaz, M. Valero-García, and A. González. A method for exploiting communication/computation overlap in hypercubes. Parallel Computing, 24(2):221-245, 1998.
    • (1998) Parallel Computing , vol.24 , Issue.2 , pp. 221-245
    • Díaz, L.1    Valero-García, M.2    González, A.3
  • 19
    • 0035980881 scopus 로고    scopus 로고
    • Scalable parallel FFT for spectral simulations on a beowulf cluster
    • P. Dmitruk, et al. Scalable parallel FFT for spectral simulations on a beowulf cluster. Parallel Computing, 2001.
    • (2001) Parallel Computing
    • Dmitruk, P.1
  • 20
    • 80052802178 scopus 로고    scopus 로고
    • UPC performance and potential: A NPB experimental study
    • T. El-Ghazawi and F. Cantonnet. UPC performance and potential: A NPB experimental study. In Supercomputing, 2002.
    • (2002) Supercomputing
    • El-Ghazawi, T.1    Cantonnet, F.2
  • 21
    • 33847169750 scopus 로고    scopus 로고
    • Automatic generation and tuning of MPI collective communication routines
    • A. Faraj and X. Yuan. Automatic generation and tuning of MPI collective communication routines. In Proc. Supercomputing, 2005.
    • (2005) Proc. Supercomputing
    • Faraj, A.1    Yuan, X.2
  • 22
    • 20744449792 scopus 로고    scopus 로고
    • The design and implementation of FFTW3
    • M. Frigo and S. G. Johnson. The design and implementation of FFTW3. Proc. of the IEEE, 93(2):216-231, 2005.
    • (2005) Proc. of the IEEE , vol.93 , Issue.2 , pp. 216-231
    • Frigo, M.1    Johnson, S.G.2
  • 23
    • 85033334967 scopus 로고    scopus 로고
    • home
    • GASNet home page. http://gasnet.cs.berkeley.edu/.
    • GASNet
  • 24
    • 33847106262 scopus 로고    scopus 로고
    • Survey of MPI call usage
    • D. Han and T. Jones. Survey of MPI call usage. In SciComp, 2004.
    • (2004) SciComp
    • Han, D.1    Jones, T.2
  • 25
    • 85033330031 scopus 로고    scopus 로고
    • P. Hilfinger, D. Bonachea, D. Gay, S. Graham, B. Liblit, G. Pike, and K. Yelick. Titanium language reference manual. Tech Report UCB/CSD-01-1163, U.C. Berkeley, November 2001.
    • P. Hilfinger, D. Bonachea, D. Gay, S. Graham, B. Liblit, G. Pike, and K. Yelick. Titanium language reference manual. Tech Report UCB/CSD-01-1163, U.C. Berkeley, November 2001.
  • 27
    • 33847152014 scopus 로고    scopus 로고
    • Building multirail Infiniband clusters: MPI-level design
    • J. Liu, A. Vishnu, and D. K. Panda. Building multirail Infiniband clusters: MPI-level design. In SuperComputing, 2004.
    • (2004) SuperComputing
    • Liu, J.1    Vishnu, A.2    Panda, D.K.3
  • 28
    • 3042721503 scopus 로고    scopus 로고
    • High performance RDMA-based mpi implementation over Infiniband
    • J. Liu, J. Wu, and D. K. Panda. High performance RDMA-based mpi implementation over Infiniband. Int'l J. of Parallel Prog., 2004.
    • (2004) Int'l J. of Parallel Prog
    • Liu, J.1    Wu, J.2    Panda, D.K.3
  • 29
    • 0003413675 scopus 로고
    • A message-passing interface standard, v1.1
    • MPI:, Technical report, University of Tennessee, Knoxville, June 12
    • MPI: A message-passing interface standard, v1.1. Technical report, University of Tennessee, Knoxville, June 12, 1995.
    • (1995)
  • 30
    • 85033343554 scopus 로고    scopus 로고
    • MPI-2: a message-passing interface standard. Int'l J. of High Performance Computing Applications, 12:1-299, 1998.
    • MPI-2: a message-passing interface standard. Int'l J. of High Performance Computing Applications, 12:1-299, 1998.
  • 31
    • 0006168939 scopus 로고    scopus 로고
    • ARMCI: A portable remote memory copy library for distributed array libraries and compiler run-time systems
    • J. Nieplocha and B. Carpenter. ARMCI: A portable remote memory copy library for distributed array libraries and compiler run-time systems. In Proc. RTSPP IPPS/SDP'99, 1999.
    • (1999) Proc. RTSPP IPPS/SDP'99
    • Nieplocha, J.1    Carpenter, B.2
  • 32
    • 0002081678 scopus 로고    scopus 로고
    • Co-array fortran for parallel programming
    • R. Numrich and J. Reid. Co-array fortran for parallel programming. In ACM Fortran Forum 17, 2, 1-31., 1998.
    • (1998) ACM Fortran Forum , vol.17 , Issue.2 , pp. 1-31
    • Numrich, R.1    Reid, J.2
  • 33
    • 33845425848 scopus 로고    scopus 로고
    • Scientific computations on modern parallel vector systems
    • L. Oliker, et al. Scientific computations on modern parallel vector systems. In Proc. of Supercomputing, 2004.
    • (2004) Proc. of Supercomputing
    • Oliker, L.1
  • 34
    • 0035342056 scopus 로고    scopus 로고
    • A comparison of optimal FFTs on torus and hypercube multicomputers
    • P. Swartztrauber and S. Hammond. A comparison of optimal FFTs on torus and hypercube multicomputers. Parallel Computing, 2001.
    • (2001) Parallel Computing
    • Swartztrauber, P.1    Hammond, S.2
  • 35
    • 85033349291 scopus 로고    scopus 로고
    • UPC consortium home
    • UPC consortium home page. http://upc.gwu.edu/.
  • 36
    • 85033327980 scopus 로고    scopus 로고
    • UPC language specifications, v1.2. Technical Report LBNL-59208, Berkeley National Lab, 2005.
    • UPC language specifications, v1.2. Technical Report LBNL-59208, Berkeley National Lab, 2005.
  • 38
    • 84942813297 scopus 로고    scopus 로고
    • Programming the Infiniband network architecture for high performance message passing systems
    • V. Velusamy, et al. Programming the Infiniband network architecture for high performance message passing systems. In ISCA, 2003.
    • (2003) ISCA
    • Velusamy, V.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.