2012, Pages 91-100

A scalable framework for heterogeneous GPU-based clusters

Author keywords

Distributed runtime; Heterogeneous clusters; Hybrid CPU GPU architectures; Linear algebra; Manycore scheduling

Indexed keywords

CHOLESKY FACTORIZATIONS; COMPUTATIONAL PERFORMANCE; CPU CORES; DATA DEPENDENCIES; DATAFLOW PROGRAMMING; DISTRIBUTED DYNAMICS; DISTRIBUTED MEMORY; DISTRIBUTED MEMORY CLUSTERS; DYNAMIC SCHEDULING; ENTIRE SYSTEM; FASTER RATES; GPU CLUSTERS; HETEROGENEOUS CLUSTERS; HETEROGENEOUS SYSTEMS; HIGH ENERGY EFFICIENCY; MANY-CORE; MULTI-LEVEL PARTITIONING; PARALLEL SOFTWARE; PCI EXPRESS; PROCESSING UNITS; RUNTIME SYSTEMS; RUNTIMES;

EID: 84864149777     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1145/2312005.2312025     Document Type: Conference Paper
Times cited: 36

References (25)
  • 9
    • 0035481895
    • A proposal for a heterogeneous cluster ScaLAPACK (dense linear solvers)
    • DOI 10.1109/12.956091
    • O. Beaumont, V. Boudet, A. Petitet, F. Rastello, and Y. Robert. A proposal for a heterogeneous cluster ScaLAPACK (dense linear solvers). IEEE Transactions on Computers, 50:1052-1070, 2001. (Pubitemid 33048369)
    • (2001) IEEE Transactions on Computers , vol.50 , Issue.10 , pp. 1052-1070
    • Beaumont, O.1    Boudet, V.2    Petitet, A.3    Rastello, F.4    Robert, Y.5
  • 11
    • 0032648736
    • Static tiling for heterogeneous computing platforms
    • P. Boulet, J. Dongarra, Y. Robert, and F. Vivien. Static tiling for heterogeneous computing platforms. Parallel Computing, 25(5):547-568, 1999.
    • (1999) Parallel Computing , vol.25 , Issue.5 , pp. 547-568
    • Boulet, P.1    Dongarra, J.2    Robert, Y.3    Vivien, F.4
  • 17
    • 36248980362
    • Data distribution for dense factorization on computers with memory heterogeneity
    • DOI 10.1016/j.parco.2007.06.001, PII S0167819107000762
    • A. Lastovetsky and R. Reddy. Data distribution for dense factorization on computers with memory heterogeneity. Parallel Comput., 33:757-779, December 2007. (Pubitemid 350122765)
    • (2007) Parallel Computing , vol.33 , Issue.12 , pp. 757-779
    • Lastovetsky, A.1    Reddy, R.2
  • 19
    • 84864153899
    • CUDA Toolkit 4.0 CUBLAS Library
    • NVIDIA. CUDA Toolkit 4.0 CUBLAS Library, 2011.
    • (2011) NVIDIA
  • 23
    • 84863925917
    • Efficient support for matrix computations on heterogeneous multi-core and multi-GPU architectures
    • F. Song, S. Tomov, and J. Dongarra. Efficient support for matrix computations on heterogeneous multi-core and multi-GPU architectures. LAPACK Working Note 250, UTK, June 2011.
    • (June 2011) LAPACK Working Note 250, UTK
    • Song, F.1    Tomov, S.2    Dongarra, J.3


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.