메뉴 건너뛰기




Volumn , Issue , 2010, Pages

Dynamic load balancing on single- and multi-GPU systems

Author keywords

[No Author keywords available]

Indexed keywords

COMPUTATIONAL POWER; DYNAMIC LOAD BALANCING; GPU PROGRAMMING; GRAPHICS PROCESSING UNITS; LINEAR SPEED-UP; LOAD BALANCE; LOAD IMBALANCE; LOAD-BALANCING; MANY-CORE; PERFORMANCE IMPROVEMENTS; PROGRAMMING TECHNIQUE; TASK-BASED;

EID: 77953985375     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/IPDPS.2010.5470413     Document Type: Conference Paper
Times cited : (124)

References (25)
  • 1
    • 77954006904 scopus 로고    scopus 로고
    • ATI Stream
    • AMD. ATI Stream. http://www.amd.com.
  • 2
    • 70350641505 scopus 로고    scopus 로고
    • StarPU: A unified platform for task scheduling on heterogeneous multicore architectures
    • Delft, Netherlands
    • C. Augonnet, S. Thibault, R. Namyst, and P.-A. Wacrenier. StarPU: A Unified Platform for Task Scheduling on Heterogeneous Multicore Architectures. In Euro-Par 2009, pages 863-874, Delft, Netherlands, 2009.
    • Euro-Par 2009 , vol.2009 , pp. 863-874
    • Augonnet, C.1    Thibault, S.2    Namyst, R.3    Wacrenier, P.-A.4
  • 4
    • 70450059008 scopus 로고    scopus 로고
    • Accelerating leukocyte tracking using CUDA: A case study in leveraging manycore coprocessors
    • M. Boyer, D. T., S. A., and K. S. Accelerating leukocyte tracking using CUDA: A case study in leveraging manycore coprocessors. In IPDPS 2009, pages 1-12, 2009.
    • (2009) IPDPS , vol.2009 , pp. 1-12
    • Boyeri, M.1    T, D.2    A, S.3    S, K.4
  • 5
    • 0001782767 scopus 로고
    • Parallelization of charmm for MIMD machines
    • B. Brooks and H. M. Parallelization of Charmm for MIMD Machines. Chemical Design Automation News, 7(16):16-22, 1992.
    • (1992) Chemical Design Automation News , vol.7 , Issue.16 , pp. 16-22
    • Brooks, B.1    M, H.2
  • 6
    • 77954018826 scopus 로고    scopus 로고
    • On dynamic load balancing on graphics processors
    • D. Cederman and P. T. On Dynamic Load Balancing on Graphics Processors. In GH 2008, pages 57-64, 2008.
    • (2008) GH 2008 , pp. 57-64
    • Cederman, D.1    T, P.2
  • 7
    • 0029179685 scopus 로고
    • Modeling the benefits of mixed data and task parallelism
    • New York, NY, USA, ACM
    • S. Chakrabarti, J. Demmel, and K. Yelick. Modeling the benefits of mixed data and task parallelism. In SPAA'95, pages 74-83, New York, NY, USA, 1995. ACM.
    • (1995) SPAA'95 , pp. 74-83
    • Chakrabarti, S.1    Demmel, J.2    Yelick, K.3
  • 8
    • 77953970436 scopus 로고
    • Parallel molecular dynamics
    • March
    • T. Clark, M. J.A., and S. L.R. Parallel Molecular Dynamics. In SIAMPP'91, pages 338-344, March 1991.
    • (1991) SIAMPP'91 , pp. 338-344
    • Clark, T.1    J, A.M.2    L, R.S.3
  • 10
    • 33750913667 scopus 로고    scopus 로고
    • Kd-tree acceleration structures for a gpu raytracer
    • New York, NY, USA
    • T. Foley and J. Sugerman. Kd-tree acceleration structures for a gpu raytracer. In HWWS'05, pages 15-22, New York, NY, USA, 2005.
    • (2005) HWWS'05 , pp. 15-22
    • Foley, T.1    Sugerman, J.2
  • 12
    • 77955990292 scopus 로고    scopus 로고
    • Enabling task parallelism in the cuda scheduler
    • M. Guevara, C. Gregg, and S. K. Enabling task parallelism in the cuda scheduler. In PEMA 2009, 2009.
    • PEMA 2009 , vol.2009
    • Guevara, M.1    Gregg, C.2    K, S.3
  • 13
    • 38349041620 scopus 로고    scopus 로고
    • Accelerating large graph algorithms on the gpu using cuda
    • P. Harish and N. P.J. Accelerating large graph algorithms on the gpu using cuda. In HiPC, pages 197-208, 2007.
    • (2007) HiPC , pp. 197-208
    • Harish, P.1    P, J.N.2
  • 14
    • 0025917643 scopus 로고
    • Wait-free synchronization
    • M. Herlihy. Wait-free synchronization. ACM TPLS., 13(1):124-149, 1991.
    • (1991) ACM TPLS , vol.13 , Issue.1 , pp. 124-149
    • Herlihy, M.1
  • 15
    • 77954019183 scopus 로고    scopus 로고
    • OpenCL
    • Khronos. OpenCL. http://www.khronos.org.
  • 16
    • 67650046428 scopus 로고    scopus 로고
    • Merge: A programming model for heterogeneous multi-core systems
    • M. D. Linderman, J. D. Collins, H. Wang, and T. H. M. Merge: a programming model for heterogeneous multi-core systems. SIG- PLANNot., 43(3):287-296, 2008.
    • (2008) SIG-PLANNot , vol.43 , Issue.3 , pp. 287-296
    • Linderman, M.D.1    Collins, J.D.2    Wang, H.3    H, M.T.4
  • 18
    • 34249052630 scopus 로고    scopus 로고
    • Adaptive load balancing for raycasting of non-uniformly bricked volumes
    • Parallel Graphics and Visualization
    • M. Mller, C. and Strengert and T. Ertl. Adaptive load balancing for raycasting of non-uniformly bricked volumes. Parallel Computing, 33(6):406-419, 2007. Parallel Graphics and Visualization.
    • (2007) Parallel Computing , vol.33 , Issue.6 , pp. 406-419
    • Mller, M.C.1    Strengert2    Ertl, T.3
  • 19
    • 78651550268 scopus 로고    scopus 로고
    • Scalable parallel programming with CUDA
    • J. Nickolls, I. Buck, M. G., and K. S. Scalable Parallel Programming with CUDA. Queue, 6(2):40-53, 2008.
    • (2008) Queue , vol.6 , Issue.2 , pp. 40-53
    • Nickolls, J.1    Buck, I.2    G, M.3    S, K.4
  • 20
    • 77953976782 scopus 로고    scopus 로고
    • CUDA
    • Nvidia. CUDA. http://www.nvidia.com.
  • 22
    • 60649087529 scopus 로고    scopus 로고
    • A task parallel algorithm for computing the costs of all-pairs shortest paths on the cuda-compatible gpu
    • T. Okuyama, F. I., and K. H. A task parallel algorithm for computing the costs of all-pairs shortest paths on the cuda-compatible gpu. In ISPA'08, pages 284-291, 2008.
    • (2008) ISPA'08 , pp. 284-291
    • Okuyama, T.1    I, F.2    H, K.3
  • 23
    • 79959466764 scopus 로고    scopus 로고
    • Optimization principles and application performance evaluation of a multithreaded gpu using cuda
    • S. Ryoo, C. I. Rodrigues, S. S. Baghsorkhi, S. S. Stone, D. B. Kirk, and W. M. Hwu. Optimization principles and application performance evaluation of a multithreaded gpu using cuda. In PPoPP'08, pages 73-82, 2008.
    • (2008) PPoPP'08 , pp. 73-82
    • Ryoo, S.1    Rodrigues, C.I.2    Baghsorkhi, S.S.3    Stone, S.S.4    Kirk, D.B.5    Hwu, W.M.6
  • 24
    • 0026120011 scopus 로고
    • Molecular dynamics on hypercube parallel computers
    • W. Smith. Molecular dynamics on hypercube parallel computers. Computer Physics Communications, 62:229-248, 1991.
    • (1991) Computer Physics Communications , vol.62 , pp. 229-248
    • Smith, W.1
  • 25
    • 70350771131 scopus 로고    scopus 로고
    • Benchmarking GPUs to tune dense linear algebra
    • V. Volkov and J. W. Demmel. Benchmarking GPUs to tune dense linear algebra. In SC 2008, pages 1-11, 2008.
    • (2008) SC , vol.2008 , pp. 1-11
    • Volkov, V.1    Demmel, J.W.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.