



Volume 29, Issue 8, 2013, Pages 2262-2271

Scheduling concurrent applications on a cluster of CPU-GPU nodes

Author keywords

CPU GPU systems; Scheduling

Indexed keywords

GRAPHICS PROCESSING UNIT; ROUTERS;

EID: 84886092852     PISSN: 0167739X     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.future.2013.06.002     Document Type: Article
Times cited : (14)

References (37)
  • 1
    • 78651550268
    • Scalable parallel programming with CUDA
    • J. Nickolls, I. Buck, M. Garland, and K. Skadron Scalable parallel programming with CUDA Queue 6 2008 40 53
    • (2008) Queue , vol.6 , pp. 40-53
    • Nickolls, J.1    Buck, I.2    Garland, M.3    Skadron, K.4
  • 2
    • 84886091143
    • OpenCL
    • OpenCL. http://www.khronos.org/opencl/.
  • 3
    • 78149231331
    • MapCG: Writing parallel program portable between CPU and GPU
    • New York, NY, USA
    • C. Hong, et al. MapCG: writing parallel program portable between CPU and GPU, in: PACT'10, New York, NY, USA, 2010, pp. 217-226.
    • (2010) PACT'10 , pp. 217-226
    • Hong, C.1
  • 4
    • 78149233155
    • Ocelot: A dynamic optimization framework for bulk-synchronous applications in heterogeneous systems
    • New York, NY, USA
    • G.F. Diamos, et al. Ocelot: a dynamic optimization framework for bulk-synchronous applications in heterogeneous systems, in: PACT'10, New York, NY, USA, 2010, pp. 353-364.
    • (2010) PACT'10 , pp. 353-364
    • Diamos, G.F.1
  • 5
    • 72049125355
    • Coordinating the use of GPU and CPU for improving performance of compute intensive applications
    • G. Teodoro, et al. Coordinating the use of GPU and CPU for improving performance of compute intensive applications, in: CLUSTER'09, 2009, pp. 1-10.
    • (2009) CLUSTER'09 , pp. 1-10
    • Teodoro, G.1
  • 6
    • 57349153933
    • Harmony: An execution model and runtime for heterogeneous many core systems
    • New York, NY, USA
    • G.F. Diamos, S. Yalamanchili, Harmony: an execution model and runtime for heterogeneous many core systems, in: HPDC'08, New York, NY, USA, 2008, pp. 197-200.
    • (2008) HPDC'08 , pp. 197-200
    • Diamos, G.F.1    Yalamanchili, S.2
  • 7
    • 76749140917
    • Qilin: Exploiting parallelism on heterogeneous multiprocessors with adaptive mapping
    • New York, NY, USA
    • C.-K. Luk, S. Hong, H. Kim, Qilin: exploiting parallelism on heterogeneous multiprocessors with adaptive mapping, in: MICRO'09, New York, NY, USA, 2009, pp. 45-55.
    • (2009) MICRO'09 , pp. 45-55
    • Luk, C.-K.1    Hong, S.2    Kim, H.3
  • 8
    • 77954709868
    • Compiler and runtime support for enabling generalized reduction computations on heterogeneous parallel configurations
    • New York, NY, USA
    • V.T. Ravi, W. Ma, D. Chiu, G. Agrawal, Compiler and runtime support for enabling generalized reduction computations on heterogeneous parallel configurations, in: ICS'10, New York, NY, USA, 2010, pp. 137-146.
    • (2010) ICS'10 , pp. 137-146
    • Ravi, V.T.1    Ma, W.2    Chiu, D.3    Agrawal, G.4
  • 9
    • 77954927300
    • Data-aware scheduling of legacy kernels on heterogeneous platforms with distributed memory
    • New York, NY, USA
    • M. Becchi, et al. Data-aware scheduling of legacy kernels on heterogeneous platforms with distributed memory, in: SPAA'10, New York, NY, USA, 2010, pp. 82-91.
    • (2010) SPAA'10 , pp. 82-91
    • Becchi, M.1
  • 12
    • 79251597562
    • Swan: A tool for porting cuda programs to opencl
    • M. Harvey, and G.D. Fabritiis Swan: a tool for porting cuda programs to opencl Computer Physics Communications 182 4 2011 1093 1099 URL http://www.sciencedirect.com/science/article/pii/S0010465511000117
    • (2011) Computer Physics Communications , vol.182 , Issue.4 , pp. 1093-1099
    • Harvey, M.1    Fabritiis, G.D.2
  • 14
    • 70649092154
    • Rodinia: A benchmark suite for heterogeneous computing
    • Washington, DC, USA
    • S. Che, et al. Rodinia: a benchmark suite for heterogeneous computing, in: IISWC'09, Washington, DC, USA, 2009, pp. 44-54.
    • (2009) IISWC'09 , pp. 44-54
    • Che, S.1
  • 15
    • 84886087035
    • Parboil Benchmark Suite
    • Parboil Benchmark Suite. http://impact.crhc.illinois.edu/parboil.php.
  • 16
    • 77952256778
    • Modeling GPU-CPU workloads and systems
    • A. Kerr, et al. Modeling GPU-CPU workloads and systems, in: GPGPU'10, 2010, pp. 31-42.
    • (2010) GPGPU'10 , pp. 31-42
    • Kerr, A.1
  • 18
    • 0030149947
    • Effective distributed scheduling of parallel workloads
    • A.C. Dusseau, R.H. Arpaci, and D.E. Culler Effective distributed scheduling of parallel workloads SIGMETRICS Performance Evaluation Review 24 1996 25 36 (Pubitemid 126549537)
    • (1996) Performance Evaluation Review , vol.24 , Issue.1 , pp. 25-36
    • Dusseau, A.C.1    Arpaci, R.H.2    Culler, D.E.3
  • 19
    • 0027721450
    • Performance analysis of job scheduling policies in parallel supercomputing environments
    • New York, NY, USA
    • V.K. Naik, M.S. Squillante, S.K. Setia, Performance analysis of job scheduling policies in parallel supercomputing environments, in: Supercomputing'93, New York, NY, USA, 1993, pp. 824-833.
    • (1993) Supercomputing'93 , pp. 824-833
    • Naik, V.K.1    Squillante, M.S.2    Setia, S.K.3
  • 22
    • 4644370318
    • Single-isa heterogeneous multi-core architectures for multithreaded workload performance
    • IEEE Computer Society Washington, DC, USA
    • R. Kumar Single-isa heterogeneous multi-core architectures for multithreaded workload performance ISCA'04 2004 IEEE Computer Society Washington, DC, USA 64 75
    • (2004) ISCA'04 , pp. 64-75
    • Kumar, R.1
  • 23
    • 34247331460
    • Dynamic thread assignment on heterogeneous multiprocessor architectures
    • New York, NY, USA
    • M. Becchi, P. Crowley, Dynamic thread assignment on heterogeneous multiprocessor architectures, in: CF'06, New York, NY, USA, 2006, pp. 29-40.
    • (2006) CF'06 , pp. 29-40
    • Becchi, M.1    Crowley, P.2
  • 26
    • 84866869010
    • MATE-CG: A mapreduce-like framework for accelerating data-intensive computations on heterogeneous clusters
    • preparation
    • W. Jiang, G. Agrawal, MATE-CG: a mapreduce-like framework for accelerating data-intensive computations on heterogeneous clusters, in: IPDPS'12, 2012, in preparation.
    • (2012) IPDPS'12
    • Jiang, W.1    Agrawal, G.2
  • 27
    • 84886101820
    • Torque Resource Manager
    • Torque Resource Manager. http://www.clusterresources.com/products/torque-resource-manager.php.
  • 30
    • 0006547373
    • Scheduling resources in multi-user, heterogeneous, computing environments with smart-net
    • IEEE Computer Society Washington, DC, USA
    • R.F.t. Freund Scheduling resources in multi-user, heterogeneous, computing environments with smart-net Proceedings of the Seventh Heterogeneous Computing Workshop HCW'98 1998 IEEE Computer Society Washington, DC, USA 3
    • (1998) Proceedings of the Seventh Heterogeneous Computing Workshop HCW'98 , pp. 3
    • Freund, R.F.T.1
  • 31
    • 0036802314
    • Using moldability to improve the performance of supercomputer jobs
    • W. Cirne, and F. Berman Using moldability to improve the performance of supercomputer jobs Journal of Parallel and Distributed Computing 62 10 2002 1571 1601
    • (2002) Journal of Parallel and Distributed Computing , vol.62 , Issue.10 , pp. 1571-1601
    • Cirne, W.1    Berman, F.2
  • 32
    • 34248186212
    • Effective Selection of Partition Sizes for Moldable Scheduling of Parallel Jobs
    • High Performance Computing - HiPC 2002
    • S. Srinivasan, V. Subramani, R. Kettimuthu, P. Holenarsipur, and P. Sadayappan Effective selection of partition sizes for moldable scheduling of parallel jobs S. Sahni, V.K. Prasanna, U. Shukla, HiPC Lecture Notes in Computer Science vol. 2552 2002 Springer 174 183 (Pubitemid 36140700)
    • (2002) Lecture Notes In Computer Science , Issue.2552 , pp. 174-183
    • Srinivasan, S.1    Subramani, V.2    Kettimuthu, R.3    Holenarsipur, P.4    Sadayappan, P.5
  • 34
    • 84934343585
    • Realistic modeling and synthesis of resources for computational grids
    • Y.S. Kee, H. Casanova, A. Chien, Realistic modeling and synthesis of resources for computational grids, in: SC'04, 2004, pp. 54-63.
    • (2004) SC'04 , pp. 54-63
    • Kee, Y.S.1    Casanova, H.2    Chien, A.3
  • 35
    • 34548270092
    • Improving grid resource allocation via integrated selection and binding
    • Y.S. Kee, K. Yocum, A.A. Chien, H. Casanova, Improving grid resource allocation via integrated selection and binding, in: SC'06, 2006.
    • (2006) SC'06
    • Kee, Y.S.1    Yocum, K.2    Chien, A.A.3    Casanova, H.4
  • 37
    • 84983561277
    • Scheduling parallel applications on utility grids: Time and cost trade-off management
    • S.K. Garg, R. Buyya, H.J. Siegel, Scheduling parallel applications on utility grids: time and cost trade-off management, in: ACSC'09, 2009, pp. 139-147.
    • (2009) ACSC'09 , pp. 139-147
    • Garg, S.K.1    Buyya, R.2    Siegel, H.J.3


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.