SCOPUS Information Retrieval Platform

Future Generation Computer Systems

Volume 29, Issue 8, 2013, Pages 2262-2271

Scheduling concurrent applications on a cluster of CPU-GPU nodes

(5) Ravi, Vignesh T. a; Becchi, Michela b; Jiang, Wei a; Agrawal, Gagan a; Chakradhar, Srimat c

a Ohio State University (United States)

b University of Missouri (United States)

c NEC LABORATORIES AMERICA (United States)

Author keywords

CPU GPU systems; Scheduling

Indexed keywords

GRAPHICS PROCESSING UNIT; ROUTERS;

CLOUD ENVIRONMENTS; EXECUTION SCENARIO; HETEROGENEOUS ARCHITECTURES; HETEROGENEOUS CLUSTERS; HETEROGENEOUS NODES; ROUND ROBIN SCHEDULERS; SCHEDULING POLICIES; SCHEDULING SCHEMES;

SCHEDULING;

EID: 84886092852 PISSN: 0167739X EISSN: None Source Type: Journal
DOI: 10.1016/j.future.2013.06.002 Document Type: Article

Times cited : (14)

References (37)

1
- 78651550268
- Scalable parallel programming with CUDA
- J. Nickolls, I. Buck, M. Garland, and K. Skadron Scalable parallel programming with CUDA Queue 6 2008 40 53
- (2008) Queue , vol.6 , pp. 40-53
- Nickolls, J.¹ Buck, I.² Garland, M.³ Skadron, K.⁴

2
- 84886091143
- OpenCL
- OpenCL. http://www.khronos.org/opencl/.

3
- 78149231331
- MapCG: Writing parallel program portable between CPU and GPU
- New York, NY, USA
- C. Hong, et al. MapCG: writing parallel program portable between CPU and GPU, in: PACT'10, New York, NY, USA, 2010, pp. 217-226.
- (2010) PACT'10 , pp. 217-226
- Hong, C.¹

4
- 78149233155
- Ocelot: A dynamic optimization framework for bulk-synchronous applications in heterogeneous systems
- New York, NY, USA
- G.F. Diamos, et al. Ocelot: a dynamic optimization framework for bulk-synchronous applications in heterogeneous systems, in: PACT'10, New York, NY, USA, 2010, pp. 353-364.
- (2010) PACT'10 , pp. 353-364
- Diamos, G.F.¹

5
- 72049125355
- Coordinating the use of GPU and CPU for improving performance of compute intensive applications
- G. Teodoro, et al. Coordinating the use of GPU and CPU for improving performance of compute intensive applications, in: CLUSTER'09, 2009, pp. 1-10.
- (2009) CLUSTER'09 , pp. 1-10
- Teodoro, G.¹

6
- 57349153933
- Harmony: An execution model and runtime for heterogeneous many core systems
- New York, NY, USA
- G.F. Diamos, S. Yalamanchili, Harmony: an execution model and runtime for heterogeneous many core systems, in: HPDC'08, New York, NY, USA, 2008, pp. 197-200.
- (2008) HPDC'08 , pp. 197-200
- Diamos, G.F.¹ Yalamanchili, S.²

7
- 76749140917
- Qilin: Exploiting parallelism on heterogeneous multiprocessors with adaptive mapping
- New York, NY, USA
- C.-K. Luk, S. Hong, H. Kim, Qilin: exploiting parallelism on heterogeneous multiprocessors with adaptive mapping, in: MICRO'09, New York, NY, USA, 2009, pp. 45-55.
- (2009) MICRO'09 , pp. 45-55
- Luk, C.-K.¹ Hong, S.² Kim, H.³

8
- 77954709868
- Compiler and runtime support for enabling generalized reduction computations on heterogeneous parallel configurations
- New York, NY, USA
- V.T. Ravi, W. Ma, D. Chiu, G. Agrawal, Compiler and runtime support for enabling generalized reduction computations on heterogeneous parallel configurations, in: ICS'10, New York, NY, USA, 2010, pp. 137-146.
- (2010) ICS'10 , pp. 137-146
- Ravi, V.T.¹ Ma, W.² Chiu, D.³ Agrawal, G.⁴

9
- 77954927300
- Data-aware scheduling of legacy kernels on heterogeneous platforms with distributed memory
- New York, NY, USA
- M. Becchi, et al. Data-aware scheduling of legacy kernels on heterogeneous platforms with distributed memory, in: SPAA'10, New York, NY, USA, 2010, pp. 82-91.
- (2010) SPAA'10 , pp. 82-91
- Becchi, M.¹

10
- 58449109179
- MCUDA: An efficient implementation of CUDA Kernels for multi-core CPUs
- J. Stratton, S. Stone, W.-m. Hwu, MCUDA: An efficient implementation of CUDA Kernels for multi-core CPUs, in: 21st Annual Workshop on Languages and Compilers for Parallel Computing, LCPC'2008, 2008. URL http://www.gigascale.org/pubs/1328.html.
- (2008) 21st Annual Workshop on Languages and Compilers for Parallel Computing, LCPC'2008
- Stratton, J.¹ Stone, S.² Hwu, W.³

11
- 84877719088
- PGI CUDA-X86 Compiler. http://www.pgroup.com/resources/cuda-x86.htm.
- PGI CUDA-X86 Compiler

12
- 79251597562
- Swan: A tool for porting CUDA programs to OpenCL
- M. Harvey, and G.D. Fabritiis Swan: a tool for porting CUDA programs to OpenCL Computer Physics Communications 182 4 2011 1093 1099 URL http://www.sciencedirect.com/science/article/pii/S0010465511000117
- (2011) Computer Physics Communications , vol.182 , Issue.4 , pp. 1093-1099
- Harvey, M.¹ Fabritiis, G.D.²

13
- 78650802947
- OpenMPC: Extended OpenMP programming and tuning for GPUs
- SC'10, IEEE Computer Society, Washington, DC, USA
- S. Lee, and R. Eigenmann OpenMPC: extended OpenMP programming and tuning for GPUs Proceedings of the 2010 ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis SC'10 2010 IEEE Computer Society Washington, DC, USA 1 11 URL http://dx.doi.org/10.1109/SC.2010.36
- (2010) Proceedings of the 2010 ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis , pp. 1-11
- Lee, S.¹ Eigenmann, R.²

14
- 70649092154
- Rodinia: A benchmark suite for heterogeneous computing
- Washington, DC, USA
- S. Che, et al. Rodinia: a benchmark suite for heterogeneous computing, in: IISWC'09, Washington, DC, USA, 2009, pp. 44-54.
- (2009) IISWC'09 , pp. 44-54
- Che, S.¹

15
- 84886087035
- Parboil Benchmark Suite
- Parboil Benchmark Suite. http://impact.crhc.illinois.edu/parboil.php.

16
- 77952256778
- Modeling GPU-CPU workloads and systems
- A. Kerr, et al. Modeling GPU-CPU workloads and systems, in: GPGPU'10, 2010, pp. 31-42.
- (2010) GPGPU'10 , pp. 31-42
- Kerr, A.¹

17
- 23944523844
- Parallel job scheduling - A status report
- D.G. Feitelson, L. Rudolph, U. Schwiegelshohn, Parallel job scheduling - a status report, in: JSSPP, 2004, pp. 1-16.
- (2004) JSSPP , pp. 1-16
- Feitelson, D.G.¹ Rudolph, L.² Schwiegelshohn, U.³

18
- 0030149947
- Effective distributed scheduling of parallel workloads
- A.C. Dusseau, R.H. Arpaci, and D.E. Culler Effective distributed scheduling of parallel workloads SIGMETRICS Performance Evaluation Review 24 1996 25 36 (Pubitemid 126549537)
- (1996) Performance Evaluation Review , vol.24 , Issue.1 , pp. 25-36
- Dusseau, A.C.¹ Arpaci, R.H.² Culler, D.E.³

19
- 0027721450
- Performance analysis of job scheduling policies in parallel supercomputing environments
- New York, NY, USA
- V.K. Naik, M.S. Squillante, S.K. Setia, Performance analysis of job scheduling policies in parallel supercomputing environments, in: Supercomputing'93, New York, NY, USA, 1993, pp. 824-833.
- (1993) Supercomputing'93 , pp. 824-833
- Naik, V.K.¹ Squillante, M.S.² Setia, S.K.³

20
- 0242656076
- Scheduling of parallel jobs in a heterogeneous multi-site environment
- G. Sabin, R. Kettimuthu, A. Rajan, P. Sadayappan, Scheduling of parallel jobs in a heterogeneous multi-site environment, in: Proc. of the 9th International Workshop on Job Scheduling Strategies for Parallel Processing, 2003, pp. 87-104.
- (2003) Proc. of the 9th International Workshop on Job Scheduling Strategies for Parallel Processing , pp. 87-104
- Sabin, G.¹ Kettimuthu, R.² Rajan, A.³ Sadayappan, P.⁴

21
- 27544449350
- Assessment and enhancement of meta-schedulers for multi-site job sharing
- Proceedings - 14th IEEE International Symposium on High Performance Distributed Computing, HPDC-14
- G. Sabin, V. Sahasrabudhe, P. Sadayappan, Assessment and enhancement of meta-schedulers for multi-site job sharing, in: HPDC'05, Washington, DC, USA, 2005, pp. 144-153. (Pubitemid 41543484)
- (2005) Proceedings of the IEEE International Symposium on High Performance Distributed Computing , pp. 144-153
- Sabin, G.¹ Sahasrabudhe, V.² Sadayappan, P.³

22
- 4644370318
- Single-ISA heterogeneous multi-core architectures for multithreaded workload performance
- IEEE Computer Society Washington, DC, USA
- R. Kumar Single-ISA heterogeneous multi-core architectures for multithreaded workload performance ISCA'04 2004 IEEE Computer Society Washington, DC, USA 64 75
- (2004) ISCA'04 , pp. 64-75
- Kumar, R.¹

23
- 34247331460
- Dynamic thread assignment on heterogeneous multiprocessor architectures
- New York, NY, USA
- M. Becchi, P. Crowley, Dynamic thread assignment on heterogeneous multiprocessor architectures, in: CF'06, New York, NY, USA, 2006, pp. 29-40.
- (2006) CF'06 , pp. 29-40
- Becchi, M.¹ Crowley, P.²

24
- 84883263468
- SLURM: a highly scalable resource manager. https://computing.llnl.gov/linux/slurm/slurm.html.
- SLURM: A Highly Scalable Resource Manager

25
- 83455220920
- Comprehensive performance monitoring for GPU cluster systems
- IEEE Computer Society Washington, DC, USA
- K. Furlinger, N.J. Wright, and D. Skinner Comprehensive performance monitoring for GPU cluster systems Proceedings of the 2011 IEEE International Symposium on Parallel and Distributed Processing Workshops and Ph.D. Forum IPDPSW '11 2011 IEEE Computer Society Washington, DC, USA 1377 1386 URL http://dx.doi.org/10.1109/IPDPS.2011.289
- (2011) Proceedings of the 2011 IEEE International Symposium on Parallel and Distributed Processing Workshops and Ph.D. Forum IPDPSW '11 , pp. 1377-1386
- Furlinger, K.¹ Wright, N.J.² Skinner, D.³

26
- 84866869010
- MATE-CG: A mapreduce-like framework for accelerating data-intensive computations on heterogeneous clusters
- In preparation
- W. Jiang, G. Agrawal, MATE-CG: a mapreduce-like framework for accelerating data-intensive computations on heterogeneous clusters, in: IPDPS'12, 2012, in preparation.
- (2012) IPDPS'12
- Jiang, W.¹ Agrawal, G.²

27
- 84886101820
- Torque Resource Manager
- Torque Resource Manager. http://www.clusterresources.com/products/torque-resource-manager.php.

28
- 0032591264
- Dynamic matching and scheduling of a class of independent tasks onto heterogeneous computing systems
- IEEE Computer Society Washington, DC, USA
- M. Maheswaran, S. Ali, H.J. Siegel, D. Hensgen, and R.F. Freund Dynamic matching and scheduling of a class of independent tasks onto heterogeneous computing systems Proceedings of the Eighth Heterogeneous Computing Workshop HCW'99 1999 IEEE Computer Society Washington, DC, USA 30
- (1999) Proceedings of the Eighth Heterogeneous Computing Workshop HCW'99 , pp. 30
- Maheswaran, M.¹ Ali, S.² Siegel, H.J.³ Hensgen, D.⁴ Freund, R.F.⁵

29
- 0002691736
- The elusive goal of workload characterization
- A.B. Downey, and D.G. Feitelson The elusive goal of workload characterization SIGMETRICS Performance Evaluation Review 26 1999 14 29 URL http://doi.acm.org/10.1145/309746.309750
- (1999) SIGMETRICS Performance Evaluation Review , vol.26 , pp. 14-29
- Downey, A.B.¹ Feitelson, D.G.²

30
- 0006547373
- Scheduling resources in multi-user, heterogeneous, computing environments with smart-net
- IEEE Computer Society Washington, DC, USA
- R.F.t. Freund Scheduling resources in multi-user, heterogeneous, computing environments with smart-net Proceedings of the Seventh Heterogeneous Computing Workshop HCW'98 1998 IEEE Computer Society Washington, DC, USA 3
- (1998) Proceedings of the Seventh Heterogeneous Computing Workshop HCW'98 , pp. 3
- Freund, R.F.T.¹

31
- 0036802314
- Using moldability to improve the performance of supercomputer jobs
- W. Cirne, and F. Berman Using moldability to improve the performance of supercomputer jobs Journal of Parallel and Distributed Computing 62 10 2002 1571 1601
- (2002) Journal of Parallel and Distributed Computing , vol.62 , Issue.10 , pp. 1571-1601
- Cirne, W.¹ Berman, F.²

32
- 34248186212
- Effective Selection of Partition Sizes for Moldable Scheduling of Parallel Jobs
- High Performance Computing - HiPC 2002
- S. Srinivasan, V. Subramani, R. Kettimuthu, P. Holenarsipur, and P. Sadayappan Effective selection of partition sizes for moldable scheduling of parallel jobs S. Sahni, V.K. Prasanna, U. Shukla, HiPC Lecture Notes in Computer Science vol. 2552 2002 Springer 174 183 (Pubitemid 36140700)
- (2002) Lecture Notes In Computer Science , Issue.2552 , pp. 174-183
- Srinivasan, S.¹ Subramani, V.² Kettimuthu, R.³ Holenarsipur, P.⁴ Sadayappan, P.⁵

33
- 0032202051
- DPS: Dynamic priority scheduling heuristic for heterogeneous computing systems
- I. Ahmad, M. Dhodhi, and R. Ul-Mustafa DPS: dynamic priority scheduling heuristic for heterogeneous computing systems IEE Proceedings - Computers and Digital Techniques 145 6 1998 411 418
- (1998) IEE Proceedings - Computers and Digital Techniques , vol.145 , Issue.6 , pp. 411-418
- Ahmad, I.¹ Dhodhi, M.² Ul-Mustafa, R.³

34
- 84934343585
- Realistic modeling and synthesis of resources for computational grids
- Y.S. Kee, H. Casanova, A. Chien, Realistic modeling and synthesis of resources for computational grids, in: SC'04, 2004, pp. 54-63.
- (2004) SC'04 , pp. 54-63
- Kee, Y.S.¹ Casanova, H.² Chien, A.³

35
- 34548270092
- Improving grid resource allocation via integrated selection and binding
- Y.S. Kee, K. Yocum, A.A. Chien, H. Casanova, Improving grid resource allocation via integrated selection and binding, in: SC'06, 2006.
- (2006) SC'06
- Kee, Y.S.¹ Yocum, K.² Chien, A.A.³ Casanova, H.⁴

36
- 23944436115
- New grid scheduling and rescheduling methods in the GrADS project
- DOI 10.1007/s10766-005-3584-4
- F. Berman New grid scheduling and rescheduling methods in the GrADS project International Journal of Parallel Programming 2005 209 229 (Pubitemid 41202442)
- (2005) International Journal of Parallel Programming , vol.33 , Issue.2-3 , pp. 209-229
- Berman, F.¹ Casanova, H.² Chien, A.³ Cooper, K.⁴ Dail, H.⁵ Dasgupta, A.⁶ Deng, W.⁷ Dongarra, J.⁸ Johnsson, L.⁹ Kennedy, K.¹⁰ Koelbel, C.¹¹ Liu, B.¹² Liu, X.¹³ Mandal, A.¹⁴ Marin, G.¹⁵ Mazina, M.¹⁶ Mellor-Crummey, J.¹⁷ Mendes, C.¹⁸ Olugbile, A.¹⁹ Patel, M.²⁰ et al.

37
- 84983561277
- Scheduling parallel applications on utility grids: Time and cost trade-off management
- S.K. Garg, R. Buyya, H.J. Siegel, Scheduling parallel applications on utility grids: time and cost trade-off management, in: ACSC'09, 2009, pp. 139-147.
- (2009) ACSC'09 , pp. 139-147
- Garg, S.K.¹ Buyya, R.² Siegel, H.J.³

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.