메뉴 건너뛰기




Volumn 29, Issue 5, 2018, Pages 1089-1102

Intra-Node Memory Safe GPU Co-Scheduling

Author keywords

accelerator; access synchronisation; GPU co scheduling; memory safe; schedGPU; under utilisation

Indexed keywords

COMPUTER GRAPHICS; GRAPHICS PROCESSING UNIT; LIBRARIES; MEMORY ARCHITECTURE; MULTITASKING; PARTICLE ACCELERATORS; PROGRAM PROCESSORS; SCHEDULING; SERVERS;

EID: 85039762526     PISSN: 10459219     EISSN: None     Source Type: Journal    
DOI: 10.1109/TPDS.2017.2784428     Document Type: Article
Times cited : (13)

References (35)
  • 2
    • 84904350194 scopus 로고    scopus 로고
    • Trends in high-performance computing for engineering calculations
    • Art. no. 20130319
    • M. B. Giles and I. Reguly, "Trends in high-performance computing for engineering calculations, " Philosoph. Trans. Roy. Soc. London Series A, vol. 372, 2014, Art. no. 20130319.
    • (2014) Philosoph. Trans. Roy. Soc. London Series A , vol.372
    • Giles, M.B.1    Reguly, I.2
  • 4
    • 85045585440 scopus 로고    scopus 로고
    • Adaptive Computing. TORQUE Resource Manager. [Online]
    • Adaptive Computing. TORQUE Resource Manager. 2016. [Online]. Available: http://www.adaptivecomputing.com/products/open-source/torque/
    • (2016)
  • 6
    • 84907440423 scopus 로고    scopus 로고
    • A survey of methods for analysing and improving GPU energy efficiency
    • S. Mittal and J. S. Vetter, "A survey of methods for analysing and improving GPU energy efficiency, " ACM Comput. Surveys, vol. 47, no. 2, pp. 19:1-19:23, 2014.
    • (2014) ACM Comput. Surveys , vol.47 , Issue.2 , pp. 191-1923
    • Mittal, S.1    Vetter, J.S.2
  • 7
    • 84959042403 scopus 로고    scopus 로고
    • Acceleration-as-a-service: Exploiting virtualised GPUs for a financial application
    • B. Varghese, J. Prades, C. Reano, and F. Silla, "Acceleration-as-a-service: Exploiting virtualised GPUs for a financial application, " in Proc. 11th IEEE Int. Conf. e-Sci., 2015, pp. 47-56.
    • (2015) Proc. 11th IEEE Int. Conf. E-Sci. , pp. 47-56
    • Varghese, B.1    Prades, J.2    Reano, C.3    Silla, F.4
  • 10
    • 80955141000 scopus 로고    scopus 로고
    • Exploring fine-grained task-based execution on multi-GPU systems
    • L. Chen, O Villa, and G R. Gao, "Exploring fine-grained task-based execution on multi-GPU systems, " in Proc. IEEE Int. Conf. Cluster Comput., 2011, pp. 386-394.
    • (2011) Proc. IEEE Int. Conf. Cluster Comput. , pp. 386-394
    • Chen, L.1    Villa, O.2    Gao, G.R.3
  • 11
    • 85045565081 scopus 로고    scopus 로고
    • Towards multi-tenant GPGPU: Event-driven programming model for system-wide scheduling on shared GPUs
    • Y. Suzuki, H. Yamada, S. Kato, and K. Kono, "Towards multi-tenant GPGPU: Event-driven programming model for system-wide scheduling on shared GPUs, " in Proc. Workshop Multicore Rack-Scale Syst., 2016, pp. 1-7.
    • (2016) Proc. Workshop Multicore Rack-Scale Syst. , pp. 1-7
    • Suzuki, Y.1    Yamada, H.2    Kato, S.3    Kono, K.4
  • 14
    • 84964682582 scopus 로고    scopus 로고
    • Improving application concurrency on GPUs by managing implicit and explicit syn-chronisations
    • M. Butler, K. Sajjapongse, and M. Becchi, "Improving application concurrency on GPUs by managing implicit and explicit syn-chronisations, " in Proc. 21st IEEE Int. Conf. Parallel Distrib. Syst., 2015, pp. 535-544.
    • (2015) Proc. 21st IEEE Int. Conf. Parallel Distrib. Syst. , pp. 535-544
    • Butler, M.1    Sajjapongse, K.2    Becchi, M.3
  • 15
    • 84991588487 scopus 로고    scopus 로고
    • Improving GPU utilisation with multi-process service (MPS)
    • [Online]
    • P. Sah, "Improving GPU utilisation with multi-process service (MPS), " in Proc. GPU Technol. Conf., ID S5584, 2015. [Online]. Available: http://on-demand.gputechconf.com/gtc/2015/presentation/S5584-Priyanka-Sah.pdf
    • (2015) Proc. GPU Technol. Conf., ID S5584
    • Sah, P.1
  • 16
    • 84906356952 scopus 로고    scopus 로고
    • Multi-threaded Kernel offloading to GPGPU using hyper-Q on Kepler architecture
    • Jun.
    • F. Wende, T. Steinke, and F. Cordes, "Multi-threaded Kernel offloading to GPGPU using hyper-Q on Kepler architecture, " in Proc. Zuse Inst. Berlin Rep., Jun. 2014, pp. 1-17.
    • (2014) Proc. Zuse Inst. Berlin Rep , pp. 1-17
    • Wende, F.1    Steinke, T.2    Cordes, F.3
  • 17
    • 85045564794 scopus 로고    scopus 로고
    • NVIDIA, CUDA CProgramming Guide 8.0, 2016. [Online]
    • NVIDIA, CUDA CProgramming Guide 8.0, 2016. [Online]. Available: https://docs.nvidia.com/cuda/pdf/CUDA-C-Programming- Guide.pdf
  • 20
    • 0042830650 scopus 로고    scopus 로고
    • Performance analysis of five interprocess communication mechanisms across UNIX operating systems
    • P. K. Immich, R. S. Bhagavatula, and R. Pendse, "Performance analysis of five interprocess communication mechanisms across UNIX operating systems, " J. Syst. Softw., vol. 68, no. 1, pp. 27-43, 2003.
    • (2003) J. Syst. Softw. , vol.68 , Issue.1 , pp. 27-43
    • Immich, P.K.1    Bhagavatula, R.S.2    Pendse, R.3
  • 21
    • 0027721450 scopus 로고
    • Performance analysis of job scheduling policies in parallel supercomputing environ-ments
    • V. K. Naik, M. S. Squillante, and S. K. Setia, "Performance analysis of job scheduling policies in parallel supercomputing environ-ments, " in Proc. IEEE/ACM Conf. Supercomput., 1993, pp. 824-833.
    • (1993) Proc. IEEE/ACM Conf. Supercomput. , pp. 824-833
    • Naik, V.K.1    Squillante, M.S.2    Setia, S.K.3
  • 22
    • 84976722900 scopus 로고
    • The impact of operating system scheduling policies and synchronisation methods of per-formance of parallel applications
    • A. Gupta, A. Tucker, and S. Urushibara, "The impact of operating system scheduling policies and synchronisation methods of per-formance of parallel applications, " SIGMETRICS Perform. Eval. Rev., vol. 19, no. 1, pp. 120-132, 1991.
    • (1991) SIGMETRICS Perform. Eval. Rev. , vol.19 , Issue.1 , pp. 120-132
    • Gupta, A.1    Tucker, A.2    Urushibara, S.3
  • 23
    • 70649092154 scopus 로고    scopus 로고
    • Rodinia: A benchmark suite for heterogeneous computing
    • S. Che, et al., "Rodinia: A benchmark suite for heterogeneous computing, " in Proc. IEEE Int. Symp. Workload Characterization, 2009, pp. 44-54.
    • (2009) Proc. IEEE Int. Symp. Workload Characterization , pp. 44-54
    • Che, S.1
  • 26
    • 69949100622 scopus 로고    scopus 로고
    • Optimising data intensive GPGPU computations for DNA sequence alignment
    • C. Trapnell and M. C. Schatz, "Optimising data intensive GPGPU computations for DNA sequence alignment, " Parallel Comput., vol. 35, no. 8/9, pp. 429-440, 2009.
    • (2009) Parallel Comput. , vol.35 , Issue.8-9 , pp. 429-440
    • Trapnell, C.1    Schatz, M.C.2
  • 27
    • 78651415181 scopus 로고    scopus 로고
    • GPU-BLAST: Using graphics processors to accelerate protein sequence alignment
    • P. D. Vouzis and N. V. Sahinidis, "GPU-BLAST: Using graphics processors to accelerate protein sequence alignment, " Bioinf., vol. 27, no. 2, pp. 182-188, 2011.
    • (2011) Bioinf. , vol.27 , Issue.2 , pp. 182-188
    • Vouzis, P.D.1    Sahinidis, N.V.2
  • 30
    • 85045564529 scopus 로고    scopus 로고
    • NVIDIA CUDA Multi-Process Service, May 2015. [Online]
    • NVIDIA, CUDA Multi-Process Service, May 2015. [Online]. Available: https://docs.nvidia.com/deploy/pdf/CUDA-Multi-Process- Service-Overview.pdf
  • 31
    • 85045563630 scopus 로고    scopus 로고
    • [Online]
    • T. Bradley, Hyper-Q Example, NVIDIA, 2013. [Online]. Available: https://www.ecse.rpi.edu/wrf/wiki/ParallelComputingSpring 2014/cuda-samples/samples/6-Advanced/simpleHyperQ/doc/HyperQ.pdf
    • (2013) Hyper-Q Example NVIDIA
    • Bradley, T.1
  • 34
    • 85045576958 scopus 로고    scopus 로고
    • NVIDIA, Tesla P100, 2016. [Online]
    • NVIDIA, Tesla P100, 2016. [Online]. Available: https://images. nvidia.com/content/pdf/tesla/whitepaper/pascal-architecture-whitepaper.pdf


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.