SCOPUS 정보 검색 플랫폼

ACM Computing Surveys

Volumn 50, Issue 3, 2017, Pages

GPU virtualization and scheduling methods: A comprehensive survey

(3) Hong, Cheol Ho a Spence, Ivor a Nikolopoulos, Dimitrios S a

a QUEEN'S UNIVERSITY BELFAST (United Kingdom)

Author keywords

Cloud computing; CPU GPU heterogeneous computing; GPU scheduling methods; GPU virtualization

Indexed keywords

CLOUD COMPUTING; COMPUTER GRAPHICS; COMPUTER HARDWARE; DISTRIBUTED COMPUTER SYSTEMS; ENERGY EFFICIENCY; GREEN COMPUTING; NETWORK FUNCTION VIRTUALIZATION; PROGRAM PROCESSORS; SCHEDULING; SURVEYS; VIRTUAL ADDRESSES; VIRTUAL REALITY; VIRTUALIZATION;

DEPTH SURVEYS; HETEROGENEOUS COMPUTING; HIGH PERFORMANCE COMPUTING; LARGE SCALE DISTRIBUTED COMPUTING; PARADIGM SHIFTS; SCHEDULING METHODS; VIRTUALIZATION TECHNIQUES;

GRAPHICS PROCESSING UNIT;

EID: 85027078923 PISSN: 03600300 EISSN: 15577341 Source Type: Journal
DOI: 10.1145/3068281 Document Type: Review

Times cited : (97)

References (173)

1
- 34247885571
- Intel virtualization technology for directed I/O
- 2006
- Darren Abramson, Jeff Jackson, Sridhar Muthrasanallur, Gil Neiger, Greg Regnier, Rajesh Sankaran, Ioannis Schoinas, Rich Uhlig, Balaji Vembu, and John Wiegert. 2006. Intel virtualization technology for directed I/O. Intel Technol. J. 10, 3 (2006).
- (2006) Intel Technol. J. , vol.10 , Issue.3
- Abramson, D.¹ Jackson, J.² Muthrasanallur, S.³ Neiger, G.⁴ Regnier, G.⁵ Sankaran, R.⁶ Schoinas, I.⁷ Uhlig, R.⁸ Vembu, B.⁹ Wiegert, J.¹⁰

2
- 65749112026
- EC Amazon. 2010. Amazon elastic compute cloud (Amazon EC2). https://aws.amazon.com/ec2/.
- (2010) Amazon Elastic Compute Cloud (Amazon EC2)

3
- 85027038682
- 2009
- AMD. 2009. R6xx-3D-Registers.pdf. Retrieved from http://amd-dev. wpengine.netdna-cdn.com/wordpress/media/2013/10/R6xx-3D-Registers.pdf. (2009).
- (2009) R6xx-3D-Registers.pdf

4
- 84905472992
- HOOMD-blue, general-purpose many-body dynamics on the GPU
- Joshua Anderson, Aaron Keys, Carolyn Phillips, Trung Dac Nguyen, and Sharon Glotzer. 2010. HOOMD-blue, general-purpose many-body dynamics on the GPU. In APS Meeting Abstracts, Vol. 1. 18008.
- (2010) APS Meeting Abstracts , vol.1 , pp. 18008
- Anderson, J.¹ Keys, A.² Phillips, C.³ Nguyen, T.D.⁴ Glotzer, S.⁵

5
- 35648995516
- EECS Department Technical Report, University of California, Berkeley
- Krste Asanovic, Ras Bodik, Bryan Christopher Catanzaro, Joseph James Gebis, Parry Husbands, Kurt Keutzer, David A. Patterson, William Lester Plishker, John Shalf, Samuel Webb Williams, and others. 2006. The Landscape of Parallel Computing Research: A View from Berkeley. EECS Department Technical Report UCB/EECS-2006-183. University of California, Berkeley.
- (2006) The Landscape of Parallel Computing Research: A View from Berkeley
- Asanovic, K.¹ Bodik, R.² Catanzaro, B.C.³ Gebis, J.J.⁴ Husbands, P.⁵ Keutzer, K.⁶ Patterson, D.A.⁷ Plishker, W.L.⁸ Shalf, J.⁹ Williams, S.W.¹⁰

6
- 70350729133
- Accelerating monte carlo simulations of photon transport in a voxelized geometry using a massively parallel graphics processing unit
- 2009
- Andreu Badal and Aldo Badano. 2009. Accelerating monte carlo simulations of photon transport in a voxelized geometry using a massively parallel graphics processing unit. Med. Phys. 36, 11(2009), 4878-4880.
- (2009) Med. Phys. , vol.36 , Issue.11 , pp. 4878-4880
- Badal, A.¹ Badano, A.²

7
- 21644433634
- Xen and the art of virtualization
- 2003
- Paul Barham, Boris Dragovic, Keir Fraser, Steven Hand, Tim Harris, Alex Ho, Rolf Neugebauer, Ian Pratt, and Andrew Warfield. 2003. Xen and the art of virtualization. ACM SIGOPS Operat. Syst. Rev. 37, 5(2003), 164-177.
- (2003) ACM SIGOPS Operat. Syst. Rev. , vol.37 , Issue.5 , pp. 164-177
- Barham, P.¹ Dragovic, B.² Fraser, K.³ Hand, S.⁴ Harris, T.⁵ Ho, A.⁶ Neugebauer, R.⁷ Pratt, I.⁸ Warfield, A.⁹

8
- 84899626479
- GPU acceleration for support vector machines
- TU Delft; EWI; MM; PRB, Delft, The Netherlands
- Andreas Athanasopoulos, Anastasios Dimou, Vasileios Mezaris, and Ioannis Kompatsiaris. 2011. GPU acceleration for support vector machines. In 12th International Workshop on Image Analysis for Multimedia Interactive Services (WIAMIS'11). TU Delft; EWI; MM; PRB, Delft, The Netherlands.
- (2011) 12th International Workshop on Image Analysis for Multimedia Interactive Services (WIAMIS'11)
- Athanasopoulos, A.¹ Dimou, A.² Mezaris, V.³ Kompatsiaris, I.⁴

9
- 84866460333
- Supporting preemptive task executions and memory copies in gpgpus
- IEEE
- Can Basaran and Kyoung-Don Kang. 2012. Supporting preemptive task executions and memory copies in gpgpus. In Proceedings of the 2012 24th Euromicro Conference on Real-Time Systems. IEEE, 287-296.
- (2012) Proceedings of the 2012 24th Euromicro Conference on Real-time Systems , pp. 287-296
- Basaran, C.¹ Kang, K.-D.²

10
- 84863973589
- A virtual memory based runtime to support multi-tenancy in clusters with GPUs
- ACM
- Michela Becchi, Kittisak Sajjapongse, Ian Graves, Adam Procter, Vignesh Ravi, and Srimat Chakradhar. 2012. A virtual memory based runtime to support multi-tenancy in clusters with GPUs. In Proceedings of the 21st International Symposium on High-Performance Parallel and Distributed Computing. ACM, 97-108.
- (2012) Proceedings of the 21st International Symposium on High-performance Parallel and Distributed Computing , pp. 97-108
- Becchi, M.¹ Sajjapongse, K.² Graves, I.³ Procter, A.⁴ Ravi, V.⁵ Chakradhar, S.⁶

11
- 0035481820
- Credit-based fair queueing (CBFQ): A simple service-scheduling algorithm for packet-switched networks
- 2001
- Brahim Bensaou, Danny H. K. Tsang, and King Tung Chan. 2001. Credit-based fair queueing (CBFQ): A simple service-scheduling algorithm for packet-switched networks. IEEE/ACM Trans. Network. 9, 5(2001), 591-604.
- (2001) IEEE/ACM Trans. Network. , vol.9 , Issue.5 , pp. 591-604
- Bensaou, B.¹ Tsang, D.H.K.² Chan, K.T.³

12
- 33749245662
- The direct3d 10 system
- ACM
- David Blythe. 2006. The direct3d 10 system. In ACM Transactions on Graphics, Vol. 25. ACM, 724-734.
- (2006) ACM Transactions on Graphics , vol.25 , pp. 724-734
- Blythe, D.¹

13
- 84988905990
- Understanding GPU power: A survey of profiling, modeling, and simulation methods
- 2016
- Robert A. Bridges, Neena Imam, and Tiffany M Mintz. 2016. Understanding GPU power: A survey of profiling, modeling, and simulation methods. ACM Comput. Surv. 49, 3(2016), 41.
- (2016) ACM Comput. Surv. , vol.49 , Issue.3 , pp. 41
- Bridges, R.A.¹ Imam, N.² Mintz, T.M.³

14
- 85077094087
- Fido: Fast inter-virtual-machine communication for enterprise appliances
- Anton Burtsev, Kiran Srinivasan, Prashanth Radhakrishnan, Kaladhar Voruganti, and Garth R. Goodson. 2009. Fido: Fast inter-virtual-machine communication for enterprise appliances. In Proceedings of the USENIX Annual Technical Conference.
- (2009) Proceedings of the USENIX Annual Technical Conference
- Burtsev, A.¹ Srinivasan, K.² Radhakrishnan, P.³ Voruganti, K.⁴ Goodson, G.R.⁵

15
- 84959287454
- Exploring the suitability of remote GPGPU virtualization for the OpenACC programming model using rCUDA
- IEEE
- Adrián Castelló, Antonio J. Peña, Rafael Mayo, Pavan Balaji, and Enrique S. Quintana-Ortí. 2015. Exploring the suitability of remote GPGPU virtualization for the OpenACC programming model using rCUDA. In Proceedings of the 2015 IEEE International Conference on Cluster Computing. IEEE, 92-95.
- (2015) Proceedings of the 2015 IEEE International Conference on Cluster Computing , pp. 92-95
- Castelló, A.¹ Peña, A.J.² Mayo, R.³ Balaji, P.⁴ Quintana-Orti, E.S.⁵

16
- 0012526835
- O'Reilly Media, Inc
- Ethan Cerami. 2002. Web Services Essentials: Distributed Applications with XML-RPC, SOAP, UDDI & WSDL. O'Reilly Media, Inc.
- (2002) Web Services Essentials: Distributed Applications with XML-RPC, SOAP, UDDI & WSDL
- Cerami, E.¹

17
- 84866630668
- The architecture of vmware esxi
- 2008
- Charu Chaubal. 2008. The architecture of vmware esxi. VMware White Pap. 1, 7 (2008).
- (2008) VMware White Pap. , vol.1 , Issue.7
- Chaubal, C.¹

18
- 70649092154
- Rodinia: A benchmark suite for heterogeneous computing
- IEEE
- Shuai Che, Michael Boyer, Jiayuan Meng, David Tarjan, Jeremy W. Sheaffer, Sang-Ha Lee, and Kevin Skadron. 2009. Rodinia: A benchmark suite for heterogeneous computing. In Proceedings of the IEEE International Symposium on Workload Characterization, 2009 (IISWC'09). IEEE, 44-54.
- (2009) Proceedings of the IEEE International Symposium on Workload Characterization, 2009 (IISWC'09) , pp. 44-54
- Che, S.¹ Boyer, M.² Meng, J.³ Tarjan, D.⁴ Sheaffer, J.W.⁵ Lee, S.-H.⁶ Skadron, K.⁷

19
- 77956572197
- VMRPC: A high efficiency and light weight RPC system for virtual machines
- IEEE
- Hao Chen, Lin Shi, and Jianhua Sun. 2010. VMRPC: A high efficiency and light weight RPC system for virtual machines. In Proceedings of the 2010 18th International Workshop on Quality of Service (IWQoS'10). IEEE, 1-9.
- (2010) Proceedings of the 2010 18th International Workshop on Quality of Service (IWQoS'10) , pp. 1-9
- Chen, H.¹ Shi, L.² Sun, J.³

20
- 48349092854
- Sharing data between processes running on different domains in para-virtualized xen
- IEEE
- Yun Chan Cho and Jae Wook Jeon. 2007. Sharing data between processes running on different domains in para-virtualized xen. In Proceedings of the International Conference on Control, Automation and Systems, 2007 (ICCAS'07). IEEE, 1255-1260.
- (2007) Proceedings of the International Conference on Control, Automation and Systems, 2007 (ICCAS'07) , pp. 1255-1260
- Cho, Y.C.¹ Jeon, J.W.²

21
- 80955130221
- Heterogeneous cloud computing
- IEEE
- Steve Crago, Kyle Dunn, Patrick Eads, Lorin Hochstein, Dong-In Kang, Mikyung Kang, Devendra Modium, Karandeep Singh, Jinwoo Suh, and John Paul Walters. 2011. Heterogeneous cloud computing. In Proceedings of the 2011 IEEE International Conference on Cluster Computing. IEEE, 378-385.
- (2011) Proceedings of the 2011 IEEE International Conference on Cluster Computing , pp. 378-385
- Crago, S.¹ Dunn, K.² Eads, P.³ Hochstein, L.⁴ Kang, D.-I.⁵ Kang, M.⁶ Modium, D.⁷ Singh, K.⁸ Suh, J.⁹ Walters, J.P.¹⁰

22
- 71749121484
- Trusted virtual platforms: A key enabler for converged client devices
- 2009
- Chris I. Dalton, David Plaquin, Wolfgang Weidner, Dirk Kuhlmann, Boris Balacheff, and Richard Brown. 2009. Trusted virtual platforms: A key enabler for converged client devices. ACM SIGOPS Operat. Syst. Rev. 43, 1(2009), 36-43.
- (2009) ACM SIGOPS Operat. Syst. Rev. , vol.43 , Issue.1 , pp. 36-43
- Dalton, C.I.¹ Plaquin, D.² Weidner, W.³ Kuhlmann, D.⁴ Balacheff, B.⁵ Brown, R.⁶

23
- 77952273045
- The scalable heterogeneous computing (SHOC) benchmark suite
- ACM
- Anthony Danalis, Gabriel Marin, Collin McCurdy, Jeremy S. Meredith, Philip C. Roth, Kyle Spafford, Vinod Tipparaju, and Jeffrey S. Vetter. 2010. The scalable heterogeneous computing (SHOC) benchmark suite. In Proceedings of the 3rd Workshop on General-Purpose Computation on Graphics Processing Units. ACM, 63-74.
- (2010) Proceedings of the 3rd Workshop on General-purpose Computation on Graphics Processing Units , pp. 63-74
- Danalis, A.¹ Marin, G.² McCurdy, C.³ Meredith, J.S.⁴ Roth, P.C.⁵ Spafford, K.⁶ Tipparaju, V.⁷ Vetter, J.S.⁸

24
- 0002321045
- Design and analysis of a fair queuing algorithm
- A. Demers, S. Keshav, and S. Shenker. 1989. Design and analysis of a fair queuing algorithm. In Proceedings of the ACM Special Interest Group on Data Communication (SIGCOMM'89), Vol. 89.
- (1989) Proceedings of the ACM Special Interest Group on Data Communication (SIGCOMM'89) , vol.89
- Demers, A.¹ Keshav, S.² Shenker, S.³

25
- 84867244965
- Virtualizing general purpose GPUs for high performance cloud computing: An application to a fluid simulator
- IEEE
- Roberto Di Lauro, Flora Giannone, Luigia Ambrosio, and Raffaele Montella. 2012. Virtualizing general purpose GPUs for high performance cloud computing: An application to a fluid simulator. In Proceedings of the 2012 IEEE 10th International Symposium on Parallel and Distributed Processing with Applications (ISPA'12). IEEE, 863-864.
- (2012) Proceedings of the 2012 IEEE 10th International Symposium on Parallel and Distributed Processing with Applications (ISPA'12) , pp. 863-864
- Lauro, R.D.¹ Giannone, F.² Ambrosio, L.³ Montella, R.⁴

26
- 84938626895
- Accelerating option risk analytics in R using GPUs
- Matthew Dixon, Sabbir Ahmed Khan, and Mohammad Zubair. 2014. Accelerating option risk analytics in R using GPUs. In Proceedings of the High Performance Computing Symposium. Society for Computer Simulation International, 24.
- (2014) Proceedings of the High Performance Computing Symposium. Society for Computer Simulation International , pp. 24
- Dixon, M.¹ Khan, S.A.² Zubair, M.³

27
- 85069163055
- Boosting GPU virtualization performance with hybrid shadow page tables
- Yaozu Dong, Mochi Xue, Xiao Zheng, Jiajun Wang, Zhengwei Qi, and Haibing Guan. 2015. Boosting GPU virtualization performance with hybrid shadow page tables. In Proceedings of the 2015 USENIX Annual Technical Conference (USENIX ATC'15). 517-528.
- (2015) Proceedings of the 2015 USENIX Annual Technical Conference (USENIX ATC'15) , pp. 517-528
- Dong, Y.¹ Xue, M.² Zheng, X.³ Wang, J.⁴ Qi, Z.⁵ Guan, H.⁶

28
- 84866114929
- High performance network virtualization with SR-IOV
- 2012
- Yaozu Dong, Xiaowei Yang, Jianhui Li, Guangdeng Liao, Kun Tian, and Haibing Guan. 2012. High performance network virtualization with SR-IOV. J. Parallel Distrib. Comput. 72, 11(2012), 1471-1480.
- (2012) J. Parallel Distrib. Comput. , vol.72 , Issue.11 , pp. 1471-1480
- Dong, Y.¹ Yang, X.² Li, J.³ Liao, G.⁴ Tian, K.⁵ Guan, H.⁶

29
- 0042674307
- The LINPACK benchmark: Past, present and future
- 2003
- Jack J. Dongarra, Piotr Luszczek, and Antoine Petitet. 2003. The LINPACK benchmark: Past, present and future. Concurr. Comput.: Pract. Exper. 15, 9(2003), 803-820.
- (2003) Concurr. Comput.: Pract. Exper. , vol.15 , Issue.9 , pp. 803-820
- Dongarra, J.J.¹ Luszczek, P.² Petitet, A.³

30
- 77952266871
- GPU virtualization on VMware's hosted I/O architecture
- 2009
- Micah Dowty and Jeremy Sugerman. 2009. GPU virtualization on VMware's hosted I/O architecture. ACM SIGOPS Operat. Syst. Rev. 43, 3(2009), 73-82.
- (2009) ACM SIGOPS Operat. Syst. Rev. , vol.43 , Issue.3 , pp. 73-82
- Dowty, M.¹ Sugerman, J.²

31
- 77954589384
- An efficient implementation of GPU virtualization in high performance clusters
- Springer
- José Duato, Francisco D. Igual, Rafael Mayo, Antonio J. Peña, Enrique S. Quintana-Ortí, and Federico Silla. 2009. An efficient implementation of GPU virtualization in high performance clusters. In European Conference on Parallel Processing. Springer, 385-394.
- (2009) European Conference on Parallel Processing , pp. 385-394
- Duato, J.¹ Igual, F.D.² Mayo, R.³ Peña, A.J.⁴ Quintana-Orti, E.S.⁵ Silla, F.⁶

32
- 84858051188
- Enabling CUDA acceleration within virtual machines using rCUDA
- IEEE
- José Duato, Antonio J. Peña, Federico Silla, Juan C. Fernandez, Rafael Mayo, and Enrique S. Quintana-Ortí. 2011. Enabling CUDA acceleration within virtual machines using rCUDA. In Proceedings of the 2011 18th International Conference on High Performance Computing (HiPC'11). IEEE, 1-10.
- (2011) Proceedings of the 2011 18th International Conference on High Performance Computing (HiPC'11) , pp. 1-10
- Duato, J.¹ Peña, A.J.² Silla, F.³ Fernandez, J.C.⁴ Mayo, R.⁵ Quintana-Orti, E.S.⁶

33
- 78650853478
- Modeling the CUDA remoting virtualization behaviour in high performance networks
- José Duato, Antonio J. Peña, Federico Silla, Rafael Mayo, and Enrique S. Quintana-Orti. 2010a. Modeling the CUDA remoting virtualization behaviour in high performance networks. In Proceedings of the 1st Workshop on Language, Compiler, and Architecture Support for GPGPU.
- (2010) Proceedings of the 1st Workshop on Language, Compiler, and Architecture Support for GPGPU
- Duato, J.¹ Peña, A.J.² Silla, F.³ Mayo, R.⁴ Quintana-Orti, E.S.⁵

34
- 77956946040
- RCUDA: Reducing the number of GPU-based accelerators in high performance clusters
- IEEE
- José Duato, Antonio J. Peña, Federico Silla, Rafael Mayo, and Enrique S. Quintana-Ortí. 2010b. rCUDA: Reducing the number of GPU-based accelerators in high performance clusters. In Proceedings of the 2010 International Conference on High Performance Computing and Simulation (HPCS'10). IEEE, 224-231.
- (2010) Proceedings of the 2010 International Conference on High Performance Computing and Simulation (HPCS'10) , pp. 224-231
- Duato, J.¹ Peña, A.J.² Silla, F.³ Mayo, R.⁴ Quintana-Orti, E.S.⁵

35
- 80155140345
- Performance of CUDA virtualized remote GPUsin high performance clusters
- IEEE
- José Duato, Antonio J. Peña, Federico Silla, Rafael Mayo, and Enrique S Quintana-Ortí. 2011. Performance of CUDA virtualized remote GPUsin high performance clusters. In Proceedings of the 2011 International Conference on Parallel Processing (ICPP'11). IEEE, 365-374.
- (2011) Proceedings of the 2011 International Conference on Parallel Processing (ICPP'11) , pp. 365-374
- Duato, J.¹ Peña, A.J.² Silla, F.³ Mayo, R.⁴ Quintana-Orti, E.S.⁵

36
- 84862921993
- Ph. D. Dissertation. Citeseer
- Ashok Dwarakinath. 2008. A Fair-Share Scheduler for the Graphics Processing Unit. Ph. D. Dissertation. Citeseer.
- (2008) A Fair-share Scheduler for the Graphics Processing Unit
- Dwarakinath, A.¹

37
- 84880327377
- Generalpurpose computation on GPUs for high performance cloud computing
- 2013
- Roberto R. Expósito, Guillermo L. Taboada, Sabela Ramos, Juan Touriño, and Ramón Doallo. 2013. Generalpurpose computation on GPUs for high performance cloud computing. Concurr. Comput.: Pract. Exper. 25, 12(2013), 1628-1642.
- (2013) Concurr. Comput.: Pract. Exper. , vol.25 , Issue.12 , pp. 1628-1642
- Expósito, R.R.¹ Taboada, G.L.² Ramos, S.³ Touriño, J.⁴ Doallo, R.⁵

38
- 84963771830
- Affinityaware work-stealing for integrated CPU-GPU processors
- ACM
- Naila Farooqui, Rajkishore Barik, Brian T. Lewis, Tatiana Shpeisman, and Karsten Schwan. 2016. Affinityaware work-stealing for integrated CPU-GPU processors. In Proceedings of the 21st ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming. ACM, 30.
- (2016) Proceedings of the 21st ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming , pp. 30
- Farooqui, N.¹ Barik, R.² Lewis, B.T.³ Shpeisman, T.⁴ Schwan, K.⁵

39
- 84969960811
- Retrieved from, 2014
- Denis Foley. 2014. NVLink, pascal and stacked memory: Feeding the appetite for big data. Retrieved from Nvidia.com (2014).
- (2014) NVLink, Pascal and Stacked Memory: Feeding the Appetite for Big Data
- Foley, D.¹

40
- 85027031597
- 1998
- Futuremark. 1998. 3DMark Benchmarks-See the Current Range of this Popular PC Graphics Card Test. Retrieved from http://www.futuremark.com/benchmarks/3dmark/all?-ga=1.168926249.987441096. 1470653002. (1998).
- (1998) 3DMark Benchmarks-see the Current Range of This Popular PC Graphics Card Test

41
- 84944386303
- When virtual is harder than real: Security challenges in virtual machine based computing environments
- Tal Garfinkel and Mendel Rosenblum. 2005. When virtual is harder than real: Security challenges in virtual machine based computing environments. In Proceedings of the Workshop on Hot Topics in Operating Systems (HotOS'05).
- (2005) Proceedings of the Workshop on Hot Topics in Operating Systems (HotOS'05)
- Garfinkel, T.¹ Rosenblum, M.²

42
- 84893642174
- Technical Report. Citeseer
- Carl Gebhardt and Allan Tomlinson. 2010. Challenges for Inter Virtual Machine Communication. Technical Report. Citeseer.
- (2010) Challenges for Inter Virtual Machine Communication
- Gebhardt, C.¹ Tomlinson, A.²

43
- 84874423459
- A GPU accelerated high performance cloud computing infrastructure for grid computing based virtual environmental laboratory
- Springer, Berlin, Heidelberg
- Francisco Giunta, Raffaele Montella, Giuliano Laccetti, Florin Isaila, and F. Blas. 2011. A GPU accelerated high performance cloud computing infrastructure for grid computing based virtual environmental laboratory. Adv. Grid Comput. Lecture Notes in Computer Science. Vol. 6271. Springer, Berlin, Heidelberg, 35-43.
- (2011) Adv. Grid Comput. Lecture Notes in Computer Science. , vol.6271 , pp. 35-43
- Giunta, F.¹ Montella, R.² Laccetti, G.³ Isaila, F.⁴ Blas, F.⁵

44
- 78349273083
- A GPGPU transparent virtualization component for high performance computing clouds
- Springer
- Giulio Giunta, Raffaele Montella, Giuseppe Agrillo, and Giuseppe Coviello. 2010. A GPGPU transparent virtualization component for high performance computing clouds. In Euro-Par 2010-Parallel Processing. Springer, 379-391.
- (2010) Euro-par 2010-parallel Processing , pp. 379-391
- Giunta, G.¹ Montella, R.² Agrillo, G.³ Coviello, G.⁴

45
- 84928049032
- Strong scaling of general-purpose molecular dynamics simulations on GPUs
- 2015
- Jens Glaser, Trung Dac Nguyen, Joshua A. Anderson, Pak Lui, Filippo Spiga, Jaime A. Millan, David C. Morse, and Sharon C. Glotzer. 2015. Strong scaling of general-purpose molecular dynamics simulations on GPUs. Comput. Phys. Commun. 192(2015), 97-107.
- (2015) Comput. Phys. Commun. , vol.192 , pp. 97-107
- Glaser, J.¹ Nguyen, T.D.² Anderson, J.A.³ Lui, P.⁴ Spiga, F.⁵ Millan, J.A.⁶ Morse, D.C.⁷ Glotzer, S.C.⁸

46
- 84926427148
- Survey of virtual machine research
- 1974
- Robert P. Goldberg. 1974. Survey of virtual machine research. Computer 7, 6(1974), 34-45.
- (1974) Computer , vol.7 , Issue.6 , pp. 34-45
- Goldberg, R.P.¹

47
- 84903973686
- LoGV: Lowoverhead GPGPU virtualization
- IEEE
- Mathias Gottschlag, Martin Hillenbrand, Jens Kehne, Jan Stoess, and Frank Bellosa. 2013. LoGV: Lowoverhead GPGPU virtualization. In Proceedings of the 2013 IEEE 10th International Conference on High Performance Computing and Communications & 2013 IEEE International Conference on Embedded and Ubiquitous Computing (HPCC-EUC'13). IEEE, 1721-1726.
- (2013) Proceedings of the 2013 IEEE 10th International Conference on High Performance Computing and Communications & 2013 IEEE International Conference on Embedded and Ubiquitous Computing (HPCC-EUC'13) , pp. 1721-1726
- Gottschlag, M.¹ Hillenbrand, M.² Kehne, J.³ Stoess, J.⁴ Bellosa, F.⁵

48
- 84861018407
- Particle simulation using CUDA
- 2010
- Simon Green. 2010. Particle simulation using cuda. NVIDIA Whitepaper 6(2010), 121-128.
- (2010) NVIDIA Whitepaper , vol.6 , pp. 121-128
- Green, S.¹

49
- 0030243005
- A high-performance, portable implementation of the MPI message passing interface standard
- 1996
- William Gropp, Ewing Lusk, Nathan Doss, and Anthony Skjellum. 1996. A high-performance, portable implementation of the MPI message passing interface standard. Parallel Comput. 22, 6(1996), 789-828.
- (1996) Parallel Comput. , vol.22 , Issue.6 , pp. 789-828
- Gropp, W.¹ Lusk, E.² Doss, N.³ Skjellum, A.⁴

50
- 79951728783
- The opencl specification
- 2008
- Khronos OpenCL Working Group et al. 2008. The opencl specification. Version 1, 29(2008), 8.
- (2008) Version , vol.1 , Issue.29 , pp. 8

51
- 70349123351
- GViM: GPU-accelerated virtual machines
- ACM
- Vishakha Gupta, Ada Gavrilovska, Karsten Schwan, Harshvardhan Kharche, Niraj Tolia, Vanish Talwar, and Parthasarathy Ranganathan. 2009. GViM: GPU-accelerated virtual machines. In Proceedings of the 3rd ACM Workshop on System-level Virtualization for High Performance Computing. ACM, 17-24.
- (2009) Proceedings of the 3rd ACM Workshop on System-level Virtualization for High Performance Computing , pp. 17-24
- Gupta, V.¹ Gavrilovska, A.² Schwan, K.³ Kharche, H.⁴ Tolia, N.⁵ Talwar, V.⁶ Ranganathan, P.⁷

52
- 84939240957
- Energy-efficient SLA guarantees for virtualized GPU in cloud gaming
- 2015
- Haibing Guan, Jianguo Yao, Zhengwei Qi, and Runze Wang. 2015. Energy-efficient SLA guarantees for virtualized GPU in cloud gaming. IEEE Trans-actions on Parallel Distrib. Syst. 26, 9(2015), 2434-2443.
- (2015) IEEE Trans-actions on Parallel Distrib. Syst. , vol.26 , Issue.9 , pp. 2434-2443
- Guan, H.¹ Yao, J.² Qi, Z.³ Wang, R.⁴

53
- 85077044984
- Pegasus: Coordinated scheduling for virtualized accelerator-based systems
- Vishakha Gupta, Karsten Schwan, Niraj Tolia, Vanish Talwar, and Parthasarathy Ranganathan. 2011. Pegasus: Coordinated scheduling for virtualized accelerator-based systems. In Proceedings of the 2011 USENIX Annual Technical Conference (USENIX ATC'11). 31.
- (2011) Proceedings of the 2011 USENIX Annual Technical Conference (USENIX ATC'11) , pp. 31
- Gupta, V.¹ Schwan, K.² Tolia, N.³ Talwar, V.⁴ Ranganathan, P.⁵

54
- 84899111809
- Haswell: The fourth-generation intel core processor
- 2014
- Per Hammarlund, Alberto J. Martinez, Atiq A. Bajwa, David L. Hill, Erik Hallnor, Hong Jiang, Martin Dixon, Michael Derr, Mikal Hunsaker, Rajesh Kumar, et al. 2014. Haswell: The fourth-generation intel core processor. IEEE Micro 34, 2(2014), 6-20.
- (2014) IEEE Micro , vol.34 , Issue.2 , pp. 6-20
- Hammarlund, P.¹ Martinez, A.J.² Bajwa, A.A.³ Hill, D.L.⁴ Hallnor, E.⁵ Jiang, H.⁶ Dixon, M.⁷ Derr, M.⁸ Hunsaker, M.⁹ Kumar, R.¹⁰

55
- 84904495073
- Blink: Advanced display multiplexing for virtualized applications
- Jacob Gorm Hansen. 2007. Blink: Advanced display multiplexing for virtualized applications. In Proceedings of the SIGMM Workshop on Network and Operating Systems Support for Digital Audio and Video (NOSSDAV'07).
- (2007) Proceedings of the SIGMM Workshop on Network and Operating Systems Support for Digital Audio and Video (NOSSDAV'07)
- Hansen, J.G.¹

56
- 85077195492
- Efficient and scalable paravirtual I/O system
- Nadav Har'El, Abel Gordon, Alex Landau, Muli Ben-Yehuda, Avishay Traeger, and Razya Ladelsky. 2013. Efficient and scalable paravirtual I/O system. In Proceedings of the USENIX Annual Technical Conference. 231-242.
- (2013) Proceedings of the USENIX Annual Technical Conference , pp. 231-242
- Har'El, N.¹ Gordon, A.² Landau, A.³ Ben-Yehuda, M.⁴ Traeger, A.⁵ Ladelsky, R.⁶

57
- 84965031169
- Enhancing the usability and utilization of accelerated architectures via docker
- IEEE
- Nicholas Haydel, Sandra Gesing, Ian Taylor, Gregory Madey, Abdul Dakkak, Simon Garcia De Gonzalo, and Wen-Mei W. Hwu. 2015. Enhancing the usability and utilization of accelerated architectures via docker. In Proceedings of the 2015 IEEE/ACM 8th International Conference on Utility and Cloud Computing (UCC'15). IEEE, 361-367.
- (2015) Proceedings of the 2015 IEEE/ACM 8th International Conference on Utility and Cloud Computing (UCC'15) , pp. 361-367
- Haydel, N.¹ Gesing, S.² Taylor, I.³ Madey, G.⁴ Dakkak, A.⁵ De Gonzalo, S.G.⁶ Hwu, W.W.⁷

58
- 84965004939
- NVIDIA GRID: Graphics accelerated VDI with the visual performance of a workstation
- 2014
- Alex Herrera. 2014. NVIDIA GRID: Graphics accelerated VDI with the visual performance of a workstation. Nvidia Corp (2014). http://www.nvidia.com/content/grid/vdi-whitepaper.pdf.
- (2014) NVIDIA Corp
- Herrera, A.¹

59
- 85083940224
- GPU consolidation for cloud games: Are we there yet?
- IEEE Press
- Hua-Jun. Hong, Tao-Ya Fan-Chiang, Che-Run Lee, Kuan-Ta Chen, Chun-Ying Huang, and Cheng-Hsin Hsu. 2014. GPU consolidation for cloud games: Are we there yet?. In Proceedings of the 13th Annual Workshop on Network and Systems Support for Games. IEEE Press, 3.
- (2014) Proceedings of the 13th Annual Workshop on Network and Systems Support for Games , pp. 3
- Hong, H.-J.¹ Fan-Chiang, T.-Y.² Lee, C.-R.³ Chen, K.-T.⁴ Huang, C.-Y.⁵ Hsu, C.-H.⁶

60
- 84976553176
- Building a KVM-based hypervisor for a heterogeneous system architecture compliant system
- ACM
- Yu-Ju Huang, Hsuan-Heng Wu, Yeh-Ching Chung, and Wei-Chung Hsu. 2016. Building a KVM-based hypervisor for a heterogeneous system architecture compliant system. In Proceedings of the12th ACM SIGPLAN/SIGOPS International Conference on Virtual Execution Environments. ACM, 3-15.
- (2016) Proceedings of The12th ACM SIGPLAN/SIGOPS International Conference on Virtual Execution Environments , pp. 3-15
- Huang, Y.-J.¹ Wu, H.-H.² Chung, Y.-C.³ Hsu, W.-C.⁴

61
- 0036993236
- Chromium: A stream-processing framework for interactive rendering on clusters
- ACM
- Greg Humphreys, Mike Houston, Ren Ng, Randall Frank, Sean Ahern, Peter D. Kirchner, and James T. Klosowski. 2002. Chromium: A stream-processing framework for interactive rendering on clusters. In ACM Transactions on Graphics, Vol. 21. ACM, 693-702.
- (2002) ACM Transactions on Graphics , vol.21 , pp. 693-702
- Humphreys, G.¹ Houston, M.² Ng, R.³ Frank, R.⁴ Ahern, S.⁵ Kirchner, P.D.⁶ Klosowski, J.T.⁷

62
- 84878588753
- Client rendering method for desktop virtualization services
- 2013
- Su Min Jang, Won Hyuk Choi, and Won Young Kim. 2013. Client rendering method for desktop virtualization services. ETRI J. 35, 2(2013), 348-351.
- (2013) ETRI J. , vol.35 , Issue.2 , pp. 348-351
- Jang, S.M.¹ Choi, W.H.² Kim, W.Y.³

63
- 59049085159
- Predictive runtime code scheduling for heterogeneous architectures
- Springer
- Víctor J. Jiménez, Lluís Vilanova, Isaac Gelado, Marisa Gil, Grigori Fursin, and Nacho Navarro. 2009. Predictive runtime code scheduling for heterogeneous architectures. In High Performance Embedded Architectures and Compilers. Springer, 19-33.
- (2009) High Performance Embedded Architectures and Compilers , pp. 19-33
- Jiménez, V.J.¹ Vilanova, L.² Gelado, I.³ Gil, M.⁴ Fursin, G.⁵ Navarro, N.⁶

64
- 84878141243
- Exploiting GPUs invirtual machine for biocloud
- 2013 2013
- Heeseung Jo, Jinkyu Jeong, Myoungho Lee, and Dong Hoon Choi. 2013a. Exploiting GPUs invirtual machine for biocloud. BioMed Res. Int. 2013 (2013).
- (2013) BioMed Res. Int
- Jo, H.¹ Jeong, J.² Lee, M.³ Choi, D.H.⁴

65
- 84874986898
- GPU virtualization using PCI direct pass-through
- Trans Tech Publ
- Hee Seung Jo, Myung Ho Lee, and Dong Hoon Choi. 2013b. GPU virtualization using PCI direct pass-through. In Applied Mechanics and Materials, Vol. 311. Trans Tech Publ, 15-19.
- (2013) Applied Mechanics and Materials , vol.311 , pp. 15-19
- Jo, H.S.¹ Lee, M.H.² Choi, D.H.³

66
- 84899707201
- David Kanter. 2010. Intels sandy bridge microarchitecture. http://www.realworldtech.com/sandy-bridge/.
- (2010) Intels Sandy Bridge Microarchitecture
- Kanter, D.¹

67
- 84892554434
- Livermore, CA 2013
- Ian Karlin, Jeff Keasler, and Rob Neely. 2013. Lulesh 2.0 updates and changes. Livermore, CA (2013). https://codesign. llnl.gov/lulesh.php.
- (2013) Lulesh 2.0 Updates and Changes
- Karlin, I.¹ Keasler, J.² Neely, R.³

68
- 84863067873
- Operating systems challenges for GPU resource management
- Shinpei Kato, Scott Brandt, Yutaka Ishikawa, and R. Rajkumar. 2011a. Operating systems challenges for GPU resource management. In Proceedings of the International Workshop on Operating Systems Platforms for Embedded Real-Time Applications. 23-32.
- (2011) Proceedings of the International Workshop on Operating Systems Platforms for Embedded Real-time Applications , pp. 23-32
- Kato, S.¹ Brandt, S.² Ishikawa, Y.³ Rajkumar, R.⁴

69
- 79957590650
- Resource sharing in GPU-accelerated windowing systems
- IEEE
- Shinpei Kato, Karthik Lakshmanan, Yutaka Ishikawa, and Ragunathan Rajkumar. 2011b. Resource sharing in GPU-accelerated windowing systems. In Proceedings of the 2011 17th IEEE Real-Time and Embedded Technology and Applications Symposium (RTAS'11). IEEE, 191-200.
- (2011) Proceedings of the 2011 17th IEEE Real-time and Embedded Technology and Applications Symposium (RTAS'11) , pp. 191-200
- Kato, S.¹ Lakshmanan, K.² Ishikawa, Y.³ Rajkumar, R.⁴

70
- 84863015834
- RGEM: A responsive GPGPU execution model for runtime engines
- IEEE
- Shinpei Kato, Karthik Lakshmanan, Aman Kumar, Mihir Kelkar, Yutaka Ishikawa, and Ragunathan Rajkumar. 2011c. RGEM: A responsive GPGPU execution model for runtime engines. In Proceedings of the 2011 IEEE 32nd Real-Time Systems Symposium (RTSS'11). IEEE, 57-66.
- (2011) Proceedings of the 2011 IEEE 32nd Real-time Systems Symposium (RTSS'11) , pp. 57-66
- Kato, S.¹ Lakshmanan, K.² Kumar, A.³ Kelkar, M.⁴ Ishikawa, Y.⁵ Rajkumar, R.⁶

71
- 85077032008
- TimeGraph: GPU scheduling for real-time multi-tasking environments
- Shinpei Kato, Karthik Lakshmanan, Raj Rajkumar, and Yutaka Ishikawa. 2011d. TimeGraph: GPU scheduling for real-time multi-tasking environments. In Proceedings of the 2011 USENIX Annual Technical Conference (USENIX ATC'11). 17.
- (2011) Proceedings of the 2011 USENIX Annual Technical Conference (USENIX ATC'11) , pp. 17
- Kato, S.¹ Lakshmanan, K.² Rajkumar, R.³ Ishikawa, Y.⁴

72
- 85077122204
- Gdev: First-class GPU resource management in the operating system
- Shinpei Kato, Michael McThrow, Carlos Maltzahn, and Scott A. Brandt. 2012. Gdev: First-class GPU resource management in the operating system.. In Proceedings of the USENIX Annual Technical Conference (USENIX ATC'11). 401-412.
- (2012) Proceedings of the USENIX Annual Technical Conference (USENIX ATC'11) , pp. 401-412
- Kato, S.¹ McThrow, M.² Maltzahn, C.³ Brandt, S.A.⁴

73
- 84899964619
- Secure device access for automotive software
- IEEE
- Se Won Kim, Chiyoung Lee, MooWoong Jeon, Hae Young Kwon, Hyun Woo Lee, and Chuck Yoo. 2013. Secure device access for automotive software. In Proceedings of the 2013 International Conference on Connected Vehicles and Expo (ICCVE'13). IEEE, 177-181.
- (2013) Proceedings of the 2013 International Conference on Connected Vehicles and Expo (ICCVE'13) , pp. 177-181
- Kim, S.W.¹ Lee, C.² Jeon, M.³ Kwon, H.Y.⁴ Lee, H.W.⁵ Yoo, C.⁶

74
- 85102984928
- Programming massively parallel processors: A hands-on approach
- David B. Kirk and W. Hwu Wen-mei. 2012. Programming Massively Parallel Processors: A Hands-on Approach. Newnes.
- (2012) Newnes
- Kirk, D.B.¹ Wen-Mei, W.H.²

75
- 54049158076
- Kvm: The Linux virtual machine monitor
- Avi Kivity, Yaniv Kamay, Dor Laor, Uri Lublin, and Anthony Liguori. 2007. kvm: The Linux virtual machine monitor. In Proceedings of the Linux Symposium, Vol. 1. 225-230.
- (2007) Proceedings of the Linux Symposium , vol.1 , pp. 225-230
- Kivity, A.¹ Kamay, Y.² Laor, D.³ Lublin, U.⁴ Liguori, A.⁵

76
- 77952125596
- Westmere: A family of 32nm IA processors
- Nasser A. Kurd, Subramani Bhamidipati, Christopher Mozak, Jeffrey L. Miller, Timothy M. Wilson, Mahadev Nemani, and Muntaquim Chowdhury. 2010. Westmere: A family of 32nm IA processors. In Proceedings of the 2010 IEEE International Solid-State Circuits Conference (ISSCC'10).
- (2010) Proceedings of the 2010 IEEE International Solid-state Circuits Conference (ISSCC'10)
- Kurd, N.A.¹ Bhamidipati, S.² Mozak, C.³ Miller, J.L.⁴ Wilson, T.M.⁵ Nemani, M.⁶ Chowdhury, M.⁷

77
- 85026999740
- issued date: July 5 2011. Patent, Filed date: Feb. 25, 2009
- Maxim A. Kuzkin and Alexander G. Tormasov. 2011. Method and system for remote device access in virtual environment. (issued date: July 5 2011). Patent No. 7, 975, 017. Filed date: Feb. 25, 2009.
- (2011) Method and System for Remote Device Access in Virtual Environment
- Kuzkin, M.A.¹ Tormasov, A.G.²

78
- 84888133920
- Heterogeneous system architecture: A technical review
- 2012
- George Kyriazis. 2012. Heterogeneous system architecture: A technical review. In Proceedings of the AMD Fusion Developer Summit (2012).
- (2012) Proceedings of the AMD Fusion Developer Summit
- Kyriazis, G.¹

79
- 84901266785
- The high performance internet of things: Using GVirtuS to share high-end GPUs with ARM based cluster computing nodes
- Springer
- Giuliano Laccetti, Raffaele Montella, Carlo Palmieri, and Valentina Pelliccia. 2013. The high performance internet of things: Using GVirtuS to share high-end GPUs with ARM based cluster computing nodes. In International Conference on Parallel Processing and Applied Mathematics. Springer, 734-744.
- (2013) International Conference on Parallel Processing and Applied Mathematics , pp. 734-744
- Laccetti, G.¹ Montella, R.² Palmieri, C.³ Pelliccia, V.⁴

80
- 35448945624
- VMM-independent graphics acceleration
- ACM
- H. Andrés Lagar-Cavilla, Niraj Tolia, Mahadev Satyanarayanan, and Eyal De Lara. 2007. VMM-independent graphics acceleration. In Proceedings of the 3rd International Conference on Virtual Execution Environments. ACM, 33-43.
- (2007) Proceedings of the 3rd International Conference on Virtual Execution Environments , pp. 33-43
- Lagar-Cavilla, H.A.¹ Tolia, N.² Satyanarayanan, M.³ De Lara, E.⁴

81
- 84893324240
- PVOCL: Power-aware dynamic placement and migration in virtualized GPU environments
- IEEE
- Palden Lama, Yan Li, Ashwin M. Aji, Pavan Balaji, James Dinan, Shucai Xiao, Yunquan Zhang, Wu-chun Feng, Rajeev Thakur, and Xiaobo Zhou. 2013. pVOCL: Power-aware dynamic placement and migration in virtualized GPU environments. In Proceedings of the 2013 IEEE 33rd International Conference on Distributed Computing Systems (ICDCS'13). IEEE, 145-154.
- (2013) Proceedings of the 2013 IEEE 33rd International Conference on Distributed Computing Systems (ICDCS'13) , pp. 145-154
- Lama, P.¹ Li, Y.² Aji, A.M.³ Balaji, P.⁴ Dinan, J.⁵ Xiao, S.⁶ Zhang, Y.⁷ Feng, W.-C.⁸ Thakur, R.⁹ Zhou, X.¹⁰

82
- 84902458624
- Michael Larabel and M. Tippett. 2011. Phoronix test suite. https://www.phoronix-test-suite.com.
- (2011) Phoronix Test Suite
- Larabel, M.¹ Tippett, M.²

83
- 84962521288
- VADI: GPU virtualization for an automotive platform
- 2016
- Chiyoung Lee, Se-Won Kim, and Chuck Yoo. 2016. VADI: GPU virtualization for an automotive platform. IEEE Trans. Industr. Inf. 12, 1(2016), 277-290.
- (2016) IEEE Trans. Industr. Inf. , vol.12 , Issue.1 , pp. 277-290
- Lee, C.¹ Kim, S.-W.² Yoo, C.³

84
- 85088775944
- Heterogeneity-aware resource allocation and scheduling in the cloud
- Gunho Lee and Randy H. Katz. 2011. Heterogeneity-aware resource allocation and scheduling in the cloud.. In Proceedings of the 3rd USENIX Workshop on Hot Topics in Cloud Computing (HotCloud'11).
- (2011) Proceedings of the 3rd USENIX Workshop on Hot Topics in Cloud Computing (HotCloud'11)
- Lee, G.¹ Katz, R.H.²

85
- 80155183121
- GPU resource sharing and virtualization on high performance computing systems
- IEEE
- Teng Li, Vikram K. Narayana, Esam El-Araby, and Tarek El-Ghazawi. 2011. GPU resource sharing and virtualization on high performance computing systems. In Proceedings of the 2011 International Conference on Parallel Processing (ICPP'11). IEEE, 733-742.
- (2011) Proceedings of the 2011 International Conference on Parallel Processing (ICPP'11) , pp. 733-742
- Li, T.¹ Narayana, V.K.² El-Araby, E.³ El-Ghazawi, T.⁴

86
- 84862704846
- Accelerated high-performance computing through efficient multi-process GPU resource sharing
- ACM
- Teng Li, Vikram K. Narayana, and Tarek El-Ghazawi. 2012. Accelerated high-performance computing through efficient multi-process GPU resource sharing. In Proceedings of the 9th Conference on Computing Frontiers. ACM, 269-272.
- (2012) Proceedings of the 9th Conference on Computing Frontiers , pp. 269-272
- Li, T.¹ Narayana, V.K.² El-Ghazawi, T.³

87
- 84941215614
- An evaluation of unified memory technology on NVIDIA GPUs
- IEEE
- Wenqiang Li, Guanghao Jin, Xuewen Cui, and Simon See. 2015. An evaluation of unified memory technology on nvidia gpus. In Proceedings of the 2015 15th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid'15). IEEE, 1092-1098.
- (2015) Proceedings of the 2015 15th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid'15) , pp. 1092-1098
- Li, W.¹ Jin, G.² Cui, X.³ See, S.⁴

88
- 79957572280
- GridCuda: A grid-enabled CUDA programming toolkit
- IEEE
- Tyng-Yeu Liang and Yu-Wei Chang. 2011. GridCuda: A grid-enabled CUDA programming toolkit. In Proceedings of the 2011 IEEE Workshops of International Conference on Advanced Information Networking and Applications (WAINA'11). IEEE, 141-146.
- (2011) Proceedings of the 2011 IEEE Workshops of International Conference on Advanced Information Networking and Applications (WAINA'11) , pp. 141-146
- Liang, T.-Y.¹ Chang, Y.-W.²

89
- 84968735868
- Portable and transparent software managed scheduling on accelerators for fair resource sharing
- ACM
- Christos Margiolas and Michael F. P. O'Boyle. 2016. Portable and transparent software managed scheduling on accelerators for fair resource sharing. In Proceedings of the 2016 International Symposium on Code Generation and Optimization. ACM, 82-93.
- (2016) Proceedings of the 2016 International Symposium on Code Generation and Optimization , pp. 82-93
- Margiolas, C.¹ O'Boyle, M.F.P.²

90
- 84965001845
- Enabling OS research by inferring interactions in the black-box GPU stack
- Konstantinos Menychtas, Kai Shen, and Michael L. Scott. 2013. Enabling OS research by inferring interactions in the black-box GPU stack. In Proceedings of the 2013 USENIX Annual Technical Conference (USENIX ATC'13). 291-296.
- (2013) Proceedings of the 2013 USENIX Annual Technical Conference (USENIX ATC'13) , pp. 291-296
- Menychtas, K.¹ Shen, K.² Scott, M.L.³

91
- 84897749415
- Disengaged scheduling for fair, protected access to fast computational accelerators
- ACM
- Konstantinos Menychtas, Kai Shen, and Michael L. Scott. 2014. Disengaged scheduling for fair, protected access to fast computational accelerators. In ACM SIGPLAN Notices, Vol. 49. ACM, 301-316.
- (2014) ACM SIGPLAN Notices , vol.49 , pp. 301-316
- Menychtas, K.¹ Shen, K.² Scott, M.L.³

92
- 79960181836
- Shadowfax: Scaling in heterogeneous cluster systems via GPGPU assemblies
- ACM
- Alexander M. Merritt, Vishakha Gupta, Abhishek Verma, Ada Gavrilovska, and Karsten Schwan. 2011. Shadowfax: Scaling in heterogeneous cluster systems via GPGPU assemblies. In Proceedings of the 5th International Workshop on Virtualization Technologies in Distributed Computing. ACM, 3-10.
- (2011) Proceedings of the 5th International Workshop on Virtualization Technologies in Distributed Computing , pp. 3-10
- Merritt, A.M.¹ Gupta, V.² Verma, A.³ Gavrilovska, A.⁴ Schwan, K.⁵

93
- 84907440423
- A survey of methods for analyzing and improving GPU energy efficiency
- 2015
- Sparsh Mittal and Jeffrey S. Vetter. 2015. A survey of methods for analyzing and improving GPU energy efficiency. ACM Comput. Surv. 47, 2(2015), 19.
- (2015) ACM Comput. Surv. , vol.47 , Issue.2 , pp. 19
- Mittal, S.¹ Vetter, J.S.²

94
- 84865204787
- A general-purpose virtualization service for HPC on cloud computing: An application to GPUs
- Springer
- Raffaele Montella, Giuseppe Coviello, Giulio Giunta, Giuliano Laccetti, Florin Isaila, and Javier Garcia Blas. 2011. A general-purpose virtualization service for HPC on cloud computing: An application to GPUs. In International Conference on Parallel Processing and Applied Mathematics. Springer, 740-749.
- (2011) International Conference on Parallel Processing and Applied Mathematics , pp. 740-749
- Montella, R.¹ Coviello, G.² Giunta, G.³ Laccetti, G.⁴ Isaila, F.⁵ Blas, J.G.⁶

95
- 84896394744
- Virtualizing high-end GPGPUs on ARM clusters for the next generation of high performance cloud computing
- 2014
- Raffaele Montella, Giulio Giunta, and Giuliano Laccetti. 2014. Virtualizing high-end GPGPUs on ARM clusters for the next generation of high performance cloud computing. Cluster Comput. 17, 1(2014), 139-152.
- (2014) Cluster Comput. , vol.17 , Issue.1 , pp. 139-152
- Montella, R.¹ Giunta, G.² Laccetti, G.³

96
- 84964461702
- Virtualizing CUDA enabled GPGPUs on ARM clusters
- Springer
- Raffaele Montella, Giulio Giunta, Giuliano Laccetti, Marco Lapegna, Carlo Palmieri, Carmine Ferraro, and Valentina Pelliccia. 2016a. Virtualizing CUDA enabled GPGPUs on ARM clusters. In Parallel Processing and Applied Mathematics. Springer, 3-14.
- (2016) Parallel Processing and Applied Mathematics , pp. 3-14
- Montella, R.¹ Giunta, G.² Laccetti, G.³ Lapegna, M.⁴ Palmieri, C.⁵ Ferraro, C.⁶ Pelliccia, V.⁷

97
- 84991112040
- On the virtualization of CUDA based GPU remoting on ARM and X86 machines in the GVirtuS framework
- 2016
- Raffaele Montella, Giulio Giunta, Giuliano Laccetti, Marco Lapegna, Carlo Palmieri, Carmine Ferraro, Valentina Pelliccia, Cheol-Ho Hong, Ivor Spence, and Dimitrios S. Nikolopoulos. 2016b. On the virtualization of CUDA based GPU remoting on ARM and X86 machines in the GVirtuS framework. Int. J. Parallel Program. (2016), 1-22. DOI: http://dx.doi.org/10.1007/s10766-016-0462-1
- (2016) Int. J. Parallel Program , pp. 1-22
- Montella, R.¹ Giunta, G.² Laccetti, G.³ Lapegna, M.⁴ Palmieri, C.⁵ Ferraro, C.⁶ Pelliccia, V.⁷ Hong, C.-H.⁸ Spence, I.⁹ Nikolopoulos, D.S.¹⁰

98
- 0038642786
- Non-invasive interactive visualization of dynamic architectural environments
- ACM
- Christopher Niederauer, Mike Houston, Maneesh Agrawala, and Greg Humphreys. 2003. Non-invasive interactive visualization of dynamic architectural environments. In Proceedings of the 2003 Symposium on Interactive 3D Graphics. ACM, 55-58.
- (2003) Proceedings of the 2003 Symposium on Interactive 3D Graphics , pp. 55-58
- Niederauer, C.¹ Houston, M.² Agrawala, M.³ Humphreys, G.⁴

99
- 85027055720
- Nvidia. 2007a. CUDA Code Samples-NVIDIA Developer. Retrieved from https://developer.nvidia.com/cuda-code-samples.
- (2007) CUDA Code Samples-NVIDIA Developer

100
- 85027078971
- NVIDIA. 2012. HyperQ Example. Retrieved from http://docs.nvidia.com/cuda/samples/6-Advanced/simpleHyperQ/doc/HyperQ.pdf.
- (2012) HyperQ Example

101
- 85025673404
- NVIDIA. 2016a. GP100 Pascal Whitepaper. Retrieved from https://images.nvidia.com/content/pdf/tesla/whitepaper/pascal-architecture-whitepaper.pdf.
- (2016) GP100 Pascal Whitepaper

102
- 85027068144
- NVIDIA. 2016b. GPU Cloud Computing Service Providers-NVIDIA. Retrieved from http://www.nvidia.com/object/gpu-cloud-computing-services.html.
- (2016) GPU Cloud Computing Service Providers-NVIDIA

103
- 41649101136
- CUDA Nvidia. 2007b. Compute Unified Device Architecture Programming Guide. http://docs.nvidia.com/cuda/cuda-c-programming-guide/index.html.
- (2007) Compute Unified Device Architecture Programming Guide

104
- 0003727497
- Prentice Hall, Englewood Cliffs, NJ
- Katsuhiko Ogata. 1995. Discrete-Time Control Systems. Vol. 2. Prentice Hall, Englewood Cliffs, NJ.
- (1995) Discrete-time Control Systems. , vol.2
- Ogata, K.¹

105
- 84876533447
- DS-CUDA: A middleware to use many GPUs in the cloud environment
- IEEE
- Masahiro Oikawa, Atsushi Kawai, Keigo Nomura, Koichi Yasuoka, Kenichi Yoshikawa, and Tetsu Narumi. 2012. DS-CUDA: A middleware to use many GPUs in the cloud environment. In Proceedings of the 2012 SC Companion to High Performance Computing, Networking, Storage and Analysis (SCC). IEEE, 1207-1214.
- (2012) Proceedings of the 2012 SC Companion to High Performance Computing, Networking, Storage and Analysis (SCC) , pp. 1207-1214
- Oikawa, M.¹ Kawai, A.² Nomura, K.³ Yasuoka, K.⁴ Yoshikawa, K.⁵ Narumi, T.⁶

106
- 85088778464
- Exploiting hardware heterogeneity within the same instance type of Amazon EC2
- Presented in
- Zhonghong Ou, Hao Zhuang, Jukka K. Nurminen, Antti Ylä-Jääski, and Pan Hui. 2012. Exploiting hardware heterogeneity within the same instance type of Amazon EC2. Presented in the 4th USENIX Workshop on Hot Topics in Cloud Computing (HotCloud).
- (2012) The 4th USENIX Workshop on Hot Topics in Cloud Computing (HotCloud)
- Ou, Z.¹ Zhuang, H.² Nurminen, J.K.³ Ylä-Jääski, A.⁴ Hui, P.⁵

107
- 84897775944
- Operating systems should manage accelerators
- Sankaralingam Panneerselvam and Michael M Swift. 2012. Operating systems should manage accelerators. In Proceedings of the 4th USENIX Workshop on Hot Topics in Parallelism.
- (2012) Proceedings of the 4th USENIX Workshop on Hot Topics in Parallelism
- Panneerselvam, S.¹ Swift, M.M.²

108
- 85077125433
- FIOS: A fair, efficient flash I/O scheduler
- Stan Park and Kai Shen. 2012. FIOS: A fair, efficient flash I/O scheduler. In Proceedings of the 10th USENEX Conference on File and Storage Technologies (FAST'12). 13.
- (2012) Proceedings of the 10th USENEX Conference on File and Storage Technologies (FAST'12) , pp. 13
- Park, S.¹ Shen, K.²

109
- 85027003084
- PathScale. 2012. pathscale/pscnv. Retrieved from https://github.com/pathscale/pscnv.
- (2012) Pathscale/pscnv

110
- 84964432699
- A zero-copy fast channel for interguest and guest-host communication using VirtIO-serial
- IEEE
- Sagar Patni, Jobin George, Pratik Lahoti, and Jibi Abraham. 2015. A zero-copy fast channel for interguest and guest-host communication using VirtIO-serial. In Proceedings of the 2015 1st International Conference on Next Generation Computing Technologies (NGCT'15). IEEE, 6-9.
- (2015) Proceedings of the 2015 1st International Conference on Next Generation Computing Technologies (NGCT'15) , pp. 6-9
- Patni, S.¹ George, J.² Lahoti, P.³ Abraham, J.⁴

111
- 80955152874
- The top 10 innovations in the new NVIDIA fermi architecture, and the top 3 next challenges
- 2009
- David Patterson. 2009. The top 10 innovations in the new NVIDIA fermi architecture, and the top 3 next challenges. NVIDIA Whitepaper 47 (2009).
- (2009) NVIDIA Whitepaper , vol.47
- Patterson, D.¹

112
- 84908669300
- A complete and efficient CUDA-sharing solution for HPC clusters
- 2014
- Antonio J. Peña, Carlos Reaño, Federico Silla, Rafael Mayo, Enrique S. Quintana-Ortí, and José Duato. 2014. A complete and efficient CUDA-sharing solution for HPC clusters. Parallel Comput. 40, 10(2014), 574-588.
- (2014) Parallel Comput. , vol.40 , Issue.10 , pp. 574-588
- Peña, A.J.¹ Reaño, C.² Silla, F.³ Mayo, R.⁴ Quintana-Orti, E.S.⁵ Duato, J.⁶

113
- 84976631372
- Providing CUDA acceleration to KVM virtual machines in InfiniBand Clusters with rCUDA
- Springer
- Ferran Pérez, Carlos Reaño, and Federico Silla. 2016. Providing CUDA acceleration to KVM virtual machines in InfiniBand Clusters with rCUDA. In Distributed Applications and Interoperable Systems. Springer, 82-95.
- (2016) Distributed Applications and Interoperable Systems , pp. 82-95
- Pérez, F.¹ Reaño, C.² Silla, F.³

114
- 0041893747
- Antoine Petitet. 2004. HPL-A portable implementation of the high-performance Linpack benchmark for distributed-memory computers. Retrieved from http://www.netlib-.org/-benchmark/hpl/.
- (2004) HPL-A Portable Implementation of the High-performance Linpack Benchmark for Distributed-memory Computers
- Petitet, A.¹

115
- 27344436659
- Scalable molecular dynamics with NAMD
- 2005
- James C. Phillips, Rosemary Braun, Wei Wang, James Gumbart, Emad Tajkhorshid, Elizabeth Villa, Christophe Chipot, Robert D. Skeel, Laxmikant Kale, and Klaus Schulten. 2005. Scalable molecular dynamics with NAMD. J. Comput. Chem. 26, 16(2005), 1781-1802.
- (2005) J. Comput. Chem. , vol.26 , Issue.16 , pp. 1781-1802
- Phillips, J.C.¹ Braun, R.² Wang, W.³ Gumbart, J.⁴ Tajkhorshid, E.⁵ Villa, E.⁶ Chipot, C.⁷ Skeel, R.D.⁸ Kale, L.⁹ Schulten, K.¹⁰

116
- 84894620801
- LAMMPS-large-scale atomic/molecular massively parallel simulator
- 2007
- Steve Plimpton, Paul Crozier, and Aidan Thompson. 2007. LAMMPS-large-scale atomic/molecular massively parallel simulator. Sandia National Laboratories 18 (2007). http://lammps.sandia.gov.
- (2007) Sandia National Laboratories , vol.18
- Plimpton, S.¹ Crozier, P.² Thompson, A.³

117
- 84963767039
- CUDA acceleration for Xen virtual machines in infiniband clusters with rCUDA
- ACM
- Javier Prades, Carlos Reaño, and Federico Silla. 2016. CUDA acceleration for Xen virtual machines in infiniband clusters with rCUDA. In Proceedings of the 21st ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming. ACM, 35.
- (2016) Proceedings of the 21st ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming , pp. 35
- Prades, J.¹ Reaño, C.² Silla, F.³

118
- 84907339380
- VGRIS: Virtualized GPU resource isolation and scheduling in cloud gaming
- 2014
- Zhengwei Qi, Jianguo Yao, Chao Zhang, Miao Yu, Zhizhou Yang, and Haibing Guan. 2014. VGRIS: Virtualized GPU resource isolation and scheduling in cloud gaming. ACM Trans. Arch. Code Optimiz. 11, 2(2014), 17.
- (2014) ACM Trans. Arch. Code Optimiz. , vol.11 , Issue.2 , pp. 17
- Qi, Z.¹ Yao, J.² Zhang, C.³ Yu, M.⁴ Yang, Z.⁵ Guan, H.⁶

119
- 84905836276
- Toward a paravirtual vRDMA device for VMware ESXi guests
- 2012, 2012
- Adit Ranadive and Bhavesh Davda. 2012. Toward a paravirtual vRDMA device for VMware ESXi guests. VMware Techn. J. 2012 1, 2 (2012).
- (2012) VMware Techn. J , vol.1 , Issue.2
- Ranadive, A.¹ Davda, B.²

120
- 79960506159
- Supporting GPU sharing in cloud environments with a transparent runtime consolidation framework
- ACM
- Vignesh T. Ravi, Michela Becchi, Gagan Agrawal, and Srimat Chakradhar. 2011. Supporting GPU sharing in cloud environments with a transparent runtime consolidation framework. In Proceedings of the 20th International Symposium on High Performance Distributed Computing. ACM, 217-228.
- (2011) Proceedings of the 20th International Symposium on High Performance Distributed Computing , pp. 217-228
- Ravi, V.T.¹ Becchi, M.² Agrawal, G.³ Chakradhar, S.⁴

121
- 84893593068
- Influence of InfiniBand FDR on the performance of remote GPU virtualization
- IEEE
- Carlos Reaño, Rafael Mayo, Enrique S. Quintana-Ortí, Federico Silla, José Duato, and Antonio J. Peña. 2013. Influence of InfiniBand FDR on the performance of remote GPU virtualization. In Proceedings of the 2013 IEEE International Conference on Cluster Computing (CLUSTER'13). IEEE, 1-8.
- (2013) Proceedings of the 2013 IEEE International Conference on Cluster Computing (CLUSTER'13) , pp. 1-8
- Reaño, C.¹ Mayo, R.² Quintana-Orti, E.S.³ Silla, F.⁴ Duato, J.⁵ Peña, A.J.⁶

122
- 84880307341
- Cu2rcu: Towards the complete rcuda remote GPU virtualization and sharing solution
- IEEE
- Carlos Reaño, A. J. Pea, Federico Silla, José Duato, Rafael Mayo, and Enrique S. Quintana-Ortí. 2012. Cu2rcu: Towards the complete rcuda remote gpu virtualization and sharing solution. In Proceedings of the 2012 19th International Conference on High Performance Computing (HiPC'12). IEEE, 1-10.
- (2012) Proceedings of the 2012 19th International Conference on High Performance Computing (HiPC'12) , pp. 1-10
- Reaño, C.¹ Pea, A.J.² Silla, F.³ Duato, J.⁴ Mayo, R.⁵ Quintana-Orti, E.S.⁶

123
- 84959291794
- A performance comparison of CUDA remote GPU virtualization frameworks
- IEEE
- Carlos Reaño and Federico Silla. 2015. A performance comparison of CUDA remote GPU virtualization frameworks. In Proceedings of the 2015 IEEE International Conference on Cluster Computing. IEEE, 488-489.
- (2015) Proceedings of the 2015 IEEE International Conference on Cluster Computing , pp. 488-489
- Reaño, C.¹ Silla, F.²

124
- 84941790414
- Improving the user experience of the rCUDA remote GPU virtualization framework
- 2015
- Carlos Reaño, Federico Silla, Adrián Castelló, Antonio J. Peña, Rafael Mayo, Enrique S Quintana-Ortí, and José Duato. 2015a. Improving the user experience of the rCUDA remote GPU virtualization framework. Concurr. Comput.: Pract. Exper. 27, 14(2015), 3746-3770.
- (2015) Concurr. Comput.: Pract. Exper. , vol.27 , Issue.14 , pp. 3746-3770
- Reaño, C.¹ Silla, F.² Castelló, A.³ Peña, A.J.⁴ Mayo, R.⁵ Quintana-Orti, E.S.⁶ Duato, J.⁷

125
- 84981309714
- Local and remote GPUs perform similar with EDR 100G InfiniBand
- ACM
- Carlos Reaño, Federico Silla, Gilad Shainer, and Scot Schultz. 2015b. Local and remote GPUs perform similar with EDR 100G InfiniBand. In Proceedings of the Industrial Track of the 16th International Middleware Conference. ACM, 4.
- (2015) Proceedings of the Industrial Track of the 16th International Middleware Conference , pp. 4
- Reaño, C.¹ Silla, F.² Shainer, G.³ Schultz, S.⁴

126
- 82655162782
- PTask: Operating system abstractions to manage GPUs as compute devices
- ACM
- Christopher J. Rossbach, Jon Currey, Mark Silberstein, Baishakhi Ray, and Emmett Witchel. 2011. PTask: Operating system abstractions to manage GPUs as compute devices. In Proceedings of the 23rd ACM Symposium on Operating Systems Principles. ACM, 233-248.
- (2011) Proceedings of the 23rd ACM Symposium on Operating Systems Principles , pp. 233-248
- Rossbach, C.J.¹ Currey, J.² Silberstein, M.³ Ray, B.⁴ Witchel, E.⁵

127
- 79951813794
- Cloud and heterogeneous computing solutions exist today for the emerging big data problems in biology
- 2011
- Eric E. Schadt, Michael D. Linderman, Jon Sorenson, Lawrence Lee, and Garry P. Nolan. 2011. Cloud and heterogeneous computing solutions exist today for the emerging big data problems in biology. Nat. Rev. Genet. 12, 3(2011), 224-224.
- (2011) Nat. Rev. Genet. , vol.12 , Issue.3 , pp. 224
- Schadt, E.E.¹ Linderman, M.D.² Sorenson, J.³ Lee, L.⁴ Nolan, G.P.⁵

128
- 84880061770
- Multi-tenancy on GPGPU-based servers
- ACM
- Dipanjan Sengupta, Raghavendra Belapure, and Karsten Schwan. 2013. Multi-tenancy on GPGPU-based servers. In Proceedings of the 7th International Workshop on Virtualization Technologies in Distributed Computing. ACM, 3-10.
- (2013) Proceedings of the 7th International Workshop on Virtualization Technologies in Distributed Computing , pp. 3-10
- Sengupta, D.¹ Belapure, R.² Schwan, K.³

129
- 84936950495
- Scheduling multitenant cloud workloads on accelerator-based systems
- IEEE Press
- Dipanjan Sengupta, Anshuman Goswami, Karsten Schwan, and Krishna Pallavi. 2014. Scheduling multitenant cloud workloads on accelerator-based systems. In Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis. IEEE Press, 513-524.
- (2014) Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis , pp. 513-524
- Sengupta, D.¹ Goswami, A.² Schwan, K.³ Pallavi, K.⁴

130
- 80051667116
- The development of Mellanox/NVIDIA GPUDirect over InfiniBanda new model for GPU to GPU communications
- 2011
- Gilad Shainer, Ali Ayoub, Pak Lui, Tong Liu, Michael Kagan, Christian R. Trott, Greg Scantlen, and Paul S. Crozier. 2011. The development of Mellanox/NVIDIA GPUDirect over InfiniBanda new model for GPU to GPU communications. Comput. Sci. Res. Dev. 26, 3-4(2011), 267-273.
- (2011) Comput. Sci. Res. Dev. , vol.26 , Issue.3-4 , pp. 267-273
- Shainer, G.¹ Ayoub, A.² Lui, P.³ Liu, T.⁴ Kagan, M.⁵ Trott, C.R.⁶ Scantlen, G.⁷ Crozier, P.S.⁸

131
- 84962009774
- XenGT: A software based intel graphics virtualization solution
- Haitao Shan, Kevin Tian, Eddie Dong, and David Cowperthwaite. 2013. XenGT: A software based intel graphics virtualization solution. Proceedings of the Xen Project Developer Summit.
- (2013) Proceedings of the Xen Project Developer Summit
- Shan, H.¹ Tian, K.² Dong, E.³ Cowperthwaite, D.⁴

132
- 84906752934
- On GPU pass-through performance for cloud gaming: Experiments and analysis
- IEEE
- Ryan Shea and Jiangchuan Liu. 2013. On GPU pass-through performance for cloud gaming: Experiments and analysis. In Proceedings of the 2013 12th Annual Workshop on Network and Systems Support for Games (NetGames'13). IEEE, 1-6.
- (2013) Proceedings of the 2013 12th Annual Workshop on Network and Systems Support for Games (NetGames'13) , pp. 1-6
- Shea, R.¹ Liu, J.²

133
- 70450031611
- VCUDA: GPU accelerated high performance computing in virtual machines
- IEEE
- Lin Shi, Hao Chen, and Jianhua Sun. 2009. vCUDA: GPU accelerated high performance computing in virtual machines. In Proceedings of the IEEE International Symposium on Parallel & Distributed Processing, 2009 (IPDPS'09). IEEE, 1-11.
- (2009) Proceedings of the IEEE International Symposium on Parallel & Distributed Processing, 2009 (IPDPS'09) , pp. 1-11
- Shi, L.¹ Chen, H.² Sun, J.³

134
- 84860524424
- VCUDA: GPU-accelerated high-performance computing in virtual machines
- 2012
- Lin Shi, Hao Chen, Jianhua Sun, and Kenli Li. 2012. vCUDA: GPU-accelerated high-performance computing in virtual machines. IEEE Trans. Comput. 61, 6(2012), 804-816.
- (2012) IEEE Trans. Comput. , vol.61 , Issue.6 , pp. 804-816
- Shi, L.¹ Chen, H.² Sun, J.³ Li, K.⁴

135
- 79956161825
- SHARC: A scalable 3D graphics virtual appliance delivery framework in cloud
- 2011
- Weidong Shi, Yang Lu, Zhu Li, and Jonathan Engelsma. 2011. SHARC: A scalable 3D graphics virtual appliance delivery framework in cloud. J. Netw. Comput. Appl. 34, 4(2011), 1078-1087.
- (2011) J. Netw. Comput. Appl. , vol.34 , Issue.4 , pp. 1078-1087
- Shi, W.¹ Lu, Y.² Li, Z.³ Engelsma, J.⁴

136
- 0030171894
- Efficient fair queuing using deficit round-robin
- 1996
- Madhavapeddi Shreedhar and George Varghese. 1996. Efficient fair queuing using deficit round-robin. IEEE/ACM Trans. Netw. 4, 3(1996), 375-385.
- (1996) IEEE/ACM Trans. Netw. , vol.4 , Issue.3 , pp. 375-385
- Shreedhar, M.¹ Varghese, G.²

137
- 0004233425
- Addison-Wesley, Reading, MA
- Abraham Silberschatz, Peter B. Galvin, Greg Gagne, and A. Silberschatz. 1998. Operating System Concepts. Vol. 4. Addison-Wesley, Reading, MA.
- (1998) Operating System Concepts. , vol.4
- Silberschatz, A.¹ Galvin, P.B.² Gagne, G.³ Silberschatz, A.⁴

138
- 85027023449
- KVMGT: A full GPU virtualization solution
- Jike Song, Zhiyuan Lv, and Kevin Tian. 2014. KVMGT: A full GPU virtualization solution. In KVM Forum 2014. http://www.linux-kvm.org/page/KVM-Forum-2014.
- (2014) KVM Forum 2014
- Song, J.¹ Lv, Z.² Tian, K.³

139
- 84873470137
- Parboil: A revised benchmark suite for scientific and commercial throughput computing
- 2012
- John A. Stratton, Christopher Rodrigues, I-Jui Sung, Nady Obeid, Li-Wen Chang, Nasser Anssari, Geng Daniel Liu, and Wen-mei W. Hwu. 2012. Parboil: A revised benchmark suite for scientific and commercial throughput computing. Center for Reliable and High-Performance Computing 127 (2012).
- (2012) Center for Reliable and High-performance Computing , vol.127
- Stratton, J.A.¹ Rodrigues, C.² Sung, I.-J.³ Obeid, N.⁴ Chang, L.-W.⁵ Anssari, N.⁶ Liu, G.D.⁷ Hwu, W.W.⁸

140
- 85077458357
- GPUvm: Why not virtualizing GPUs at the hypervisor?
- Yusuke Suzuki, Shinpei Kato, Hiroshi Yamada, and Kenji Kono. 2014. GPUvm: Why not virtualizing GPUs at the hypervisor?. In Proceedings of the 2014 USENIX Annual Technical Conference (USENIX ATC'14). 109-120.
- (2014) Proceedings of the 2014 USENIX Annual Technical Conference (USENIX ATC'14) , pp. 109-120
- Suzuki, Y.¹ Kato, S.² Yamada, H.³ Kono, K.⁴

141
- 84982095542
- GPUvm: GPU virtualization at the hypervisor
- 2016
- Yusuke Suzuki, Shinpei Kato, Hiroshi Yamada, and Kenji Kono. 2016. Gpuvm: Gpu virtualization at the hypervisor. IEEE Trans. Comput. 65, 9(2016), 2752-2766.
- (2016) IEEE Trans. Comput. , vol.65 , Issue.9 , pp. 2752-2766
- Suzuki, Y.¹ Kato, S.² Yamada, H.³ Kono, K.⁴

142
- 84905509992
- Enabling preemptive multiprogramming on GPUs
- IEEE Press
- Ivan Tanasic, Isaac Gelado, Javier Cabezas, Alex Ramirez, Nacho Navarro, and Mateo Valero. 2014. Enabling preemptive multiprogramming on GPUs. In ACM SIGARCH Computer Architecture News, Vol. 42. IEEE Press, 193-204.
- (2014) ACM SIGARCH Computer Architecture News , vol.42 , pp. 193-204
- Tanasic, I.¹ Gelado, I.² Cabezas, J.³ Ramirez, A.⁴ Navarro, N.⁵ Valero, M.⁶

143
- 85077449318
- A full GPU virtualization solution with mediated pass-through
- Kun Tian, Yaozu Dong, and David Cowperthwaite. 2014. A full GPU virtualization solution with mediated pass-through. In Proceedings of the 2014 USENIX Annual Technical Conference (USENIX ATC'14).
- (2014) Proceedings of the 2014 USENIX Annual Technical Conference (USENIX ATC'14)
- Tian, K.¹ Dong, Y.² Cowperthwaite, D.³

144
- 84897976339
- Enabling OpenCL support for GPGPU in Kernel-based Virtual Machine
- 2014
- Tsan-Rong Tien and Yi-Ping You. 2014. Enabling OpenCL support for GPGPU in Kernel-based Virtual Machine. Softw.: Pract. Exper. 44, 5(2014), 483-510.
- (2014) Softw.: Pract. Exper. , vol.44 , Issue.5 , pp. 483-510
- Tien, T.-R.¹ You, Y.-P.²

145
- 85016284648
- Top500. 2016. TOP500 Supercomputer Sites. Retrieved from https://www.top500.org/list/2016/06/.
- (2016) TOP500 Supercomputer Sites

146
- 20344391930
- Intel virtualization technology
- 2005
- Rich Uhlig, Gil Neiger, Dion Rodgers, Amy L. Santoni, Fernando C. M. Martins, Andrew V. Anderson, Steven M. Bennett, Alain Kagi, Felix H. Leung, and Larry Smith. 2005. Intel virtualization technology. Computer 38, 5(2005), 48-56.
- (2005) Computer , vol.38 , Issue.5 , pp. 48-56
- Uhlig, R.¹ Neiger, G.² Rodgers, D.³ Santoni, A.L.⁴ Martins, F.C.M.⁵ Anderson, A.V.⁶ Bennett, S.M.⁷ Kagi, A.⁸ Leung, F.H.⁹ Smith, L.¹⁰

147
- 33745963937
- Hardware virtualization trends
- Leendert Van Doorn. 2006. Hardware virtualization trends. In Proceedings of the 2nd International ACM/Usenix Conference on Virtual Execution Environments, Vol. 14. 45-45.
- (2006) Proceedings of the 2nd International ACM/Usenix Conference on Virtual Execution Environments , vol.14 , pp. 45
- Van Doorn, L.¹

148
- 33947412760
- New approach to virtualization is a lightweight
- 2006
- Stephen J. Vaughan-Nichols. 2006. New approach to virtualization is a lightweight. Computer 39, 11 (2006).
- (2006) Computer , vol.39 , Issue.11
- Vaughan-Nichols, S.J.¹

149
- 77955887126
- McGraw-Hill, Inc
- Anthony Velte and Toby Velte. 2009. Microsoft Virtualization with Hyper-V. McGraw-Hill, Inc.
- (2009) Microsoft Virtualization with Hyper-V
- Velte, A.¹ Velte, T.²

150
- 84874430498
- An evaluation of CUDA-enabled virtualization solutions
- IEEE
- M. S. Vinaya, Naga Vydyanathan, and Mrugesh Gajjar. 2012. An evaluation of CUDA-enabled virtualization solutions. In Proceedings of the 2012 2nd IEEE International Conference on Parallel Distributed and Grid Computing (PDGC'12). IEEE, 621-626.
- (2012) Proceedings of the 2012 2nd IEEE International Conference on Parallel Distributed and Grid Computing (PDGC'12) , pp. 621-626
- Vinaya, M.S.¹ Vydyanathan, N.² Gajjar, M.³

151
- 84901835158
- GPU virtualization for high performance general purpose computing on the ESX hypervisor
- Lan Vu, Hari Sivaraman, and Rishi Bidarkar. 2014. GPU virtualization for high performance general purpose computing on the ESX hypervisor. In Proceedings of the High Performance Computing Symposium. Society for Computer Simulation International, 2.
- (2014) Proceedings of the High Performance Computing Symposium. Society for Computer Simulation International , pp. 2
- Vu, L.¹ Sivaraman, H.² Bidarkar, R.³

152
- 84919792604
- GPU passthrough performance: A comparison of KVM, Xen, VMWare ESXi, and LXC for CUDA and OpenCL applications
- IEEE
- John Paul Walters, Andrew J. Younge, Dong In Kang, Ke Thia Yao, Mikyung Kang, Stephen P. Crago, and Geoffrey C. Fox. 2014. GPU passthrough performance: A comparison of KVM, Xen, VMWare ESXi, and LXC for CUDA and OpenCL applications. In Proceedings of the 2014 IEEE 7th International Conference on Cloud Computing (CLOUD'14). IEEE, 636-643.
- (2014) Proceedings of the 2014 IEEE 7th International Conference on Cloud Computing (CLOUD'14) , pp. 636-643
- Walters, J.P.¹ Younge, A.J.² Kang, D.I.³ Yao, K.T.⁴ Kang, M.⁵ Crago, S.P.⁶ Fox, G.C.⁷

153
- 84968876675
- A user mode CPU-GPU scheduling framework for hybrid workloads
- 2016
- Bin Wang, Ruhui Ma, Zhengwei Qi, Jianguo Yao, and Haibing Guan. 2016. A user mode CPU-GPU scheduling framework for hybrid workloads. Future Gener. Comput. Syst. 63(2016), 25-36.
- (2016) Future Gener. Comput. Syst. , vol.63 , pp. 25-36
- Wang, B.¹ Ma, R.² Qi, Z.³ Yao, J.⁴ Guan, H.⁵

154
- 57449090446
- XenLoop: A transparent high performance inter-vm network loopback
- ACM
- Jian Wang, Kwame-Lante Wright, and Kartik Gopalan. 2008. XenLoop: A transparent high performance inter-vm network loopback. In Proceedings of the 17th International Symposium on High Performance Distributed Computing. ACM, 109-118.
- (2008) Proceedings of the 17th International Symposium on High Performance Distributed Computing , pp. 109-118
- Wang, J.¹ Wright, K.-L.² Gopalan, K.³

155
- 70349253246
- Trusted computing building blocks for embedded linux-based ARM trustzone platforms
- ACM
- Johannes Winter. 2008. Trusted computing building blocks for embedded linux-based ARM trustzone platforms. In Proceedings of the 3rd ACM Workshop on Scalable Trusted Computing. ACM, 21-30.
- (2008) Proceedings of the 3rd ACM Workshop on Scalable Trusted Computing , pp. 21-30
- Winter, J.¹

156
- 79955435088
- Fermi GF100 GPU architecture
- 2011
- Craig M. Wittenbrink, Emmett Kilgariff, and Arjun Prabhu. 2011. Fermi GF100 GPU architecture. IEEE Micro 2(2011), 50-59.
- (2011) IEEE Micro , vol.2 , pp. 50-59
- Wittenbrink, C.M.¹ Kilgariff, E.² Prabhu, A.³

157
- 0003651470
- Addison-Wesley Longman Publishing Co., Inc
- Mason Woo, Jackie Neider, Tom Davis, and Dave Shreiner. 1999. OpenGL Programming Guide: The Official Guide to Learning OpenGL, Version 1.2. Addison-Wesley Longman Publishing Co., Inc.
- (1999) OpenGL Programming Guide: The Official Guide to Learning OpenGL, Version 1.2
- Woo, M.¹ Neider, J.² Davis, T.³ Shreiner, D.⁴

158
- 79961149915
- Sla-based resource allocation for software as a service provider (saas) in cloud computing environments
- IEEE
- Linlin Wu, Saurabh Kumar Garg, and Rajkumar Buyya. 2011. Sla-based resource allocation for software as a service provider (saas) in cloud computing environments. In Proceedings of the 2011 11th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid'11). IEEE, 195-204.
- (2011) Proceedings of the 2011 11th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid'11) , pp. 195-204
- Wu, L.¹ Garg, S.K.² Buyya, R.³

159
- 85027032075
- Xenproject. 2016. Xen Project Release Features. Retrieved from https://wiki.xenproject.org/wiki/Xen-Project-Release-Features.
- (2016) Xen Project Release Features

160
- 84870656041
- VOCL: An optimized environment for transparent virtualization of graphics processing units
- IEEE
- Shucai Xiao, Pavan Balaji, Qian Zhu, Rajeev Thakur, Susan Coghlan, Heshan Lin, Gaojin Wen, Jue Hong, and Wu-chun Feng. 2012. VOCL: An optimized environment for transparent virtualization of graphics processing units. In Proceedings of the Innovative Parallel Computing (InPar'12). IEEE, 1-12.
- (2012) Proceedings of the Innovative Parallel Computing (InPar'12) , pp. 1-12
- Xiao, S.¹ Balaji, P.² Zhu, Q.³ Thakur, R.⁴ Coghlan, S.⁵ Lin, H.⁶ Wen, G.⁷ Hong, J.⁸ Feng, W.-C.⁹

161
- 85027023624
- X. OrgFoundation. 2011. Nouveau: Accelerated Open Source driver for nVidia cards. Retrieved from https://nouveau.freedesktop.org/wiki/.
- (2011) Nouveau: Accelerated Open Source Driver for NVIDIA Cards

162
- 85029492411
- GScale: Scaling up GPU virtualization with dynamic sharing of graphics memory space
- Mochi Xue, Kun Tian, Yaozu Dong, Jiajun Wang, Zhengwei Qi, Bingsheng He, and Haibing Guan. 2016. gScale: Scaling up GPU virtualization with dynamic sharing of graphics memory space. In Proceedings of the 2016 USENIX Annual Technical Conference (USENIX ATC'16).
- (2016) Proceedings of the 2016 USENIX Annual Technical Conference (USENIX ATC'16)
- Xue, M.¹ Tian, K.² Dong, Y.³ Wang, J.⁴ Qi, Z.⁵ He, B.⁶ Guan, H.⁷

163
- 84898061475
- Implementation of GPU virtualization using PCI pass-through mechanism
- 2014
- Chao-Tung Yang, Jung-Chun Liu, Hsien-Yi Wang, and Ching-Hsien Hsu. 2014. Implementation of GPU virtualization using PCI pass-through mechanism. J. Supercomput. 68, 1(2014), 183-213.
- (2014) J. Supercomput. , vol.68 , Issue.1 , pp. 183-213
- Yang, C.-T.¹ Liu, J.-C.² Wang, H.-Y.³ Hsu, C.-H.⁴

164
- 84871599572
- Using pci pass-through for GPU virtualization with CUDA
- Springer
- Chao-Tung Yang, Hsien-Yi Wang, and Yu-Tso Liu. 2012a. Using pci pass-through for gpu virtualization with cuda. In Network and Parallel Computing. Springer, 445-452.
- (2012) Network and Parallel Computing , pp. 445-452
- Yang, C.-T.¹ Wang, H.-Y.² Liu, Y.-T.³

165
- 84874254048
- On implementation of GPU virtualization using PCI pass-through
- IEEE
- Chao-Tung Yang, Hsien-Yi Wang, Wei-Shen Ou, Yu-Tso Liu, and Ching-Hsien Hsu. 2012b. On implementation of GPU virtualization using PCI pass-through. In Proceedings of the 2012 IEEE 4th International Conference on Cloud Computing Technology and Science (CloudCom'12). IEEE, 711-716.
- (2012) Proceedings of the 2012 IEEE 4th International Conference on Cloud Computing Technology and Science (CloudCom'12) , pp. 711-716
- Yang, C.-T.¹ Wang, H.-Y.² Ou, W.-S.³ Liu, Y.-T.⁴ Hsu, C.-H.⁵

166
- 84883321005
- GPU virtualization support in cloud system
- Springer
- Chih-Yuan Yeh, Chung-Yao Kao, Wei-Shu Hung, Ching-Chi Lin, Pangfeng Liu, Jan-Jan Wu, and Kuang-Chih Liu. 2013. GPU virtualization support in cloud system. In International Conference on Grid and Pervasive Computing. Springer, 423-432.
- (2013) International Conference on Grid and Pervasive Computing , pp. 423-432
- Yeh, C.-Y.¹ Kao, C.-Y.² Hung, W.-S.³ Lin, C.-C.⁴ Liu, P.⁵ Wu, J.-J.⁶ Liu, K.-C.⁷

167
- 84939129434
- VirtCL: A framework for OpenCL device abstraction and management
- ACM
- Yi-Ping You, Hen-Jung Wu, Yeh-Ning Tsai, and Yen-Ting Chao. 2015. VirtCL: A framework for OpenCL device abstraction and management. In ACM SIGPLAN Notices, Vol. 50. ACM, 161-172.
- (2015) ACM SIGPLAN Notices , vol.50 , pp. 161-172
- You, Y.-P.¹ Wu, H.-J.² Tsai, Y.-N.³ Chao, Y.-T.⁴

168
- 84904545023
- Advanced virtualization techniques for high performance cloud cyberinfrastructure
- IEEE
- Andrew J. Younge and Geoffrey C. Fox. 2014. Advanced virtualization techniques for high performance cloud cyberinfrastructure. In Proceedings of the 2014 14th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid'14). IEEE, 583-586.
- (2014) Proceedings of the 2014 14th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid'14) , pp. 583-586
- Younge, A.J.¹ Fox, G.C.²

169
- 84918835088
- Evaluating GPU passthrough in Xen for high performance cloud computing
- IEEE
- Andrew J. Younge, John Paul Walters, Stephen Crago, and Geoffrey C. Fox. 2014. Evaluating GPU passthrough in Xen for high performance cloud computing. In Proceedings of the 2014 IEEE International Parallel & Distributed Processing Symposium Workshops (IPDPSW'14). IEEE, 852-859.
- (2014) Proceedings of the 2014 IEEE International Parallel & Distributed Processing Symposium Workshops (IPDPSW'14) , pp. 852-859
- Younge, A.J.¹ Walters, J.P.² Crago, S.³ Fox, G.C.⁴

170
- 84969718487
- Supporting high performance molecular dynamics in virtualized clusters using IOMMU, SR-IOV, and GPUDirect
- Andrew J. Younge, John Paul Walters, Stephen P. Crago, and Geoffrey C. Fox. 2015. Supporting high performance molecular dynamics in virtualized clusters using IOMMU, SR-IOV, and GPUDirect. In Proceedings of the 11th ACM SIGPLAN/SIGOPS International Conference on Virtual Execution Environments. ACM, 31-38.
- (2015) Proceedings of the 11th ACM SIGPLAN/SIGOPS International Conference on Virtual Execution Environments , pp. 31-38
- Younge, A.J.¹ Walters, J.P.² Crago, S.P.³ Fox, G.C.⁴

171
- 84904498998
- Vgasa: Adaptive scheduling algorithm of virtualized GPU resource in cloud gaming
- 2014
- Chao Zhang, Jianguo Yao, Zhengwei Qi, Miao Yu, and Haibing Guan. 2014. vgasa: Adaptive scheduling algorithm of virtualized gpu resource in cloud gaming. IEEE Trans. Parallel Distrib. Syst. 25, 11(2014), 3036-3045.
- (2014) IEEE Trans. Parallel Distrib. Syst. , vol.25 , Issue.11 , pp. 3036-3045
- Zhang, C.¹ Yao, J.² Qi, Z.³ Yu, M.⁴ Guan, H.⁵

172
- 84963813616
- A cloud gaming system based on user-level virtualization and its resource scheduling
- 2016
- Youhui Zhang, Peng Qu, Jiang Cihang, and Weimin Zheng. 2016. A cloud gaming system based on user-level virtualization and its resource scheduling. IEEE Trans. Parallel Distrib. Syst. 27, 5(2016), 1239-1252.
- (2016) IEEE Trans. Parallel Distrib. Syst. , vol.27 , Issue.5 , pp. 1239-1252
- Zhang, Y.¹ Qu, P.² Cihang, J.³ Zheng, W.⁴

173
- 84944682522
- GPES: A preemptive execution system for GPGPU computing
- IEEE
- Husheng Zhou, Guangmo Tong, and Cong Liu. 2015. GPES: A preemptive execution system for GPGPU computing. In Proceedings of the 21st IEEE Real-Time and Embedded Technology and Applications Symposium. IEEE, 87-97.
- (2015) Proceedings of the 21st IEEE Real-time and Embedded Technology and Applications Symposium , pp. 87-97
- Zhou, H.¹ Tong, G.² Liu, C.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.