-
1
-
-
34247885571
-
Intel virtualization technology for directed I/O
-
2006
-
Darren Abramson, Jeff Jackson, Sridhar Muthrasanallur, Gil Neiger, Greg Regnier, Rajesh Sankaran, Ioannis Schoinas, Rich Uhlig, Balaji Vembu, and John Wiegert. 2006. Intel virtualization technology for directed I/O. Intel Technol. J. 10, 3 (2006).
-
(2006)
Intel Technol. J.
, vol.10
, Issue.3
-
-
Abramson, D.1
Jackson, J.2
Muthrasanallur, S.3
Neiger, G.4
Regnier, G.5
Sankaran, R.6
Schoinas, I.7
Uhlig, R.8
Vembu, B.9
Wiegert, J.10
-
3
-
-
85027038682
-
-
2009
-
AMD. 2009. R6xx-3D-Registers.pdf. Retrieved from http://amd-dev. wpengine.netdna-cdn.com/wordpress/media/2013/10/R6xx-3D-Registers.pdf. (2009).
-
(2009)
R6xx-3D-Registers.pdf
-
-
-
4
-
-
84905472992
-
HOOMD-blue, general-purpose many-body dynamics on the GPU
-
Joshua Anderson, Aaron Keys, Carolyn Phillips, Trung Dac Nguyen, and Sharon Glotzer. 2010. HOOMD-blue, general-purpose many-body dynamics on the GPU. In APS Meeting Abstracts, Vol. 1. 18008.
-
(2010)
APS Meeting Abstracts
, vol.1
, pp. 18008
-
-
Anderson, J.1
Keys, A.2
Phillips, C.3
Nguyen, T.D.4
Glotzer, S.5
-
5
-
-
35648995516
-
-
EECS Department Technical Report, University of California, Berkeley
-
Krste Asanovic, Ras Bodik, Bryan Christopher Catanzaro, Joseph James Gebis, Parry Husbands, Kurt Keutzer, David A. Patterson, William Lester Plishker, John Shalf, Samuel Webb Williams, and others. 2006. The Landscape of Parallel Computing Research: A View from Berkeley. EECS Department Technical Report UCB/EECS-2006-183. University of California, Berkeley.
-
(2006)
The Landscape of Parallel Computing Research: A View from Berkeley
-
-
Asanovic, K.1
Bodik, R.2
Catanzaro, B.C.3
Gebis, J.J.4
Husbands, P.5
Keutzer, K.6
Patterson, D.A.7
Plishker, W.L.8
Shalf, J.9
Williams, S.W.10
-
6
-
-
70350729133
-
Accelerating monte carlo simulations of photon transport in a voxelized geometry using a massively parallel graphics processing unit
-
2009
-
Andreu Badal and Aldo Badano. 2009. Accelerating monte carlo simulations of photon transport in a voxelized geometry using a massively parallel graphics processing unit. Med. Phys. 36, 11(2009), 4878-4880.
-
(2009)
Med. Phys.
, vol.36
, Issue.11
, pp. 4878-4880
-
-
Badal, A.1
Badano, A.2
-
7
-
-
21644433634
-
Xen and the art of virtualization
-
2003
-
Paul Barham, Boris Dragovic, Keir Fraser, Steven Hand, Tim Harris, Alex Ho, Rolf Neugebauer, Ian Pratt, and Andrew Warfield. 2003. Xen and the art of virtualization. ACM SIGOPS Operat. Syst. Rev. 37, 5(2003), 164-177.
-
(2003)
ACM SIGOPS Operat. Syst. Rev.
, vol.37
, Issue.5
, pp. 164-177
-
-
Barham, P.1
Dragovic, B.2
Fraser, K.3
Hand, S.4
Harris, T.5
Ho, A.6
Neugebauer, R.7
Pratt, I.8
Warfield, A.9
-
8
-
-
84899626479
-
GPU acceleration for support vector machines
-
TU Delft; EWI; MM; PRB, Delft, The Netherlands
-
Andreas Athanasopoulos, Anastasios Dimou, Vasileios Mezaris, and Ioannis Kompatsiaris. 2011. GPU acceleration for support vector machines. In 12th International Workshop on Image Analysis for Multimedia Interactive Services (WIAMIS'11). TU Delft; EWI; MM; PRB, Delft, The Netherlands.
-
(2011)
12th International Workshop on Image Analysis for Multimedia Interactive Services (WIAMIS'11)
-
-
Athanasopoulos, A.1
Dimou, A.2
Mezaris, V.3
Kompatsiaris, I.4
-
10
-
-
84863973589
-
A virtual memory based runtime to support multi-tenancy in clusters with GPUs
-
ACM
-
Michela Becchi, Kittisak Sajjapongse, Ian Graves, Adam Procter, Vignesh Ravi, and Srimat Chakradhar. 2012. A virtual memory based runtime to support multi-tenancy in clusters with GPUs. In Proceedings of the 21st International Symposium on High-Performance Parallel and Distributed Computing. ACM, 97-108.
-
(2012)
Proceedings of the 21st International Symposium on High-performance Parallel and Distributed Computing
, pp. 97-108
-
-
Becchi, M.1
Sajjapongse, K.2
Graves, I.3
Procter, A.4
Ravi, V.5
Chakradhar, S.6
-
11
-
-
0035481820
-
Credit-based fair queueing (CBFQ): A simple service-scheduling algorithm for packet-switched networks
-
2001
-
Brahim Bensaou, Danny H. K. Tsang, and King Tung Chan. 2001. Credit-based fair queueing (CBFQ): A simple service-scheduling algorithm for packet-switched networks. IEEE/ACM Trans. Network. 9, 5(2001), 591-604.
-
(2001)
IEEE/ACM Trans. Network.
, vol.9
, Issue.5
, pp. 591-604
-
-
Bensaou, B.1
Tsang, D.H.K.2
Chan, K.T.3
-
12
-
-
33749245662
-
The direct3d 10 system
-
ACM
-
David Blythe. 2006. The direct3d 10 system. In ACM Transactions on Graphics, Vol. 25. ACM, 724-734.
-
(2006)
ACM Transactions on Graphics
, vol.25
, pp. 724-734
-
-
Blythe, D.1
-
13
-
-
84988905990
-
Understanding GPU power: A survey of profiling, modeling, and simulation methods
-
2016
-
Robert A. Bridges, Neena Imam, and Tiffany M Mintz. 2016. Understanding GPU power: A survey of profiling, modeling, and simulation methods. ACM Comput. Surv. 49, 3(2016), 41.
-
(2016)
ACM Comput. Surv.
, vol.49
, Issue.3
, pp. 41
-
-
Bridges, R.A.1
Imam, N.2
Mintz, T.M.3
-
15
-
-
84959287454
-
Exploring the suitability of remote GPGPU virtualization for the OpenACC programming model using rCUDA
-
IEEE
-
Adrián Castelló, Antonio J. Peña, Rafael Mayo, Pavan Balaji, and Enrique S. Quintana-Ortí. 2015. Exploring the suitability of remote GPGPU virtualization for the OpenACC programming model using rCUDA. In Proceedings of the 2015 IEEE International Conference on Cluster Computing. IEEE, 92-95.
-
(2015)
Proceedings of the 2015 IEEE International Conference on Cluster Computing
, pp. 92-95
-
-
Castelló, A.1
Peña, A.J.2
Mayo, R.3
Balaji, P.4
Quintana-Orti, E.S.5
-
17
-
-
84866630668
-
The architecture of vmware esxi
-
2008
-
Charu Chaubal. 2008. The architecture of vmware esxi. VMware White Pap. 1, 7 (2008).
-
(2008)
VMware White Pap.
, vol.1
, Issue.7
-
-
Chaubal, C.1
-
18
-
-
70649092154
-
Rodinia: A benchmark suite for heterogeneous computing
-
IEEE
-
Shuai Che, Michael Boyer, Jiayuan Meng, David Tarjan, Jeremy W. Sheaffer, Sang-Ha Lee, and Kevin Skadron. 2009. Rodinia: A benchmark suite for heterogeneous computing. In Proceedings of the IEEE International Symposium on Workload Characterization, 2009 (IISWC'09). IEEE, 44-54.
-
(2009)
Proceedings of the IEEE International Symposium on Workload Characterization, 2009 (IISWC'09)
, pp. 44-54
-
-
Che, S.1
Boyer, M.2
Meng, J.3
Tarjan, D.4
Sheaffer, J.W.5
Lee, S.-H.6
Skadron, K.7
-
21
-
-
80955130221
-
Heterogeneous cloud computing
-
IEEE
-
Steve Crago, Kyle Dunn, Patrick Eads, Lorin Hochstein, Dong-In Kang, Mikyung Kang, Devendra Modium, Karandeep Singh, Jinwoo Suh, and John Paul Walters. 2011. Heterogeneous cloud computing. In Proceedings of the 2011 IEEE International Conference on Cluster Computing. IEEE, 378-385.
-
(2011)
Proceedings of the 2011 IEEE International Conference on Cluster Computing
, pp. 378-385
-
-
Crago, S.1
Dunn, K.2
Eads, P.3
Hochstein, L.4
Kang, D.-I.5
Kang, M.6
Modium, D.7
Singh, K.8
Suh, J.9
Walters, J.P.10
-
22
-
-
71749121484
-
Trusted virtual platforms: A key enabler for converged client devices
-
2009
-
Chris I. Dalton, David Plaquin, Wolfgang Weidner, Dirk Kuhlmann, Boris Balacheff, and Richard Brown. 2009. Trusted virtual platforms: A key enabler for converged client devices. ACM SIGOPS Operat. Syst. Rev. 43, 1(2009), 36-43.
-
(2009)
ACM SIGOPS Operat. Syst. Rev.
, vol.43
, Issue.1
, pp. 36-43
-
-
Dalton, C.I.1
Plaquin, D.2
Weidner, W.3
Kuhlmann, D.4
Balacheff, B.5
Brown, R.6
-
23
-
-
77952273045
-
The scalable heterogeneous computing (SHOC) benchmark suite
-
ACM
-
Anthony Danalis, Gabriel Marin, Collin McCurdy, Jeremy S. Meredith, Philip C. Roth, Kyle Spafford, Vinod Tipparaju, and Jeffrey S. Vetter. 2010. The scalable heterogeneous computing (SHOC) benchmark suite. In Proceedings of the 3rd Workshop on General-Purpose Computation on Graphics Processing Units. ACM, 63-74.
-
(2010)
Proceedings of the 3rd Workshop on General-purpose Computation on Graphics Processing Units
, pp. 63-74
-
-
Danalis, A.1
Marin, G.2
McCurdy, C.3
Meredith, J.S.4
Roth, P.C.5
Spafford, K.6
Tipparaju, V.7
Vetter, J.S.8
-
27
-
-
85069163055
-
Boosting GPU virtualization performance with hybrid shadow page tables
-
Yaozu Dong, Mochi Xue, Xiao Zheng, Jiajun Wang, Zhengwei Qi, and Haibing Guan. 2015. Boosting GPU virtualization performance with hybrid shadow page tables. In Proceedings of the 2015 USENIX Annual Technical Conference (USENIX ATC'15). 517-528.
-
(2015)
Proceedings of the 2015 USENIX Annual Technical Conference (USENIX ATC'15)
, pp. 517-528
-
-
Dong, Y.1
Xue, M.2
Zheng, X.3
Wang, J.4
Qi, Z.5
Guan, H.6
-
28
-
-
84866114929
-
High performance network virtualization with SR-IOV
-
2012
-
Yaozu Dong, Xiaowei Yang, Jianhui Li, Guangdeng Liao, Kun Tian, and Haibing Guan. 2012. High performance network virtualization with SR-IOV. J. Parallel Distrib. Comput. 72, 11(2012), 1471-1480.
-
(2012)
J. Parallel Distrib. Comput.
, vol.72
, Issue.11
, pp. 1471-1480
-
-
Dong, Y.1
Yang, X.2
Li, J.3
Liao, G.4
Tian, K.5
Guan, H.6
-
29
-
-
0042674307
-
The LINPACK benchmark: Past, present and future
-
2003
-
Jack J. Dongarra, Piotr Luszczek, and Antoine Petitet. 2003. The LINPACK benchmark: Past, present and future. Concurr. Comput.: Pract. Exper. 15, 9(2003), 803-820.
-
(2003)
Concurr. Comput.: Pract. Exper.
, vol.15
, Issue.9
, pp. 803-820
-
-
Dongarra, J.J.1
Luszczek, P.2
Petitet, A.3
-
30
-
-
77952266871
-
GPU virtualization on VMware's hosted I/O architecture
-
2009
-
Micah Dowty and Jeremy Sugerman. 2009. GPU virtualization on VMware's hosted I/O architecture. ACM SIGOPS Operat. Syst. Rev. 43, 3(2009), 73-82.
-
(2009)
ACM SIGOPS Operat. Syst. Rev.
, vol.43
, Issue.3
, pp. 73-82
-
-
Dowty, M.1
Sugerman, J.2
-
31
-
-
77954589384
-
An efficient implementation of GPU virtualization in high performance clusters
-
Springer
-
José Duato, Francisco D. Igual, Rafael Mayo, Antonio J. Peña, Enrique S. Quintana-Ortí, and Federico Silla. 2009. An efficient implementation of GPU virtualization in high performance clusters. In European Conference on Parallel Processing. Springer, 385-394.
-
(2009)
European Conference on Parallel Processing
, pp. 385-394
-
-
Duato, J.1
Igual, F.D.2
Mayo, R.3
Peña, A.J.4
Quintana-Orti, E.S.5
Silla, F.6
-
32
-
-
84858051188
-
Enabling CUDA acceleration within virtual machines using rCUDA
-
IEEE
-
José Duato, Antonio J. Peña, Federico Silla, Juan C. Fernandez, Rafael Mayo, and Enrique S. Quintana-Ortí. 2011. Enabling CUDA acceleration within virtual machines using rCUDA. In Proceedings of the 2011 18th International Conference on High Performance Computing (HiPC'11). IEEE, 1-10.
-
(2011)
Proceedings of the 2011 18th International Conference on High Performance Computing (HiPC'11)
, pp. 1-10
-
-
Duato, J.1
Peña, A.J.2
Silla, F.3
Fernandez, J.C.4
Mayo, R.5
Quintana-Orti, E.S.6
-
33
-
-
78650853478
-
Modeling the CUDA remoting virtualization behaviour in high performance networks
-
José Duato, Antonio J. Peña, Federico Silla, Rafael Mayo, and Enrique S. Quintana-Orti. 2010a. Modeling the CUDA remoting virtualization behaviour in high performance networks. In Proceedings of the 1st Workshop on Language, Compiler, and Architecture Support for GPGPU.
-
(2010)
Proceedings of the 1st Workshop on Language, Compiler, and Architecture Support for GPGPU
-
-
Duato, J.1
Peña, A.J.2
Silla, F.3
Mayo, R.4
Quintana-Orti, E.S.5
-
34
-
-
77956946040
-
RCUDA: Reducing the number of GPU-based accelerators in high performance clusters
-
IEEE
-
José Duato, Antonio J. Peña, Federico Silla, Rafael Mayo, and Enrique S. Quintana-Ortí. 2010b. rCUDA: Reducing the number of GPU-based accelerators in high performance clusters. In Proceedings of the 2010 International Conference on High Performance Computing and Simulation (HPCS'10). IEEE, 224-231.
-
(2010)
Proceedings of the 2010 International Conference on High Performance Computing and Simulation (HPCS'10)
, pp. 224-231
-
-
Duato, J.1
Peña, A.J.2
Silla, F.3
Mayo, R.4
Quintana-Orti, E.S.5
-
35
-
-
80155140345
-
Performance of CUDA virtualized remote GPUsin high performance clusters
-
IEEE
-
José Duato, Antonio J. Peña, Federico Silla, Rafael Mayo, and Enrique S Quintana-Ortí. 2011. Performance of CUDA virtualized remote GPUsin high performance clusters. In Proceedings of the 2011 International Conference on Parallel Processing (ICPP'11). IEEE, 365-374.
-
(2011)
Proceedings of the 2011 International Conference on Parallel Processing (ICPP'11)
, pp. 365-374
-
-
Duato, J.1
Peña, A.J.2
Silla, F.3
Mayo, R.4
Quintana-Orti, E.S.5
-
37
-
-
84880327377
-
Generalpurpose computation on GPUs for high performance cloud computing
-
2013
-
Roberto R. Expósito, Guillermo L. Taboada, Sabela Ramos, Juan Touriño, and Ramón Doallo. 2013. Generalpurpose computation on GPUs for high performance cloud computing. Concurr. Comput.: Pract. Exper. 25, 12(2013), 1628-1642.
-
(2013)
Concurr. Comput.: Pract. Exper.
, vol.25
, Issue.12
, pp. 1628-1642
-
-
Expósito, R.R.1
Taboada, G.L.2
Ramos, S.3
Touriño, J.4
Doallo, R.5
-
38
-
-
84963771830
-
Affinityaware work-stealing for integrated CPU-GPU processors
-
ACM
-
Naila Farooqui, Rajkishore Barik, Brian T. Lewis, Tatiana Shpeisman, and Karsten Schwan. 2016. Affinityaware work-stealing for integrated CPU-GPU processors. In Proceedings of the 21st ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming. ACM, 30.
-
(2016)
Proceedings of the 21st ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming
, pp. 30
-
-
Farooqui, N.1
Barik, R.2
Lewis, B.T.3
Shpeisman, T.4
Schwan, K.5
-
43
-
-
84874423459
-
A GPU accelerated high performance cloud computing infrastructure for grid computing based virtual environmental laboratory
-
Springer, Berlin, Heidelberg
-
Francisco Giunta, Raffaele Montella, Giuliano Laccetti, Florin Isaila, and F. Blas. 2011. A GPU accelerated high performance cloud computing infrastructure for grid computing based virtual environmental laboratory. Adv. Grid Comput. Lecture Notes in Computer Science. Vol. 6271. Springer, Berlin, Heidelberg, 35-43.
-
(2011)
Adv. Grid Comput. Lecture Notes in Computer Science.
, vol.6271
, pp. 35-43
-
-
Giunta, F.1
Montella, R.2
Laccetti, G.3
Isaila, F.4
Blas, F.5
-
44
-
-
78349273083
-
A GPGPU transparent virtualization component for high performance computing clouds
-
Springer
-
Giulio Giunta, Raffaele Montella, Giuseppe Agrillo, and Giuseppe Coviello. 2010. A GPGPU transparent virtualization component for high performance computing clouds. In Euro-Par 2010-Parallel Processing. Springer, 379-391.
-
(2010)
Euro-par 2010-parallel Processing
, pp. 379-391
-
-
Giunta, G.1
Montella, R.2
Agrillo, G.3
Coviello, G.4
-
45
-
-
84928049032
-
Strong scaling of general-purpose molecular dynamics simulations on GPUs
-
2015
-
Jens Glaser, Trung Dac Nguyen, Joshua A. Anderson, Pak Lui, Filippo Spiga, Jaime A. Millan, David C. Morse, and Sharon C. Glotzer. 2015. Strong scaling of general-purpose molecular dynamics simulations on GPUs. Comput. Phys. Commun. 192(2015), 97-107.
-
(2015)
Comput. Phys. Commun.
, vol.192
, pp. 97-107
-
-
Glaser, J.1
Nguyen, T.D.2
Anderson, J.A.3
Lui, P.4
Spiga, F.5
Millan, J.A.6
Morse, D.C.7
Glotzer, S.C.8
-
46
-
-
84926427148
-
Survey of virtual machine research
-
1974
-
Robert P. Goldberg. 1974. Survey of virtual machine research. Computer 7, 6(1974), 34-45.
-
(1974)
Computer
, vol.7
, Issue.6
, pp. 34-45
-
-
Goldberg, R.P.1
-
47
-
-
84903973686
-
LoGV: Lowoverhead GPGPU virtualization
-
IEEE
-
Mathias Gottschlag, Martin Hillenbrand, Jens Kehne, Jan Stoess, and Frank Bellosa. 2013. LoGV: Lowoverhead GPGPU virtualization. In Proceedings of the 2013 IEEE 10th International Conference on High Performance Computing and Communications & 2013 IEEE International Conference on Embedded and Ubiquitous Computing (HPCC-EUC'13). IEEE, 1721-1726.
-
(2013)
Proceedings of the 2013 IEEE 10th International Conference on High Performance Computing and Communications & 2013 IEEE International Conference on Embedded and Ubiquitous Computing (HPCC-EUC'13)
, pp. 1721-1726
-
-
Gottschlag, M.1
Hillenbrand, M.2
Kehne, J.3
Stoess, J.4
Bellosa, F.5
-
48
-
-
84861018407
-
Particle simulation using CUDA
-
2010
-
Simon Green. 2010. Particle simulation using cuda. NVIDIA Whitepaper 6(2010), 121-128.
-
(2010)
NVIDIA Whitepaper
, vol.6
, pp. 121-128
-
-
Green, S.1
-
49
-
-
0030243005
-
A high-performance, portable implementation of the MPI message passing interface standard
-
1996
-
William Gropp, Ewing Lusk, Nathan Doss, and Anthony Skjellum. 1996. A high-performance, portable implementation of the MPI message passing interface standard. Parallel Comput. 22, 6(1996), 789-828.
-
(1996)
Parallel Comput.
, vol.22
, Issue.6
, pp. 789-828
-
-
Gropp, W.1
Lusk, E.2
Doss, N.3
Skjellum, A.4
-
50
-
-
79951728783
-
The opencl specification
-
2008
-
Khronos OpenCL Working Group et al. 2008. The opencl specification. Version 1, 29(2008), 8.
-
(2008)
Version
, vol.1
, Issue.29
, pp. 8
-
-
-
51
-
-
70349123351
-
GViM: GPU-accelerated virtual machines
-
ACM
-
Vishakha Gupta, Ada Gavrilovska, Karsten Schwan, Harshvardhan Kharche, Niraj Tolia, Vanish Talwar, and Parthasarathy Ranganathan. 2009. GViM: GPU-accelerated virtual machines. In Proceedings of the 3rd ACM Workshop on System-level Virtualization for High Performance Computing. ACM, 17-24.
-
(2009)
Proceedings of the 3rd ACM Workshop on System-level Virtualization for High Performance Computing
, pp. 17-24
-
-
Gupta, V.1
Gavrilovska, A.2
Schwan, K.3
Kharche, H.4
Tolia, N.5
Talwar, V.6
Ranganathan, P.7
-
52
-
-
84939240957
-
Energy-efficient SLA guarantees for virtualized GPU in cloud gaming
-
2015
-
Haibing Guan, Jianguo Yao, Zhengwei Qi, and Runze Wang. 2015. Energy-efficient SLA guarantees for virtualized GPU in cloud gaming. IEEE Trans-actions on Parallel Distrib. Syst. 26, 9(2015), 2434-2443.
-
(2015)
IEEE Trans-actions on Parallel Distrib. Syst.
, vol.26
, Issue.9
, pp. 2434-2443
-
-
Guan, H.1
Yao, J.2
Qi, Z.3
Wang, R.4
-
53
-
-
85077044984
-
Pegasus: Coordinated scheduling for virtualized accelerator-based systems
-
Vishakha Gupta, Karsten Schwan, Niraj Tolia, Vanish Talwar, and Parthasarathy Ranganathan. 2011. Pegasus: Coordinated scheduling for virtualized accelerator-based systems. In Proceedings of the 2011 USENIX Annual Technical Conference (USENIX ATC'11). 31.
-
(2011)
Proceedings of the 2011 USENIX Annual Technical Conference (USENIX ATC'11)
, pp. 31
-
-
Gupta, V.1
Schwan, K.2
Tolia, N.3
Talwar, V.4
Ranganathan, P.5
-
54
-
-
84899111809
-
Haswell: The fourth-generation intel core processor
-
2014
-
Per Hammarlund, Alberto J. Martinez, Atiq A. Bajwa, David L. Hill, Erik Hallnor, Hong Jiang, Martin Dixon, Michael Derr, Mikal Hunsaker, Rajesh Kumar, et al. 2014. Haswell: The fourth-generation intel core processor. IEEE Micro 34, 2(2014), 6-20.
-
(2014)
IEEE Micro
, vol.34
, Issue.2
, pp. 6-20
-
-
Hammarlund, P.1
Martinez, A.J.2
Bajwa, A.A.3
Hill, D.L.4
Hallnor, E.5
Jiang, H.6
Dixon, M.7
Derr, M.8
Hunsaker, M.9
Kumar, R.10
-
56
-
-
85077195492
-
Efficient and scalable paravirtual I/O system
-
Nadav Har'El, Abel Gordon, Alex Landau, Muli Ben-Yehuda, Avishay Traeger, and Razya Ladelsky. 2013. Efficient and scalable paravirtual I/O system. In Proceedings of the USENIX Annual Technical Conference. 231-242.
-
(2013)
Proceedings of the USENIX Annual Technical Conference
, pp. 231-242
-
-
Har'El, N.1
Gordon, A.2
Landau, A.3
Ben-Yehuda, M.4
Traeger, A.5
Ladelsky, R.6
-
57
-
-
84965031169
-
Enhancing the usability and utilization of accelerated architectures via docker
-
IEEE
-
Nicholas Haydel, Sandra Gesing, Ian Taylor, Gregory Madey, Abdul Dakkak, Simon Garcia De Gonzalo, and Wen-Mei W. Hwu. 2015. Enhancing the usability and utilization of accelerated architectures via docker. In Proceedings of the 2015 IEEE/ACM 8th International Conference on Utility and Cloud Computing (UCC'15). IEEE, 361-367.
-
(2015)
Proceedings of the 2015 IEEE/ACM 8th International Conference on Utility and Cloud Computing (UCC'15)
, pp. 361-367
-
-
Haydel, N.1
Gesing, S.2
Taylor, I.3
Madey, G.4
Dakkak, A.5
De Gonzalo, S.G.6
Hwu, W.W.7
-
58
-
-
84965004939
-
NVIDIA GRID: Graphics accelerated VDI with the visual performance of a workstation
-
2014
-
Alex Herrera. 2014. NVIDIA GRID: Graphics accelerated VDI with the visual performance of a workstation. Nvidia Corp (2014). http://www.nvidia.com/content/grid/vdi-whitepaper.pdf.
-
(2014)
NVIDIA Corp
-
-
Herrera, A.1
-
59
-
-
85083940224
-
GPU consolidation for cloud games: Are we there yet?
-
IEEE Press
-
Hua-Jun. Hong, Tao-Ya Fan-Chiang, Che-Run Lee, Kuan-Ta Chen, Chun-Ying Huang, and Cheng-Hsin Hsu. 2014. GPU consolidation for cloud games: Are we there yet?. In Proceedings of the 13th Annual Workshop on Network and Systems Support for Games. IEEE Press, 3.
-
(2014)
Proceedings of the 13th Annual Workshop on Network and Systems Support for Games
, pp. 3
-
-
Hong, H.-J.1
Fan-Chiang, T.-Y.2
Lee, C.-R.3
Chen, K.-T.4
Huang, C.-Y.5
Hsu, C.-H.6
-
61
-
-
0036993236
-
Chromium: A stream-processing framework for interactive rendering on clusters
-
ACM
-
Greg Humphreys, Mike Houston, Ren Ng, Randall Frank, Sean Ahern, Peter D. Kirchner, and James T. Klosowski. 2002. Chromium: A stream-processing framework for interactive rendering on clusters. In ACM Transactions on Graphics, Vol. 21. ACM, 693-702.
-
(2002)
ACM Transactions on Graphics
, vol.21
, pp. 693-702
-
-
Humphreys, G.1
Houston, M.2
Ng, R.3
Frank, R.4
Ahern, S.5
Kirchner, P.D.6
Klosowski, J.T.7
-
62
-
-
84878588753
-
Client rendering method for desktop virtualization services
-
2013
-
Su Min Jang, Won Hyuk Choi, and Won Young Kim. 2013. Client rendering method for desktop virtualization services. ETRI J. 35, 2(2013), 348-351.
-
(2013)
ETRI J.
, vol.35
, Issue.2
, pp. 348-351
-
-
Jang, S.M.1
Choi, W.H.2
Kim, W.Y.3
-
63
-
-
59049085159
-
Predictive runtime code scheduling for heterogeneous architectures
-
Springer
-
Víctor J. Jiménez, Lluís Vilanova, Isaac Gelado, Marisa Gil, Grigori Fursin, and Nacho Navarro. 2009. Predictive runtime code scheduling for heterogeneous architectures. In High Performance Embedded Architectures and Compilers. Springer, 19-33.
-
(2009)
High Performance Embedded Architectures and Compilers
, pp. 19-33
-
-
Jiménez, V.J.1
Vilanova, L.2
Gelado, I.3
Gil, M.4
Fursin, G.5
Navarro, N.6
-
64
-
-
84878141243
-
Exploiting GPUs invirtual machine for biocloud
-
2013 2013
-
Heeseung Jo, Jinkyu Jeong, Myoungho Lee, and Dong Hoon Choi. 2013a. Exploiting GPUs invirtual machine for biocloud. BioMed Res. Int. 2013 (2013).
-
(2013)
BioMed Res. Int
-
-
Jo, H.1
Jeong, J.2
Lee, M.3
Choi, D.H.4
-
65
-
-
84874986898
-
GPU virtualization using PCI direct pass-through
-
Trans Tech Publ
-
Hee Seung Jo, Myung Ho Lee, and Dong Hoon Choi. 2013b. GPU virtualization using PCI direct pass-through. In Applied Mechanics and Materials, Vol. 311. Trans Tech Publ, 15-19.
-
(2013)
Applied Mechanics and Materials
, vol.311
, pp. 15-19
-
-
Jo, H.S.1
Lee, M.H.2
Choi, D.H.3
-
70
-
-
84863015834
-
RGEM: A responsive GPGPU execution model for runtime engines
-
IEEE
-
Shinpei Kato, Karthik Lakshmanan, Aman Kumar, Mihir Kelkar, Yutaka Ishikawa, and Ragunathan Rajkumar. 2011c. RGEM: A responsive GPGPU execution model for runtime engines. In Proceedings of the 2011 IEEE 32nd Real-Time Systems Symposium (RTSS'11). IEEE, 57-66.
-
(2011)
Proceedings of the 2011 IEEE 32nd Real-time Systems Symposium (RTSS'11)
, pp. 57-66
-
-
Kato, S.1
Lakshmanan, K.2
Kumar, A.3
Kelkar, M.4
Ishikawa, Y.5
Rajkumar, R.6
-
73
-
-
84899964619
-
Secure device access for automotive software
-
IEEE
-
Se Won Kim, Chiyoung Lee, MooWoong Jeon, Hae Young Kwon, Hyun Woo Lee, and Chuck Yoo. 2013. Secure device access for automotive software. In Proceedings of the 2013 International Conference on Connected Vehicles and Expo (ICCVE'13). IEEE, 177-181.
-
(2013)
Proceedings of the 2013 International Conference on Connected Vehicles and Expo (ICCVE'13)
, pp. 177-181
-
-
Kim, S.W.1
Lee, C.2
Jeon, M.3
Kwon, H.Y.4
Lee, H.W.5
Yoo, C.6
-
74
-
-
85102984928
-
Programming massively parallel processors: A hands-on approach
-
David B. Kirk and W. Hwu Wen-mei. 2012. Programming Massively Parallel Processors: A Hands-on Approach. Newnes.
-
(2012)
Newnes
-
-
Kirk, D.B.1
Wen-Mei, W.H.2
-
75
-
-
54049158076
-
Kvm: The Linux virtual machine monitor
-
Avi Kivity, Yaniv Kamay, Dor Laor, Uri Lublin, and Anthony Liguori. 2007. kvm: The Linux virtual machine monitor. In Proceedings of the Linux Symposium, Vol. 1. 225-230.
-
(2007)
Proceedings of the Linux Symposium
, vol.1
, pp. 225-230
-
-
Kivity, A.1
Kamay, Y.2
Laor, D.3
Lublin, U.4
Liguori, A.5
-
76
-
-
77952125596
-
Westmere: A family of 32nm IA processors
-
Nasser A. Kurd, Subramani Bhamidipati, Christopher Mozak, Jeffrey L. Miller, Timothy M. Wilson, Mahadev Nemani, and Muntaquim Chowdhury. 2010. Westmere: A family of 32nm IA processors. In Proceedings of the 2010 IEEE International Solid-State Circuits Conference (ISSCC'10).
-
(2010)
Proceedings of the 2010 IEEE International Solid-state Circuits Conference (ISSCC'10)
-
-
Kurd, N.A.1
Bhamidipati, S.2
Mozak, C.3
Miller, J.L.4
Wilson, T.M.5
Nemani, M.6
Chowdhury, M.7
-
81
-
-
84893324240
-
PVOCL: Power-aware dynamic placement and migration in virtualized GPU environments
-
IEEE
-
Palden Lama, Yan Li, Ashwin M. Aji, Pavan Balaji, James Dinan, Shucai Xiao, Yunquan Zhang, Wu-chun Feng, Rajeev Thakur, and Xiaobo Zhou. 2013. pVOCL: Power-aware dynamic placement and migration in virtualized GPU environments. In Proceedings of the 2013 IEEE 33rd International Conference on Distributed Computing Systems (ICDCS'13). IEEE, 145-154.
-
(2013)
Proceedings of the 2013 IEEE 33rd International Conference on Distributed Computing Systems (ICDCS'13)
, pp. 145-154
-
-
Lama, P.1
Li, Y.2
Aji, A.M.3
Balaji, P.4
Dinan, J.5
Xiao, S.6
Zhang, Y.7
Feng, W.-C.8
Thakur, R.9
Zhou, X.10
-
83
-
-
84962521288
-
VADI: GPU virtualization for an automotive platform
-
2016
-
Chiyoung Lee, Se-Won Kim, and Chuck Yoo. 2016. VADI: GPU virtualization for an automotive platform. IEEE Trans. Industr. Inf. 12, 1(2016), 277-290.
-
(2016)
IEEE Trans. Industr. Inf.
, vol.12
, Issue.1
, pp. 277-290
-
-
Lee, C.1
Kim, S.-W.2
Yoo, C.3
-
87
-
-
84941215614
-
An evaluation of unified memory technology on NVIDIA GPUs
-
IEEE
-
Wenqiang Li, Guanghao Jin, Xuewen Cui, and Simon See. 2015. An evaluation of unified memory technology on nvidia gpus. In Proceedings of the 2015 15th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid'15). IEEE, 1092-1098.
-
(2015)
Proceedings of the 2015 15th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid'15)
, pp. 1092-1098
-
-
Li, W.1
Jin, G.2
Cui, X.3
See, S.4
-
91
-
-
84897749415
-
Disengaged scheduling for fair, protected access to fast computational accelerators
-
ACM
-
Konstantinos Menychtas, Kai Shen, and Michael L. Scott. 2014. Disengaged scheduling for fair, protected access to fast computational accelerators. In ACM SIGPLAN Notices, Vol. 49. ACM, 301-316.
-
(2014)
ACM SIGPLAN Notices
, vol.49
, pp. 301-316
-
-
Menychtas, K.1
Shen, K.2
Scott, M.L.3
-
92
-
-
79960181836
-
Shadowfax: Scaling in heterogeneous cluster systems via GPGPU assemblies
-
ACM
-
Alexander M. Merritt, Vishakha Gupta, Abhishek Verma, Ada Gavrilovska, and Karsten Schwan. 2011. Shadowfax: Scaling in heterogeneous cluster systems via GPGPU assemblies. In Proceedings of the 5th International Workshop on Virtualization Technologies in Distributed Computing. ACM, 3-10.
-
(2011)
Proceedings of the 5th International Workshop on Virtualization Technologies in Distributed Computing
, pp. 3-10
-
-
Merritt, A.M.1
Gupta, V.2
Verma, A.3
Gavrilovska, A.4
Schwan, K.5
-
93
-
-
84907440423
-
A survey of methods for analyzing and improving GPU energy efficiency
-
2015
-
Sparsh Mittal and Jeffrey S. Vetter. 2015. A survey of methods for analyzing and improving GPU energy efficiency. ACM Comput. Surv. 47, 2(2015), 19.
-
(2015)
ACM Comput. Surv.
, vol.47
, Issue.2
, pp. 19
-
-
Mittal, S.1
Vetter, J.S.2
-
94
-
-
84865204787
-
A general-purpose virtualization service for HPC on cloud computing: An application to GPUs
-
Springer
-
Raffaele Montella, Giuseppe Coviello, Giulio Giunta, Giuliano Laccetti, Florin Isaila, and Javier Garcia Blas. 2011. A general-purpose virtualization service for HPC on cloud computing: An application to GPUs. In International Conference on Parallel Processing and Applied Mathematics. Springer, 740-749.
-
(2011)
International Conference on Parallel Processing and Applied Mathematics
, pp. 740-749
-
-
Montella, R.1
Coviello, G.2
Giunta, G.3
Laccetti, G.4
Isaila, F.5
Blas, J.G.6
-
95
-
-
84896394744
-
Virtualizing high-end GPGPUs on ARM clusters for the next generation of high performance cloud computing
-
2014
-
Raffaele Montella, Giulio Giunta, and Giuliano Laccetti. 2014. Virtualizing high-end GPGPUs on ARM clusters for the next generation of high performance cloud computing. Cluster Comput. 17, 1(2014), 139-152.
-
(2014)
Cluster Comput.
, vol.17
, Issue.1
, pp. 139-152
-
-
Montella, R.1
Giunta, G.2
Laccetti, G.3
-
96
-
-
84964461702
-
Virtualizing CUDA enabled GPGPUs on ARM clusters
-
Springer
-
Raffaele Montella, Giulio Giunta, Giuliano Laccetti, Marco Lapegna, Carlo Palmieri, Carmine Ferraro, and Valentina Pelliccia. 2016a. Virtualizing CUDA enabled GPGPUs on ARM clusters. In Parallel Processing and Applied Mathematics. Springer, 3-14.
-
(2016)
Parallel Processing and Applied Mathematics
, pp. 3-14
-
-
Montella, R.1
Giunta, G.2
Laccetti, G.3
Lapegna, M.4
Palmieri, C.5
Ferraro, C.6
Pelliccia, V.7
-
97
-
-
84991112040
-
On the virtualization of CUDA based GPU remoting on ARM and X86 machines in the GVirtuS framework
-
2016
-
Raffaele Montella, Giulio Giunta, Giuliano Laccetti, Marco Lapegna, Carlo Palmieri, Carmine Ferraro, Valentina Pelliccia, Cheol-Ho Hong, Ivor Spence, and Dimitrios S. Nikolopoulos. 2016b. On the virtualization of CUDA based GPU remoting on ARM and X86 machines in the GVirtuS framework. Int. J. Parallel Program. (2016), 1-22. DOI: http://dx.doi.org/10.1007/s10766-016-0462-1
-
(2016)
Int. J. Parallel Program
, pp. 1-22
-
-
Montella, R.1
Giunta, G.2
Laccetti, G.3
Lapegna, M.4
Palmieri, C.5
Ferraro, C.6
Pelliccia, V.7
Hong, C.-H.8
Spence, I.9
Nikolopoulos, D.S.10
-
100
-
-
85027078971
-
-
NVIDIA. 2012. HyperQ Example. Retrieved from http://docs.nvidia.com/cuda/samples/6-Advanced/simpleHyperQ/doc/HyperQ.pdf.
-
(2012)
HyperQ Example
-
-
-
101
-
-
85025673404
-
-
NVIDIA. 2016a. GP100 Pascal Whitepaper. Retrieved from https://images.nvidia.com/content/pdf/tesla/whitepaper/pascal-architecture-whitepaper.pdf.
-
(2016)
GP100 Pascal Whitepaper
-
-
-
104
-
-
0003727497
-
-
Prentice Hall, Englewood Cliffs, NJ
-
Katsuhiko Ogata. 1995. Discrete-Time Control Systems. Vol. 2. Prentice Hall, Englewood Cliffs, NJ.
-
(1995)
Discrete-time Control Systems.
, vol.2
-
-
Ogata, K.1
-
105
-
-
84876533447
-
DS-CUDA: A middleware to use many GPUs in the cloud environment
-
IEEE
-
Masahiro Oikawa, Atsushi Kawai, Keigo Nomura, Koichi Yasuoka, Kenichi Yoshikawa, and Tetsu Narumi. 2012. DS-CUDA: A middleware to use many GPUs in the cloud environment. In Proceedings of the 2012 SC Companion to High Performance Computing, Networking, Storage and Analysis (SCC). IEEE, 1207-1214.
-
(2012)
Proceedings of the 2012 SC Companion to High Performance Computing, Networking, Storage and Analysis (SCC)
, pp. 1207-1214
-
-
Oikawa, M.1
Kawai, A.2
Nomura, K.3
Yasuoka, K.4
Yoshikawa, K.5
Narumi, T.6
-
109
-
-
85027003084
-
-
PathScale. 2012. pathscale/pscnv. Retrieved from https://github.com/pathscale/pscnv.
-
(2012)
Pathscale/pscnv
-
-
-
111
-
-
80955152874
-
The top 10 innovations in the new NVIDIA fermi architecture, and the top 3 next challenges
-
2009
-
David Patterson. 2009. The top 10 innovations in the new NVIDIA fermi architecture, and the top 3 next challenges. NVIDIA Whitepaper 47 (2009).
-
(2009)
NVIDIA Whitepaper
, vol.47
-
-
Patterson, D.1
-
112
-
-
84908669300
-
A complete and efficient CUDA-sharing solution for HPC clusters
-
2014
-
Antonio J. Peña, Carlos Reaño, Federico Silla, Rafael Mayo, Enrique S. Quintana-Ortí, and José Duato. 2014. A complete and efficient CUDA-sharing solution for HPC clusters. Parallel Comput. 40, 10(2014), 574-588.
-
(2014)
Parallel Comput.
, vol.40
, Issue.10
, pp. 574-588
-
-
Peña, A.J.1
Reaño, C.2
Silla, F.3
Mayo, R.4
Quintana-Orti, E.S.5
Duato, J.6
-
113
-
-
84976631372
-
Providing CUDA acceleration to KVM virtual machines in InfiniBand Clusters with rCUDA
-
Springer
-
Ferran Pérez, Carlos Reaño, and Federico Silla. 2016. Providing CUDA acceleration to KVM virtual machines in InfiniBand Clusters with rCUDA. In Distributed Applications and Interoperable Systems. Springer, 82-95.
-
(2016)
Distributed Applications and Interoperable Systems
, pp. 82-95
-
-
Pérez, F.1
Reaño, C.2
Silla, F.3
-
115
-
-
27344436659
-
Scalable molecular dynamics with NAMD
-
2005
-
James C. Phillips, Rosemary Braun, Wei Wang, James Gumbart, Emad Tajkhorshid, Elizabeth Villa, Christophe Chipot, Robert D. Skeel, Laxmikant Kale, and Klaus Schulten. 2005. Scalable molecular dynamics with NAMD. J. Comput. Chem. 26, 16(2005), 1781-1802.
-
(2005)
J. Comput. Chem.
, vol.26
, Issue.16
, pp. 1781-1802
-
-
Phillips, J.C.1
Braun, R.2
Wang, W.3
Gumbart, J.4
Tajkhorshid, E.5
Villa, E.6
Chipot, C.7
Skeel, R.D.8
Kale, L.9
Schulten, K.10
-
116
-
-
84894620801
-
LAMMPS-large-scale atomic/molecular massively parallel simulator
-
2007
-
Steve Plimpton, Paul Crozier, and Aidan Thompson. 2007. LAMMPS-large-scale atomic/molecular massively parallel simulator. Sandia National Laboratories 18 (2007). http://lammps.sandia.gov.
-
(2007)
Sandia National Laboratories
, vol.18
-
-
Plimpton, S.1
Crozier, P.2
Thompson, A.3
-
118
-
-
84907339380
-
VGRIS: Virtualized GPU resource isolation and scheduling in cloud gaming
-
2014
-
Zhengwei Qi, Jianguo Yao, Chao Zhang, Miao Yu, Zhizhou Yang, and Haibing Guan. 2014. VGRIS: Virtualized GPU resource isolation and scheduling in cloud gaming. ACM Trans. Arch. Code Optimiz. 11, 2(2014), 17.
-
(2014)
ACM Trans. Arch. Code Optimiz.
, vol.11
, Issue.2
, pp. 17
-
-
Qi, Z.1
Yao, J.2
Zhang, C.3
Yu, M.4
Yang, Z.5
Guan, H.6
-
119
-
-
84905836276
-
Toward a paravirtual vRDMA device for VMware ESXi guests
-
2012, 2012
-
Adit Ranadive and Bhavesh Davda. 2012. Toward a paravirtual vRDMA device for VMware ESXi guests. VMware Techn. J. 2012 1, 2 (2012).
-
(2012)
VMware Techn. J
, vol.1
, Issue.2
-
-
Ranadive, A.1
Davda, B.2
-
121
-
-
84893593068
-
Influence of InfiniBand FDR on the performance of remote GPU virtualization
-
IEEE
-
Carlos Reaño, Rafael Mayo, Enrique S. Quintana-Ortí, Federico Silla, José Duato, and Antonio J. Peña. 2013. Influence of InfiniBand FDR on the performance of remote GPU virtualization. In Proceedings of the 2013 IEEE International Conference on Cluster Computing (CLUSTER'13). IEEE, 1-8.
-
(2013)
Proceedings of the 2013 IEEE International Conference on Cluster Computing (CLUSTER'13)
, pp. 1-8
-
-
Reaño, C.1
Mayo, R.2
Quintana-Orti, E.S.3
Silla, F.4
Duato, J.5
Peña, A.J.6
-
122
-
-
84880307341
-
Cu2rcu: Towards the complete rcuda remote GPU virtualization and sharing solution
-
IEEE
-
Carlos Reaño, A. J. Pea, Federico Silla, José Duato, Rafael Mayo, and Enrique S. Quintana-Ortí. 2012. Cu2rcu: Towards the complete rcuda remote gpu virtualization and sharing solution. In Proceedings of the 2012 19th International Conference on High Performance Computing (HiPC'12). IEEE, 1-10.
-
(2012)
Proceedings of the 2012 19th International Conference on High Performance Computing (HiPC'12)
, pp. 1-10
-
-
Reaño, C.1
Pea, A.J.2
Silla, F.3
Duato, J.4
Mayo, R.5
Quintana-Orti, E.S.6
-
124
-
-
84941790414
-
Improving the user experience of the rCUDA remote GPU virtualization framework
-
2015
-
Carlos Reaño, Federico Silla, Adrián Castelló, Antonio J. Peña, Rafael Mayo, Enrique S Quintana-Ortí, and José Duato. 2015a. Improving the user experience of the rCUDA remote GPU virtualization framework. Concurr. Comput.: Pract. Exper. 27, 14(2015), 3746-3770.
-
(2015)
Concurr. Comput.: Pract. Exper.
, vol.27
, Issue.14
, pp. 3746-3770
-
-
Reaño, C.1
Silla, F.2
Castelló, A.3
Peña, A.J.4
Mayo, R.5
Quintana-Orti, E.S.6
Duato, J.7
-
126
-
-
82655162782
-
PTask: Operating system abstractions to manage GPUs as compute devices
-
ACM
-
Christopher J. Rossbach, Jon Currey, Mark Silberstein, Baishakhi Ray, and Emmett Witchel. 2011. PTask: Operating system abstractions to manage GPUs as compute devices. In Proceedings of the 23rd ACM Symposium on Operating Systems Principles. ACM, 233-248.
-
(2011)
Proceedings of the 23rd ACM Symposium on Operating Systems Principles
, pp. 233-248
-
-
Rossbach, C.J.1
Currey, J.2
Silberstein, M.3
Ray, B.4
Witchel, E.5
-
127
-
-
79951813794
-
Cloud and heterogeneous computing solutions exist today for the emerging big data problems in biology
-
2011
-
Eric E. Schadt, Michael D. Linderman, Jon Sorenson, Lawrence Lee, and Garry P. Nolan. 2011. Cloud and heterogeneous computing solutions exist today for the emerging big data problems in biology. Nat. Rev. Genet. 12, 3(2011), 224-224.
-
(2011)
Nat. Rev. Genet.
, vol.12
, Issue.3
, pp. 224
-
-
Schadt, E.E.1
Linderman, M.D.2
Sorenson, J.3
Lee, L.4
Nolan, G.P.5
-
129
-
-
84936950495
-
Scheduling multitenant cloud workloads on accelerator-based systems
-
IEEE Press
-
Dipanjan Sengupta, Anshuman Goswami, Karsten Schwan, and Krishna Pallavi. 2014. Scheduling multitenant cloud workloads on accelerator-based systems. In Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis. IEEE Press, 513-524.
-
(2014)
Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis
, pp. 513-524
-
-
Sengupta, D.1
Goswami, A.2
Schwan, K.3
Pallavi, K.4
-
130
-
-
80051667116
-
The development of Mellanox/NVIDIA GPUDirect over InfiniBanda new model for GPU to GPU communications
-
2011
-
Gilad Shainer, Ali Ayoub, Pak Lui, Tong Liu, Michael Kagan, Christian R. Trott, Greg Scantlen, and Paul S. Crozier. 2011. The development of Mellanox/NVIDIA GPUDirect over InfiniBanda new model for GPU to GPU communications. Comput. Sci. Res. Dev. 26, 3-4(2011), 267-273.
-
(2011)
Comput. Sci. Res. Dev.
, vol.26
, Issue.3-4
, pp. 267-273
-
-
Shainer, G.1
Ayoub, A.2
Lui, P.3
Liu, T.4
Kagan, M.5
Trott, C.R.6
Scantlen, G.7
Crozier, P.S.8
-
134
-
-
84860524424
-
VCUDA: GPU-accelerated high-performance computing in virtual machines
-
2012
-
Lin Shi, Hao Chen, Jianhua Sun, and Kenli Li. 2012. vCUDA: GPU-accelerated high-performance computing in virtual machines. IEEE Trans. Comput. 61, 6(2012), 804-816.
-
(2012)
IEEE Trans. Comput.
, vol.61
, Issue.6
, pp. 804-816
-
-
Shi, L.1
Chen, H.2
Sun, J.3
Li, K.4
-
135
-
-
79956161825
-
SHARC: A scalable 3D graphics virtual appliance delivery framework in cloud
-
2011
-
Weidong Shi, Yang Lu, Zhu Li, and Jonathan Engelsma. 2011. SHARC: A scalable 3D graphics virtual appliance delivery framework in cloud. J. Netw. Comput. Appl. 34, 4(2011), 1078-1087.
-
(2011)
J. Netw. Comput. Appl.
, vol.34
, Issue.4
, pp. 1078-1087
-
-
Shi, W.1
Lu, Y.2
Li, Z.3
Engelsma, J.4
-
136
-
-
0030171894
-
Efficient fair queuing using deficit round-robin
-
1996
-
Madhavapeddi Shreedhar and George Varghese. 1996. Efficient fair queuing using deficit round-robin. IEEE/ACM Trans. Netw. 4, 3(1996), 375-385.
-
(1996)
IEEE/ACM Trans. Netw.
, vol.4
, Issue.3
, pp. 375-385
-
-
Shreedhar, M.1
Varghese, G.2
-
137
-
-
0004233425
-
-
Addison-Wesley, Reading, MA
-
Abraham Silberschatz, Peter B. Galvin, Greg Gagne, and A. Silberschatz. 1998. Operating System Concepts. Vol. 4. Addison-Wesley, Reading, MA.
-
(1998)
Operating System Concepts.
, vol.4
-
-
Silberschatz, A.1
Galvin, P.B.2
Gagne, G.3
Silberschatz, A.4
-
138
-
-
85027023449
-
KVMGT: A full GPU virtualization solution
-
Jike Song, Zhiyuan Lv, and Kevin Tian. 2014. KVMGT: A full GPU virtualization solution. In KVM Forum 2014. http://www.linux-kvm.org/page/KVM-Forum-2014.
-
(2014)
KVM Forum 2014
-
-
Song, J.1
Lv, Z.2
Tian, K.3
-
139
-
-
84873470137
-
Parboil: A revised benchmark suite for scientific and commercial throughput computing
-
2012
-
John A. Stratton, Christopher Rodrigues, I-Jui Sung, Nady Obeid, Li-Wen Chang, Nasser Anssari, Geng Daniel Liu, and Wen-mei W. Hwu. 2012. Parboil: A revised benchmark suite for scientific and commercial throughput computing. Center for Reliable and High-Performance Computing 127 (2012).
-
(2012)
Center for Reliable and High-performance Computing
, vol.127
-
-
Stratton, J.A.1
Rodrigues, C.2
Sung, I.-J.3
Obeid, N.4
Chang, L.-W.5
Anssari, N.6
Liu, G.D.7
Hwu, W.W.8
-
141
-
-
84982095542
-
GPUvm: GPU virtualization at the hypervisor
-
2016
-
Yusuke Suzuki, Shinpei Kato, Hiroshi Yamada, and Kenji Kono. 2016. Gpuvm: Gpu virtualization at the hypervisor. IEEE Trans. Comput. 65, 9(2016), 2752-2766.
-
(2016)
IEEE Trans. Comput.
, vol.65
, Issue.9
, pp. 2752-2766
-
-
Suzuki, Y.1
Kato, S.2
Yamada, H.3
Kono, K.4
-
142
-
-
84905509992
-
Enabling preemptive multiprogramming on GPUs
-
IEEE Press
-
Ivan Tanasic, Isaac Gelado, Javier Cabezas, Alex Ramirez, Nacho Navarro, and Mateo Valero. 2014. Enabling preemptive multiprogramming on GPUs. In ACM SIGARCH Computer Architecture News, Vol. 42. IEEE Press, 193-204.
-
(2014)
ACM SIGARCH Computer Architecture News
, vol.42
, pp. 193-204
-
-
Tanasic, I.1
Gelado, I.2
Cabezas, J.3
Ramirez, A.4
Navarro, N.5
Valero, M.6
-
144
-
-
84897976339
-
Enabling OpenCL support for GPGPU in Kernel-based Virtual Machine
-
2014
-
Tsan-Rong Tien and Yi-Ping You. 2014. Enabling OpenCL support for GPGPU in Kernel-based Virtual Machine. Softw.: Pract. Exper. 44, 5(2014), 483-510.
-
(2014)
Softw.: Pract. Exper.
, vol.44
, Issue.5
, pp. 483-510
-
-
Tien, T.-R.1
You, Y.-P.2
-
145
-
-
85016284648
-
-
Top500. 2016. TOP500 Supercomputer Sites. Retrieved from https://www.top500.org/list/2016/06/.
-
(2016)
TOP500 Supercomputer Sites
-
-
-
146
-
-
20344391930
-
Intel virtualization technology
-
2005
-
Rich Uhlig, Gil Neiger, Dion Rodgers, Amy L. Santoni, Fernando C. M. Martins, Andrew V. Anderson, Steven M. Bennett, Alain Kagi, Felix H. Leung, and Larry Smith. 2005. Intel virtualization technology. Computer 38, 5(2005), 48-56.
-
(2005)
Computer
, vol.38
, Issue.5
, pp. 48-56
-
-
Uhlig, R.1
Neiger, G.2
Rodgers, D.3
Santoni, A.L.4
Martins, F.C.M.5
Anderson, A.V.6
Bennett, S.M.7
Kagi, A.8
Leung, F.H.9
Smith, L.10
-
148
-
-
33947412760
-
New approach to virtualization is a lightweight
-
2006
-
Stephen J. Vaughan-Nichols. 2006. New approach to virtualization is a lightweight. Computer 39, 11 (2006).
-
(2006)
Computer
, vol.39
, Issue.11
-
-
Vaughan-Nichols, S.J.1
-
152
-
-
84919792604
-
GPU passthrough performance: A comparison of KVM, Xen, VMWare ESXi, and LXC for CUDA and OpenCL applications
-
IEEE
-
John Paul Walters, Andrew J. Younge, Dong In Kang, Ke Thia Yao, Mikyung Kang, Stephen P. Crago, and Geoffrey C. Fox. 2014. GPU passthrough performance: A comparison of KVM, Xen, VMWare ESXi, and LXC for CUDA and OpenCL applications. In Proceedings of the 2014 IEEE 7th International Conference on Cloud Computing (CLOUD'14). IEEE, 636-643.
-
(2014)
Proceedings of the 2014 IEEE 7th International Conference on Cloud Computing (CLOUD'14)
, pp. 636-643
-
-
Walters, J.P.1
Younge, A.J.2
Kang, D.I.3
Yao, K.T.4
Kang, M.5
Crago, S.P.6
Fox, G.C.7
-
153
-
-
84968876675
-
A user mode CPU-GPU scheduling framework for hybrid workloads
-
2016
-
Bin Wang, Ruhui Ma, Zhengwei Qi, Jianguo Yao, and Haibing Guan. 2016. A user mode CPU-GPU scheduling framework for hybrid workloads. Future Gener. Comput. Syst. 63(2016), 25-36.
-
(2016)
Future Gener. Comput. Syst.
, vol.63
, pp. 25-36
-
-
Wang, B.1
Ma, R.2
Qi, Z.3
Yao, J.4
Guan, H.5
-
155
-
-
70349253246
-
Trusted computing building blocks for embedded linux-based ARM trustzone platforms
-
ACM
-
Johannes Winter. 2008. Trusted computing building blocks for embedded linux-based ARM trustzone platforms. In Proceedings of the 3rd ACM Workshop on Scalable Trusted Computing. ACM, 21-30.
-
(2008)
Proceedings of the 3rd ACM Workshop on Scalable Trusted Computing
, pp. 21-30
-
-
Winter, J.1
-
157
-
-
0003651470
-
-
Addison-Wesley Longman Publishing Co., Inc
-
Mason Woo, Jackie Neider, Tom Davis, and Dave Shreiner. 1999. OpenGL Programming Guide: The Official Guide to Learning OpenGL, Version 1.2. Addison-Wesley Longman Publishing Co., Inc.
-
(1999)
OpenGL Programming Guide: The Official Guide to Learning OpenGL, Version 1.2
-
-
Woo, M.1
Neider, J.2
Davis, T.3
Shreiner, D.4
-
159
-
-
85027032075
-
-
Xenproject. 2016. Xen Project Release Features. Retrieved from https://wiki.xenproject.org/wiki/Xen-Project-Release-Features.
-
(2016)
Xen Project Release Features
-
-
-
160
-
-
84870656041
-
VOCL: An optimized environment for transparent virtualization of graphics processing units
-
IEEE
-
Shucai Xiao, Pavan Balaji, Qian Zhu, Rajeev Thakur, Susan Coghlan, Heshan Lin, Gaojin Wen, Jue Hong, and Wu-chun Feng. 2012. VOCL: An optimized environment for transparent virtualization of graphics processing units. In Proceedings of the Innovative Parallel Computing (InPar'12). IEEE, 1-12.
-
(2012)
Proceedings of the Innovative Parallel Computing (InPar'12)
, pp. 1-12
-
-
Xiao, S.1
Balaji, P.2
Zhu, Q.3
Thakur, R.4
Coghlan, S.5
Lin, H.6
Wen, G.7
Hong, J.8
Feng, W.-C.9
-
162
-
-
85029492411
-
GScale: Scaling up GPU virtualization with dynamic sharing of graphics memory space
-
Mochi Xue, Kun Tian, Yaozu Dong, Jiajun Wang, Zhengwei Qi, Bingsheng He, and Haibing Guan. 2016. gScale: Scaling up GPU virtualization with dynamic sharing of graphics memory space. In Proceedings of the 2016 USENIX Annual Technical Conference (USENIX ATC'16).
-
(2016)
Proceedings of the 2016 USENIX Annual Technical Conference (USENIX ATC'16)
-
-
Xue, M.1
Tian, K.2
Dong, Y.3
Wang, J.4
Qi, Z.5
He, B.6
Guan, H.7
-
163
-
-
84898061475
-
Implementation of GPU virtualization using PCI pass-through mechanism
-
2014
-
Chao-Tung Yang, Jung-Chun Liu, Hsien-Yi Wang, and Ching-Hsien Hsu. 2014. Implementation of GPU virtualization using PCI pass-through mechanism. J. Supercomput. 68, 1(2014), 183-213.
-
(2014)
J. Supercomput.
, vol.68
, Issue.1
, pp. 183-213
-
-
Yang, C.-T.1
Liu, J.-C.2
Wang, H.-Y.3
Hsu, C.-H.4
-
164
-
-
84871599572
-
Using pci pass-through for GPU virtualization with CUDA
-
Springer
-
Chao-Tung Yang, Hsien-Yi Wang, and Yu-Tso Liu. 2012a. Using pci pass-through for gpu virtualization with cuda. In Network and Parallel Computing. Springer, 445-452.
-
(2012)
Network and Parallel Computing
, pp. 445-452
-
-
Yang, C.-T.1
Wang, H.-Y.2
Liu, Y.-T.3
-
165
-
-
84874254048
-
On implementation of GPU virtualization using PCI pass-through
-
IEEE
-
Chao-Tung Yang, Hsien-Yi Wang, Wei-Shen Ou, Yu-Tso Liu, and Ching-Hsien Hsu. 2012b. On implementation of GPU virtualization using PCI pass-through. In Proceedings of the 2012 IEEE 4th International Conference on Cloud Computing Technology and Science (CloudCom'12). IEEE, 711-716.
-
(2012)
Proceedings of the 2012 IEEE 4th International Conference on Cloud Computing Technology and Science (CloudCom'12)
, pp. 711-716
-
-
Yang, C.-T.1
Wang, H.-Y.2
Ou, W.-S.3
Liu, Y.-T.4
Hsu, C.-H.5
-
166
-
-
84883321005
-
GPU virtualization support in cloud system
-
Springer
-
Chih-Yuan Yeh, Chung-Yao Kao, Wei-Shu Hung, Ching-Chi Lin, Pangfeng Liu, Jan-Jan Wu, and Kuang-Chih Liu. 2013. GPU virtualization support in cloud system. In International Conference on Grid and Pervasive Computing. Springer, 423-432.
-
(2013)
International Conference on Grid and Pervasive Computing
, pp. 423-432
-
-
Yeh, C.-Y.1
Kao, C.-Y.2
Hung, W.-S.3
Lin, C.-C.4
Liu, P.5
Wu, J.-J.6
Liu, K.-C.7
-
167
-
-
84939129434
-
VirtCL: A framework for OpenCL device abstraction and management
-
ACM
-
Yi-Ping You, Hen-Jung Wu, Yeh-Ning Tsai, and Yen-Ting Chao. 2015. VirtCL: A framework for OpenCL device abstraction and management. In ACM SIGPLAN Notices, Vol. 50. ACM, 161-172.
-
(2015)
ACM SIGPLAN Notices
, vol.50
, pp. 161-172
-
-
You, Y.-P.1
Wu, H.-J.2
Tsai, Y.-N.3
Chao, Y.-T.4
-
170
-
-
84969718487
-
Supporting high performance molecular dynamics in virtualized clusters using IOMMU, SR-IOV, and GPUDirect
-
Andrew J. Younge, John Paul Walters, Stephen P. Crago, and Geoffrey C. Fox. 2015. Supporting high performance molecular dynamics in virtualized clusters using IOMMU, SR-IOV, and GPUDirect. In Proceedings of the 11th ACM SIGPLAN/SIGOPS International Conference on Virtual Execution Environments. ACM, 31-38.
-
(2015)
Proceedings of the 11th ACM SIGPLAN/SIGOPS International Conference on Virtual Execution Environments
, pp. 31-38
-
-
Younge, A.J.1
Walters, J.P.2
Crago, S.P.3
Fox, G.C.4
-
171
-
-
84904498998
-
Vgasa: Adaptive scheduling algorithm of virtualized GPU resource in cloud gaming
-
2014
-
Chao Zhang, Jianguo Yao, Zhengwei Qi, Miao Yu, and Haibing Guan. 2014. vgasa: Adaptive scheduling algorithm of virtualized gpu resource in cloud gaming. IEEE Trans. Parallel Distrib. Syst. 25, 11(2014), 3036-3045.
-
(2014)
IEEE Trans. Parallel Distrib. Syst.
, vol.25
, Issue.11
, pp. 3036-3045
-
-
Zhang, C.1
Yao, J.2
Qi, Z.3
Yu, M.4
Guan, H.5
-
172
-
-
84963813616
-
A cloud gaming system based on user-level virtualization and its resource scheduling
-
2016
-
Youhui Zhang, Peng Qu, Jiang Cihang, and Weimin Zheng. 2016. A cloud gaming system based on user-level virtualization and its resource scheduling. IEEE Trans. Parallel Distrib. Syst. 27, 5(2016), 1239-1252.
-
(2016)
IEEE Trans. Parallel Distrib. Syst.
, vol.27
, Issue.5
, pp. 1239-1252
-
-
Zhang, Y.1
Qu, P.2
Cihang, J.3
Zheng, W.4
|