-
1
-
-
77954743119
-
Fast sort on CPUs and GPUs: A case for bandwidth oblivious SIMD sort
-
N. Satish, et al., "Fast sort on CPUs and GPUs: A case for bandwidth oblivious SIMD sort," in Proc. ACM SIGMOD Int. Conf. Manage. Data, 2010, pp. 351-362.
-
(2010)
Proc. ACM SIGMOD Int. Conf. Manage. Data
, pp. 351-362
-
-
Satish, N.1
-
2
-
-
84880102288
-
GPUfs: Integrating a file system with GPUs
-
M. Silberstein, B. Ford, I. Keidar, and E. Witchel, "GPUfs: Integrating a file system with GPUs," ACM SIGARCH Comput. Archit. News, vol. 41, no. 1, pp. 485-498, 2013.
-
(2013)
ACM SIGARCH Comput. Archit. News
, vol.41
, Issue.1
, pp. 485-498
-
-
Silberstein, M.1
Ford, B.2
Keidar, I.3
Witchel, E.4
-
3
-
-
78649999328
-
Case study for running HPC applications in public clouds
-
Q. He, S. Zhou, B. Kobler, D. Duffy, and T. McGlynn, "Case study for running HPC applications in public clouds," in Proc. 19th ACM Int. Symp. High Perform. Distrib. Comput., 2010, pp. 395-401.
-
(2010)
Proc. 19th ACM Int. Symp. High Perform. Distrib. Comput.
, pp. 395-401
-
-
He, Q.1
Zhou, S.2
Kobler, B.3
Duffy, D.4
McGlynn, T.5
-
4
-
-
85025673404
-
-
NVIDIA, "GP100 Pascal whitepaper," 2016. [Online]. Available: https://images.nvidia.com/content/pdf/tesla/whitepaper/pascal-architecture-whitepaper.pdf
-
(2016)
GP100 Pascal Whitepaper
-
-
-
5
-
-
84905509992
-
Enabling preemptive multiprogramming on GPUs
-
I. Tanasic, I. Gelado, J. Cabezas, A. Ramirez, N. Navarro, and M. Valero, "Enabling preemptive multiprogramming on GPUs," ACM SIGARCH Comput. Archit. News, vol. 42, no. 3, pp. 193-204, 2014.
-
(2014)
ACM SIGARCH Comput. Archit. News
, vol.42
, Issue.3
, pp. 193-204
-
-
Tanasic, I.1
Gelado, I.2
Cabezas, J.3
Ramirez, A.4
Navarro, N.5
Valero, M.6
-
7
-
-
78349273083
-
A GPGPU transparent virtualization component for high performance computing clouds
-
G. Giunta, R. Montella, G. Agrillo, and G. Coviello, "A GPGPU transparent virtualization component for high performance computing clouds," in Euro-Par 2010-Parallel Processing. Berlin, Germany: Springer, 2010, pp. 379-391.
-
(2010)
Euro-Par 2010-Parallel Processing. Berlin, Germany: Springer
, pp. 379-391
-
-
Giunta, G.1
Montella, R.2
Agrillo, G.3
Coviello, G.4
-
8
-
-
85077032008
-
TimeGraph: GPU scheduling for real-time multi-tasking environments
-
S. Kato, K. Lakshmanan, R. Rajkumar, and Y. Ishikawa, "TimeGraph: GPU scheduling for real-time multi-tasking environments," in Proc. USENIX Annu. Tech. Conf., 2011, Art. no. 17.
-
(2011)
Proc. USENIX Annu. Tech. Conf.
-
-
Kato, S.1
Lakshmanan, K.2
Rajkumar, R.3
Ishikawa, Y.4
-
9
-
-
85077122204
-
Gdev: First-class GPU resource management in the operating system
-
S. Kato, M. McThrow, C. Maltzahn, and S. A. Brandt, "Gdev: First-class GPU resource management in the operating system," in Proc. USENIX Annu. Tech. Conf., 2012, pp. 401-412.
-
(2012)
Proc. USENIX Annu. Tech. Conf.
, pp. 401-412
-
-
Kato, S.1
McThrow, M.2
Maltzahn, C.3
Brandt, S.A.4
-
10
-
-
85077044984
-
Pegasus: Coordinated scheduling for virtualized accelerator-based systems
-
V. Gupta, K. Schwan, N. Tolia, V. Talwar, and P. Ranganathan, "Pegasus: Coordinated scheduling for virtualized accelerator-based systems," in Proc. USENIX Annu. Tech. Conf., 2011, Art. no. 31.
-
(2011)
Proc. USENIX Annu. Tech. Conf.
-
-
Gupta, V.1
Schwan, K.2
Tolia, N.3
Talwar, V.4
Ranganathan, P.5
-
11
-
-
84860524424
-
vCUDA: GPU-accelerated high-performance computing in virtual machines
-
Jun.
-
L. Shi, H. Chen, J. Sun, and K. Li, "vCUDA: GPU-accelerated high-performance computing in virtual machines," IEEE Trans. Comput., vol. 61, no. 6, pp. 804-816, Jun. 2012.
-
(2012)
IEEE Trans. Comput.
, vol.61
, Issue.6
, pp. 804-816
-
-
Shi, L.1
Chen, H.2
Sun, J.3
Li, K.4
-
12
-
-
85077458357
-
GPUvm: Why not virtualizing GPUs at the hypervisor?
-
Y. Suzuki, S. Kato, H. Yamada, and K. Kono, "GPUvm: Why not virtualizing GPUs at the hypervisor?" in Proc. USENIX Annu. Tech. Conf., 2014, pp. 109-120.
-
(2014)
Proc. USENIX Annu. Tech. Conf.
, pp. 109-120
-
-
Suzuki, Y.1
Kato, S.2
Yamada, H.3
Kono, K.4
-
13
-
-
77956946040
-
rCUDA: Reducing the number of GPU-based accelerators in high performance clusters
-
J. Duato, A. J. Pena, F. Silla, R. Mayo, and E. S. Quintana-Ortí, "rCUDA: Reducing the number of GPU-based accelerators in high performance clusters," in Proc. Int. Conf. High Perform. Comput. Simul., 2010, pp. 224-231.
-
(2010)
Proc. Int. Conf. High Perform. Comput. Simul.
, pp. 224-231
-
-
Duato, J.1
Pena, A.J.2
Silla, F.3
Mayo, R.4
Quintana-Ortí, E.S.5
-
14
-
-
84897749415
-
Disengaged scheduling for fair, protected access to fast computational accelerators
-
K. Menychtas, K. Shen, and M. L. Scott, "Disengaged scheduling for fair, protected access to fast computational accelerators," ACM SIGPLAN Notices, vol. 49, no. 4, pp. 301-316, 2014.
-
(2014)
ACM SIGPLAN Notices
, vol.49
, Issue.4
, pp. 301-316
-
-
Menychtas, K.1
Shen, K.2
Scott, M.L.3
-
15
-
-
85027078923
-
GPU virtualization and scheduling methods: A comprehensive survey
-
C.-H. Hong, I. Spence, and S. D. Nikolopoulos, "GPU virtualization and scheduling methods: A comprehensive survey," ACM Comput. Surveys (CSUR), vol. 50, no. 3, p. 35, 2017.
-
(2017)
ACM Comput. Surveys (CSUR)
, vol.50
, Issue.3
, p. 35
-
-
Hong, C.-H.1
Spence, I.2
Nikolopoulos, S.D.3
-
16
-
-
84962521288
-
VADI: GPU virtualization for an automotive platform
-
Feb.
-
C. Lee, S.-W. Kim, and C. Yoo, "VADI: GPU virtualization for an automotive platform," IEEE Trans. Ind. Informat., vol. 12, no. 1, pp. 277-290, Feb. 2016.
-
(2016)
IEEE Trans. Ind. Informat.
, vol.12
, Issue.1
, pp. 277-290
-
-
Lee, C.1
Kim, S.-W.2
Yoo, C.3
-
17
-
-
84858051188
-
Enabling CUDA acceleration within virtual machines using rCUDA
-
2011
-
J. Duato, A. J. Pena, F. Silla, J. C. Fernandez, R. Mayo, and E. S. Quintana-Ortí, "Enabling CUDA acceleration within virtual machines using rCUDA," in Proc. 18th Int. Conf. High Perform. Comput., 2011, pp. 1-10.
-
Proc. 18th Int. Conf. High Perform. Comput.
, pp. 1-10
-
-
Duato, J.1
Pena, A.J.2
Silla, F.3
Fernandez, J.C.4
Mayo, R.5
Quintana-Ortí, E.S.6
-
18
-
-
84991112040
-
On the virtualization of CUDA based GPU remoting on ARM and X86 machines in the GVirtuS framework
-
R. Montella, et al., "On the virtualization of CUDA based GPU remoting on ARM and X86 machines in the GVirtuS framework," Int. J. Parallel Program., pp. 1-22, 2016. [Online]. Available: http://dx.doi.org/10.1007/s10766-016-0462-1
-
(2016)
Int. J. Parallel Program.
, pp. 1-22
-
-
Montella, R.1
-
20
-
-
85027003084
-
-
PathScale, "pathscale/pscnv," 2012. [Online]. Available: https://github.com/pathscale/pscnv
-
(2012)
pathscale/pscnv
-
-
-
21
-
-
85077449318
-
A full GPU virtualization solution with mediated pass-through
-
K. Tian, Y. Dong, and D. Cowperthwaite, "A full GPU virtualization solution with mediated pass-through," in Proc. USENIX Conf. USENIX Annu. Tech. Conf., 2014, pp. 121-132.
-
(2014)
Proc. USENIX Conf. USENIX Annu. Tech. Conf.
, pp. 121-132
-
-
Tian, K.1
Dong, Y.2
Cowperthwaite, D.3
-
22
-
-
85027023449
-
KVMGT: A full GPU virtualization solution
-
J. Song, Z. Lv, and K. Tian, "KVMGT: A full GPU virtualization solution," in KVM Forum, 2014, http://www.linux-kvm.org/page/KVM-Forum-2014
-
(2014)
KVM Forum
-
-
Song, J.1
Lv, Z.2
Tian, K.3
-
23
-
-
84965004939
-
NVIDIA grid: Graphics accelerated VDI with the visual performance of a workstation
-
Santa Clara, CA, USA
-
A. Herrera, "NVIDIA grid: Graphics accelerated VDI with the visual performance of a workstation," NVIDIA Corp, Santa Clara, CA, USA, 2014.
-
(2014)
NVIDIA Corp
-
-
Herrera, A.1
-
24
-
-
77952266871
-
GPU virtualization on VMware's hosted I/O architecture
-
M. Dowty and J. Sugerman, "GPU virtualization on VMware's hosted I/O architecture," ACM SIGOPS Operating Syst. Rev., vol. 43, no. 3, pp. 73-82, 2009.
-
(2009)
ACM SIGOPS Operating Syst. Rev.
, vol.43
, Issue.3
, pp. 73-82
-
-
Dowty, M.1
Sugerman, J.2
-
26
-
-
21644433634
-
Xen and the art of virtualization
-
P. Barham, et al., "Xen and the art of virtualization," ACM SIGOPS Operating Syst. Rev., vol. 37, no. 5, pp. 164-177, 2003.
-
(2003)
ACM SIGOPS Operating Syst. Rev.
, vol.37
, Issue.5
, pp. 164-177
-
-
Barham, P.1
-
27
-
-
79959411439
-
FastForward for efficient pipeline parallelism: A cache-optimized concurrent lock-free queue
-
J. Giacomoni, T. Moseley, and M. Vachharajani, "FastForward for efficient pipeline parallelism: A cache-optimized concurrent lock-free queue," in Proc. 13th ACM SIGPLAN Symp. Principles Practice Parallel Program., 2008, pp. 43-52.
-
(2008)
Proc. 13th ACM SIGPLAN Symp. Principles Practice Parallel Program.
, pp. 43-52
-
-
Giacomoni, J.1
Moseley, T.2
Vachharajani, M.3
-
29
-
-
85077195492
-
Efficient and scalable paravirtual I/O system
-
N. Har'El, A. Gordon, A. Landau, M. Ben-Yehuda, A. Traeger, and R. Ladelsky, "Efficient and scalable paravirtual I/O system," in Proc. USENIX Annu. Tech. Conf., 2013, pp. 231-242.
-
(2013)
Proc. USENIX Annu. Tech. Conf.
, pp. 231-242
-
-
Har'El, N.1
Gordon, A.2
Landau, A.3
Ben-Yehuda, M.4
Traeger, A.5
Ladelsky, R.6
-
30
-
-
84976660469
-
How fair is fair queuing
-
A. G. Greenberg and N. Madras, "How fair is fair queuing," J. ACM, vol. 39, no. 3, pp. 568-598, 1992.
-
(1992)
J. ACM
, vol.39
, Issue.3
, pp. 568-598
-
-
Greenberg, A.G.1
Madras, N.2
-
32
-
-
79960501403
-
Dynamic adaptive scheduling for virtual machines
-
C. Weng, Q. Liu, L. Yu, and M. Li, "Dynamic adaptive scheduling for virtual machines," in Proc. 20th Int. Symp. High Perform. Distrib. Comput., 2011, pp. 239-250.
-
(2011)
Proc. 20th Int. Symp. High Perform. Distrib. Comput.
, pp. 239-250
-
-
Weng, C.1
Liu, Q.2
Yu, L.3
Li, M.4
-
33
-
-
67650046427
-
Task-aware virtual machine scheduling for I/O performance
-
H. Kim, H. Lim, J. Jeong, H. Jo, and J. Lee, "Task-aware virtual machine scheduling for I/O performance," in Proc. ACM SIGPLAN/SIGOPS Int. Conf. Virtual Execution Environments, 2009, pp. 101-110.
-
(2009)
Proc. ACM SIGPLAN/SIGOPS Int. Conf. Virtual Execution Environments
, pp. 101-110
-
-
Kim, H.1
Lim, H.2
Jeong, J.3
Jo, H.4
Lee, J.5
-
34
-
-
79951800982
-
Completely fair scheduler
-
C. S. Pabla, "Completely fair scheduler," Linux J., vol. 2009, no. 184, 2009, Art. no. 4.
-
(2009)
Linux J.
, vol.2009
, Issue.184
-
-
Pabla, C.S.1
-
35
-
-
84965001845
-
Enabling OS research by inferring interactions in the black-box GPU stack
-
K. Menychtas, K. Shen, and M. L. Scott, "Enabling OS research by inferring interactions in the black-box GPU stack," in Proc. USENIX Annu. Tech. Conf., 2013, pp. 291-296.
-
(2013)
Proc. USENIX Annu. Tech. Conf.
, pp. 291-296
-
-
Menychtas, K.1
Shen, K.2
Scott, M.L.3
-
36
-
-
84968735868
-
Portable and transparent software managed scheduling on accelerators for fair resource sharing
-
C. Margiolas and M. F. O'Boyle, "Portable and transparent software managed scheduling on accelerators for fair resource sharing," in Proc. Int. Symp. Code Generation Optimization, 2016, pp. 82-93.
-
(2016)
Proc. Int. Symp. Code Generation Optimization
, pp. 82-93
-
-
Margiolas, C.1
O'Boyle, M.F.2
-
37
-
-
84944682522
-
GPES: A preemptive execution system for GPGPU computing
-
H. Zhou, G. Tong, and C. Liu, "GPES: A preemptive execution system for GPGPU computing," in Proc. 21st IEEE Real-Time Embedded Technol. Appl. Symp., 2015, pp. 87-97.
-
(2015)
Proc. 21st IEEE Real-Time Embedded Technol. Appl. Symp.
, pp. 87-97
-
-
Zhou, H.1
Tong, G.2
Liu, C.3
-
38
-
-
70649092154
-
Rodinia: A benchmark suite for heterogeneous computing
-
S. Che, et al., "Rodinia: A benchmark suite for heterogeneous computing," in Proc. IEEE Int. Symp. Workload Characterization, 2009, pp. 44-54.
-
(2009)
Proc. IEEE Int. Symp. Workload Characterization
, pp. 44-54
-
-
Che, S.1
-
39
-
-
85076924934
-
Performance isolation and fairness for multi-tenant cloud storage
-
D. Shue, M. J. Freedman, and A. Shaikh, "Performance isolation and fairness for multi-tenant cloud storage," in Proc. 10th USENIX Conf. Operating Syst. Des. Implementation, 2012, pp. 349-362.
-
(2012)
Proc. 10th USENIX Conf. Operating Syst. Des. Implementation
, pp. 349-362
-
-
Shue, D.1
Freedman, M.J.2
Shaikh, A.3