[1] Khronos OpenCL Working Group. The OpenCL Specification, December 2008.
[2] NVIDIA. NVIDIA CUDA Compute Unified Device Architecture. NVIDIA Corporation, Santa Clara, California, 2.1 edition, October 2008.
[3] NVIDIA. NVIDIA Compute PTX: Parallel Thread Execution. NVIDIA Corporation, Santa Clara, California, 1.3 edition, October 2008.
[5] Gregory Diamos, Andrew Kerr, Sudhakar Yalamanchili, and Nathan Clark. Ocelot: a dynamic optimization framework for bulk-synchronous applications in heterogeneous systems. In Proceedings of the 19th International Conference on Parallel Architectures and Compilation Techniques, PACT '10, pages 353-364, New York, NY, USA, 2010. ACM.
[7] Ron Cytron, Jeanne Ferrante, Barry K. Rosen, Mark N. Wegman, and F. Kenneth Zadeck. Efficiently computing static single assignment form and the control dependence graph. ACM Transactions on Programming Languages and Systems, 13(4):451-490, Oct 1991.
[12] NVIDIA. NVIDIA Compute Visual Profiler. NVIDIA Corporation, Santa Clara, California, 1.0 edition, October 2010.
[13] Vishakha Gupta, Ada Gavrilovska, Karsten Schwan, Harshvardhan Kharche, Niraj Tolia, Vanish Talwar, and Parthasarathy Ranganathan. GViM: GPU-accelerated virtual machines. In Proceedings of the 3rd ACM Workshop on System-level Virtualization for High Performance Computing, HPCVirt '09, pages 17-24, New York, NY, USA, 2009. ACM.
[15] Chi-Keung Luk, Robert Cohn, Robert Muth, Harish Patil, Artur Klauser, Geoff Lowney, Steven Wallace, Vijay Janapa Reddi, and Kim Hazelwood. Pin: building customized program analysis tools with dynamic instrumentation. In Proceedings of the 2005 ACM SIGPLAN Conference on Programming Language Design and Implementation, PLDI '05, pages 190-200, New York, NY, USA, 2005. ACM.
[17] Ali Bakhoda, George Yuan, Wilson W. L. Fung, Henry Wong, and Tor M. Aamodt. Analyzing CUDA workloads using a detailed GPU simulator. In IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), Boston, MA, USA, April 2009.
[20] Sunpyo Hong and Hyesoon Kim. An analytical model for a GPU architecture with memory-level and thread-level parallelism awareness. SIGARCH Comput. Archit. News, 37(3):152-163, 2009.