-
1
-
-
85092760065
-
-
CUDA Programming Guide 3.1. June
-
CUDA Programming Guide 3.1. http://developer. download.nvidia.com/compute/cuda/3_1/toolkit/ docs/NVIDIA_CUDA_C_ProgrammingGuide_3.1.pdf, June 2010.
-
(2010)
-
-
-
2
-
-
70349169075
-
Analyzing cuda workloads using a detailed gpu simulator
-
174, april
-
A. Bakhoda, G. Yuan, W. Fung, H. Wong, and T. Aamodt. Analyzing cuda workloads using a detailed gpu simulator. In Performance Analysis of Systems and Software, 2009. ISPASS 2009. IEEE International Symposium on, pages 163 {174, april 2009.
-
(2009)
Performance Analysis of Systems and Software, 2009. ISPASS 2009. IEEE International Symposium on
, pp. 163
-
-
Bakhoda, A.1
Yuan, G.2
Fung, W.3
Wong, H.4
Aamodt, T.5
-
3
-
-
77953985375
-
Dynamic load balancing on single- and multi-gpu systems
-
12, april
-
L. Chen, O. Villa, S. Krishnamoorthy, and G. Gao. Dynamic load balancing on single- and multi-gpu systems. In Parallel Distributed Processing (IPDPS), 2010 IEEE International Symposium on, pages 1 {12, april 2010.
-
(2010)
Parallel Distributed Processing (IPDPS), 2010 IEEE International Symposium on
, pp. 1
-
-
Chen, L.1
Villa, O.2
Krishnamoorthy, S.3
Gao, G.4
-
5
-
-
85092795092
-
CUDA optimization strategies for compute-and memory-bound neuroimaging algorithms
-
December
-
D. Lee, I. Dinov, B. Dong, B. Gutman, I. Yanovsky, and A. Toga. CUDA optimization strategies for compute-and memory-bound neuroimaging algorithms. Computer methods and programs in biomedicine, pages 1{13, December 2010.
-
(2010)
Computer methods and programs in biomedicine
, pp. 1-13
-
-
Lee, D.1
Dinov, I.2
Dong, B.3
Gutman, B.4
Yanovsky, I.5
Toga, A.6
-
6
-
-
80155183121
-
Gpu resource sharing and virtualization on high performance computing systems
-
742, sept
-
T. Li, V. Narayana, E. El-Araby, and T. El-Ghazawi. Gpu resource sharing and virtualization on high performance computing systems. In Parallel Processing (ICPP), 2011 International Conference on, pages 733 {742, sept. 2011.
-
(2011)
Parallel Processing (ICPP), 2011 International Conference on
, pp. 733
-
-
Li, T.1
Narayana, V.2
El-Araby, E.3
El-Ghazawi, T.4
-
7
-
-
79953080838
-
Kernel fusion: An e_ective method for better power e_ciency on multithreaded gpu
-
350, dec
-
G. Wang, Y. Lin, and W. Yi. Kernel fusion: An e_ective method for better power e_ciency on multithreaded gpu. In Green Computing and Communications (GreenCom), 2010 IEEE/ACM Int'l Conference on Int'l Conference on Cyber, Physical and Social Computing (CPSCom), pages 344 {350, dec. 2010.
-
(2010)
Green Computing and Communications (GreenCom), 2010 IEEE/ACM Int'l Conference on Int'l Conference on Cyber, Physical and Social Computing (CPSCom)
, pp. 344
-
-
Wang, G.1
Lin, Y.2
Yi, W.3
-
8
-
-
80052985746
-
Exploiting concurrent kernel execution on graphic processing units
-
32, july
-
L. Wang, M. Huang, and T. El-Ghazawi. Exploiting concurrent kernel execution on graphic processing units. In High Performance Computing and Simulation (HPCS), 2011 International Conference on, pages 24 {32, july 2011.
-
(2011)
High Performance Computing and Simulation (HPCS), 2011 International Conference on
, pp. 24
-
-
Wang, L.1
Huang, M.2
El-Ghazawi, T.3
|