-
1
-
-
70350662847
-
GpuCV: An opensource GPU-accelerated framework for image processing and computer vision
-
ACM
-
Y. Allusse, P. Horain, A. Agarwal, and C. Saipriyadarshan. GpuCV: an opensource GPU-accelerated framework for image processing and computer vision. In MM '08: Proceeding of the 16th ACM international conference on Multimedia, pages 1089-1092. ACM, 2008.
-
(2008)
MM '08: Proceeding of the 16th ACM International Conference on Multimedia
, pp. 1089-1092
-
-
Allusse, Y.1
Horain, P.2
Agarwal, A.3
Saipriyadarshan, C.4
-
2
-
-
0036683380
-
Towards a general framework for FPGA based image processing using hardware skeletons
-
K. Benkrid, D. Crookes, and A. Benkrid. Towards a general framework for FPGA based image processing using hardware skeletons. Parallel Computing, 28(7-8):1141-1154, 2002.
-
(2002)
Parallel Computing
, vol.28
, Issue.7-8
, pp. 1141-1154
-
-
Benkrid, K.1
Crookes, D.2
Benkrid, A.3
-
6
-
-
80155175370
-
High-performance SIMT code generation in an active visual effects library
-
ACM
-
J. L. Cornwall, L. Howes, P. H. Kelly, P. Parsonage, and B. Nicoletti. High-performance SIMT code generation in an active visual effects library. In CF '09: Proceedings of the 6th ACM conference on Computing frontiers, pages 175-184. ACM, 2009.
-
(2009)
CF '09: Proceedings of the 6th ACM Conference on Computing Frontiers
, pp. 175-184
-
-
Cornwall, J.L.1
Howes, L.2
Kelly, P.H.3
Parsonage, P.4
Nicoletti, B.5
-
7
-
-
78349252088
-
SkePU: A multi-backend skeleton programming library for multi-GPU systems
-
New York, NY, USA, ACM
-
J. Enmyren and C. W. Kessler. SkePU: a multi-backend skeleton programming library for multi-GPU systems. In Proceedings of the fourth international workshop on High-level parallel programming and applications, HLPP '10, pages 5-14, New York, NY, USA, 2010. ACM.
-
(2010)
Proceedings of the Fourth International Workshop on High-level Parallel Programming and Applications, HLPP '10
, pp. 5-14
-
-
Enmyren, J.1
Kessler, C.W.2
-
8
-
-
78149258346
-
Understanding throughput-oriented architectures
-
November
-
M. Garland and D. B. Kirk. Understanding throughput-oriented architectures. Communications of the ACM, 53:58-66, November 2010.
-
(2010)
Communications of the ACM
, vol.53
, pp. 58-66
-
-
Garland, M.1
Kirk, D.B.2
-
11
-
-
79955675214
-
A design pattern language for engineering (parallel) software
-
K. Keutzer and T. Mattson. A Design Pattern Language for Engineering (Parallel) Software. In Intel Technology Journal, 2010.
-
(2010)
Intel Technology Journal
-
-
Keutzer, K.1
Mattson, T.2
-
13
-
-
84937421176
-
Automatic SIMD parallelization of embedded applications based on pattern recognition
-
A. Bode, T. Ludwig, W. Karl, and R. Wismuller, editors
-
R. Manniesing, I. Karkowski, and H. Corporaal. Automatic SIMD Parallelization of Embedded Applications Based on Pattern Recognition. In A. Bode, T. Ludwig, W. Karl, and R. Wismuller, editors, Euro-Par 2000 Parallel Processing, pages 349-356, 2000.
-
(2000)
Euro-Par 2000 Parallel Processing
, pp. 349-356
-
-
Manniesing, R.1
Karkowski, I.2
Corporaal, H.3
-
15
-
-
80955152874
-
The top 10 innovations in the new Fermi architecture, and the top 3 next challenges
-
D. Patterson. The Top 10 Innovations in the New Fermi Architecture, and the Top 3 Next Challenges. NVIDIA Whitepaper, 2009.
-
(2009)
NVIDIA Whitepaper
-
-
Patterson, D.1
-
18
-
-
72449173321
-
A skeletal parallel framework with fusion optimizer for GPGPU programming
-
Z. Hu, editor, Programming Languages and Systems. Springer Berlin Heidelberg
-
S. Sato and H. Iwasaki. A Skeletal Parallel Framework with Fusion Optimizer for GPGPU Programming. In Z. Hu, editor, Programming Languages and Systems, volume 5904 of Lecture Notes in Computer Science, pages 79-94. Springer Berlin Heidelberg, 2009.
-
(2009)
Lecture Notes in Computer Science
, vol.5904
, pp. 79-94
-
-
Sato, S.1
Iwasaki, H.2
-
19
-
-
78650298274
-
GPGPU kernel implementation and refinement using Obsidian
-
ICCS 2010
-
J. Svensson, K. Claessen, and M. Sheeran. GPGPU kernel implementation and refinement using Obsidian. Procedia Computer Science, 1(1):2059-2068, 2010. ICCS 2010.
-
(2010)
Procedia Computer Science
, vol.1
, Issue.1
, pp. 2059-2068
-
-
Svensson, J.1
Claessen, K.2
Sheeran, M.3
-
20
-
-
80155175375
-
-
TunaCode
-
TunaCode. CUVIlib. http://www.cuvilib.com.
-
-
-
-
21
-
-
58449127539
-
CUDA-Lite: Reducing GPU programming complexity
-
Languages and Compilers for Parallel Computing. Springer Berlin
-
S.-Z. Ueng, M. Lathara, S. Baghsorkhi, and W.-m. Hwu. CUDA-Lite: Reducing GPU Programming Complexity. In Languages and Compilers for Parallel Computing, volume 5335 of Lecture Notes in Computer Science, pages 1-15. Springer Berlin, 2008.
-
(2008)
Lecture Notes in Computer Science
, vol.5335
, pp. 1-15
-
-
Ueng, S.-Z.1
Lathara, M.2
Baghsorkhi, S.3
Hwu, W.-M.4
|