-
2
-
-
70450101161
-
-
GPGPU community website
-
GPGPU community website. http ://www. gpgpu . org.
-
-
-
-
3
-
-
70450117428
-
-
Torch5 library. http://torch5.sourceforge.net.
-
Torch5 library
-
-
-
4
-
-
70449932975
-
-
Advanced Micro Devices, Inc
-
Advanced Micro Devices, Inc. AMD Stream Computing SDK. http://ati.amd.com/technology/ streamcomputing/index.html.
-
AMD Stream Computing SDK
-
-
-
5
-
-
10644248153
-
Brook for GPUs: Stream computing on graphics hardware
-
I. Buck, T. Foley, D. Horn, J. Sugerman, K. Fatahalian, M. Houston, and P. Hanrahan. Brook for GPUs: Stream computing on graphics hardware. In SIGGRAPH '04: ACM SIGGRAPH 2004 Papers, pages 777-786, 2004.
-
(2004)
SIGGRAPH '04: ACM SIGGRAPH 2004 Papers
, pp. 777-786
-
-
Buck, I.1
Foley, T.2
Horn, D.3
Sugerman, J.4
Fatahalian, K.5
Houston, M.6
Hanrahan, P.7
-
7
-
-
77957966090
-
Grading nuclear pleomorphism on histological micrographs
-
E. Cosatto, M. Miller, H. P. Graf, and J. S. Meyer. Grading nuclear pleomorphism on histological micrographs. In Proc. Int. Conf. Pattern Recognition, pages 1-4, 2008.
-
(2008)
Proc. Int. Conf. Pattern Recognition
, pp. 1-4
-
-
Cosatto, E.1
Miller, M.2
Graf, H.P.3
Meyer, J.S.4
-
8
-
-
70450036077
-
-
P. Dubey. A Platform 2015 Workload Model: Recognition, Mining and Synthesis Moves Computers to the Era of Tera, 2007. ftp ://download. intel. com/technology/computing/archinnov/ platform2015/download/RMS.pdf.
-
P. Dubey. A Platform 2015 Workload Model: Recognition, Mining and Synthesis Moves Computers to the Era of Tera, 2007. ftp ://download. intel. com/technology/computing/archinnov/ platform2015/download/RMS.pdf.
-
-
-
-
10
-
-
57349092386
-
CUBA: An architecture for efficient CPU/coprocessor data communication
-
Jun
-
I. Gelado, J. Kelm, S. Ryoo, S. Lumetta, N. Navarro, and W. mei Hwu. CUBA: An architecture for efficient CPU/coprocessor data communication. In ICS '08: Proceedings of the 22nd annual international conference on Supercomput-ing, Jun 2008.
-
(2008)
ICS '08: Proceedings of the 22nd annual international conference on Supercomput-ing
-
-
Gelado, I.1
Kelm, J.2
Ryoo, S.3
Lumetta, S.4
Navarro, N.5
mei Hwu, W.6
-
11
-
-
63549097654
-
Mars : A mapreduce framework for graphics processors
-
October
-
B. He, W. Fang, Q. Luo, N. K. Govindarajulu, and T Wang. Mars : A mapreduce framework for graphics processors. In Proc. Int. Conf. on Parallel Architectures and Compilation Techniques (PACT), October 2008.
-
(2008)
Proc. Int. Conf. on Parallel Architectures and Compilation Techniques (PACT)
-
-
He, B.1
Fang, W.2
Luo, Q.3
Govindarajulu, N.K.4
Wang, T.5
-
12
-
-
34748865391
-
-
T .J. Knight, J. Young, Park, M. Ren, M. Houston, M. Erez, K. Fatahalian, A. Aiken, W. J. Dally, and P. Hanrahan. Compilation for explicitly managed memory hierarchies. In PPoPP '07: Proceedings of the 12th ACM SIGPLAN Symposium on Principles and practice of parallel programming, March 2007.
-
T .J. Knight, J. Young, Park, M. Ren, M. Houston, M. Erez, K. Fatahalian, A. Aiken, W. J. Dally, and P. Hanrahan. Compilation for explicitly managed memory hierarchies. In PPoPP '07: Proceedings of the 12th ACM SIGPLAN Symposium on Principles and practice of parallel programming, March 2007.
-
-
-
-
15
-
-
70450109401
-
-
NVIDIA Corporation. NVIDIA CUDA, 2007. http:// nvidia.com/cuda.
-
NVIDIA Corporation. NVIDIA CUDA, 2007. http:// nvidia.com/cuda.
-
-
-
-
16
-
-
70450115089
-
Toward automatic parallelization and auto-tuning of affine kernels for GPUs
-
July
-
J. Ramanujam. Toward automatic parallelization and auto-tuning of affine kernels for GPUs. In Workshop on Automatic Tuning for Petascale Systems, July 2008.
-
(2008)
Workshop on Automatic Tuning for Petascale Systems
-
-
Ramanujam, J.1
-
18
-
-
79959466764
-
-
S. Ryoo, C. I. Rodrigues, S. S. Baghsorkhi, S. S. Stone, D. B. Kirk, and W. mei W. Hwu. Optimization principles and application performance evaluation of a multithreaded GPU using CUDA. In PPoPP '08: Proceedings of the 13th ACM SIGPLAN Symposium on Principles and practice of parallel programming, pages 73-82, New York, NY, USA, 2008. ACM.
-
S. Ryoo, C. I. Rodrigues, S. S. Baghsorkhi, S. S. Stone, D. B. Kirk, and W. mei W. Hwu. Optimization principles and application performance evaluation of a multithreaded GPU using CUDA. In PPoPP '08: Proceedings of the 13th ACM SIGPLAN Symposium on Principles and practice of parallel programming, pages 73-82, New York, NY, USA, 2008. ACM.
-
-
-
-
19
-
-
43449094719
-
W Hwu. Program optimization space pruning for a multithreaded GPU
-
New York, NY, USA, ACM
-
S. Ryoo, C. I. Rodrigues, S. S. Stone, S. S. Baghsorkhi, S.-Z. Ueng, J. A. Stratton, and W mei W Hwu. Program optimization space pruning for a multithreaded GPU. In CGO '08: Proceedings of the sixth annual IEEE/ACM international symposium on Code generation and optimization, pages 195-204, New York, NY, USA, 2008. ACM.
-
(2008)
CGO '08: Proceedings of the sixth annual IEEE/ACM international symposium on Code generation and optimization
, pp. 195-204
-
-
Ryoo, S.1
Rodrigues, C.I.2
Stone, S.S.3
Baghsorkhi, S.S.4
Ueng, S.-Z.5
Stratton, J.A.6
mei, W.7
-
21
-
-
35448978324
-
EXOCHI: Architecture and programming environment for a heterogeneous multi-core multithreaded system
-
Jun
-
P. Wang, J. Collins, G. Chinya, H. Jiang, X. Tian, M. Girkar, N. Yang, G.-Y Lueh, and H. Wang. EXOCHI: Architecture and programming environment for a heterogeneous multi-core multithreaded system. In PLDI '07: Proceedings of the 2007 ACM SIGPLAN conference on Programming language design and implementation, Jun 2007.
-
(2007)
PLDI '07: Proceedings of the 2007 ACM SIGPLAN conference on Programming language design and implementation
-
-
Wang, P.1
Collins, J.2
Chinya, G.3
Jiang, H.4
Tian, X.5
Girkar, M.6
Yang, N.7
Lueh, G.-Y.8
Wang, H.9
|