-
1
-
-
84866429900
-
-
Parboil Benchmark Suite
-
Parboil Benchmark Suite. http://impact.crhc.illinois.edu/parboil.php, 2011.
-
(2011)
-
-
-
2
-
-
78650145768
-
Lime: A Java-compatible and synthesizable language for heterogeneous architectures
-
J. Auerbach, D. F. Bacon, P. Cheng, and R. Rabbah. Lime: A Java-compatible and synthesizable language for heterogeneous architectures. In OOPSLA, 2010.
-
(2010)
OOPSLA
-
-
Auerbach, J.1
Bacon, D.F.2
Cheng, P.3
Rabbah, R.4
-
3
-
-
84877609547
-
Brook for GPUs: Stream computing on graphics hardware
-
I. Buck, T. Foley, D. Horn, J. Sugerman, K. Fatahalian, M. Houston, and P. Hanrahan. Brook for GPUs: Stream computing on graphics hardware. In SIGGRAPH, 2004.
-
(2004)
SIGGRAPH
-
-
Buck, I.1
Foley, T.2
Horn, D.3
Sugerman, J.4
Fatahalian, K.5
Houston, M.6
Hanrahan, P.7
-
4
-
-
84862632175
-
GPU programming in a high level language: Compiling X10 to CUDA
-
D. Cunningham, R. Bordawekar, and V. Saraswat. GPU programming in a high level language: Compiling X10 to CUDA. In X10 Workshop, 2011.
-
(2011)
X10 Workshop
-
-
Cunningham, D.1
Bordawekar, R.2
Saraswat, V.3
-
5
-
-
34547423880
-
Exploiting coarse-grained task, data, and pipeline parallelism in stream programs
-
M. I. Gordon, W. Thies, and S. Amarasinghe. Exploiting coarse-grained task, data, and pipeline parallelism in stream programs. In ASPLOS, 2006.
-
(2006)
ASPLOS
-
-
Gordon, M.I.1
Thies, W.2
Amarasinghe, S.3
-
7
-
-
79953071805
-
Sponge: Portable stream programming on graphics engines
-
A. H. Hormati, M. Samadi, M. Woh, T. Mudge, and S. Mahlke. Sponge: Portable stream programming on graphics engines. In ASPLOS, 2011.
-
(2011)
ASPLOS
-
-
Hormati, A.H.1
Samadi, M.2
Woh, M.3
Mudge, T.4
Mahlke, S.5
-
8
-
-
79959904195
-
Automatic CPU-GPU communication management and optimization
-
T. B. Jablin, P. Prabhu, J. A. Jablin, N. P. Johnson, S. R. Beard, and D. I. August. Automatic CPU-GPU communication management and optimization. In PLDI, 2011.
-
(2011)
PLDI
-
-
Jablin, T.B.1
Prabhu, P.2
Jablin, J.A.3
Johnson, N.P.4
Beard, S.R.5
August, D.I.6
-
9
-
-
70349100958
-
-
Khronos OpenCL Working Group
-
Khronos OpenCL Working Group. The OpenCL Specification.
-
The OpenCL Specification
-
-
-
10
-
-
67650081010
-
OpenMP to GPGPU: A compiler framework for automatic translation and optimization
-
S. Lee, S.-J. Min, and R. Eigenmann. OpenMP to GPGPU: A compiler framework for automatic translation and optimization. In PPoPP, 2009.
-
(2009)
PPoPP
-
-
Lee, S.1
Min, S.-J.2
Eigenmann, R.3
-
11
-
-
76749140917
-
Qilin: Exploiting parallelism on heterogeneous multiprocessors with adaptive mapping
-
C.-K. Luk, S. Hong, and H. Kim. Qilin: Exploiting parallelism on heterogeneous multiprocessors with adaptive mapping. In MICRO, 2009.
-
(2009)
MICRO
-
-
Luk, C.-K.1
Hong, S.2
Kim, H.3
-
13
-
-
0002412041
-
Analysis and development of Java Grande benchmarks
-
New York, NY, USA, ACM
-
J. A. Mathew, P. D. Coddington, and K. A. Hawick. Analysis and development of Java Grande benchmarks. In Proceedings of the ACM 1999 conference on Java Grande, JAVA '99, pp. 72-80, New York, NY, USA, 1999. ACM.
-
(1999)
Proceedings of the ACM 1999 Conference on Java Grande, JAVA '99, pp. 72-80
-
-
Mathew, J.A.1
Coddington, P.D.2
Hawick, K.A.3
-
14
-
-
79957475280
-
Intel's Array Building Blocks: A retargetable, dynamic compiler and embedded language
-
C. Newburn, B. So, Z. Liu, M. McCool, A. Ghuloum, S. Toit, Z. G. Wang, Z. H. Du, Y. Chen, G. Wu, P. Guo, Z. Liu, and D. Zhang. Intel's Array Building Blocks: A retargetable, dynamic compiler and embedded language. In CGO, 2011.
-
(2011)
CGO
-
-
Newburn, C.1
So, B.2
Liu, Z.3
McCool, M.4
Ghuloum, A.5
Toit, S.6
Wang, Z.G.7
Du, Z.H.8
Chen, Y.9
Wu, G.10
Guo, P.11
Liu, Z.12
Zhang, D.13
-
16
-
-
63349107315
-
SoC-C: Efficient programming abstractions for heterogeneous multicore systems on chip
-
A. D. Reid, K. Flautner, E. Grimley-Evans, and Y. Lin. SoC-C: Efficient programming abstractions for heterogeneous multicore systems on chip. In CASES, 2008.
-
(2008)
CASES
-
-
Reid, A.D.1
Flautner, K.2
Grimley-Evans, E.3
Lin, Y.4
-
17
-
-
79959466764
-
Optimization principles and application performance evaluation of a multithreaded GPU using CUDA
-
S. Ryoo, C. I. Rodrigues, S. S. Baghsorkhi, S. S. Stone, D. B. Kirk, and W.-m. W. Hwu. Optimization principles and application performance evaluation of a multithreaded GPU using CUDA. In PPoPP, 2008.
-
(2008)
PPoPP
-
-
Ryoo, S.1
Rodrigues, C.I.2
Baghsorkhi, S.S.3
Stone, S.S.4
Kirk, D.B.5
Hwu, W.-M.W.6
-
18
-
-
33947595619
-
Accelerator: Using data parallelism to program GPUs for general-purpose uses
-
D. Tarditi, S. Puri, and J. Oglesby. Accelerator: Using data parallelism to program GPUs for general-purpose uses. In ASPLOS, 2006.
-
(2006)
ASPLOS
-
-
Tarditi, D.1
Puri, S.2
Oglesby, J.3
-
22
-
-
35448978324
-
EXOCHI: Architecture and programming environment for a heterogeneous multi-core multithreaded system
-
P. H. Wang, J. D. Collins, G. N. Chinya, H. Jiang, X. Tian, M. Girkar, N. Y. Yang, G.-Y. Lueh, and H. Wang. EXOCHI: Architecture and programming environment for a heterogeneous multi-core multithreaded system. In PLDI, 2007.
-
(2007)
PLDI
-
-
Wang, P.H.1
Collins, J.D.2
Chinya, G.N.3
Jiang, H.4
Tian, X.5
Girkar, M.6
Yang, N.Y.7
Lueh, G.-Y.8
Wang, H.9
-
23
-
-
77954691442
-
A GPGPU compiler for memory optimization and parallelism management
-
Y. Yang, P. Xiang, J. Kong, and H. Zhou. A GPGPU compiler for memory optimization and parallelism management. In PLDI, 2010.
-
(2010)
PLDI
-
-
Yang, Y.1
Xiang, P.2
Kong, J.3
Zhou, H.4