-
1
-
-
70649099919
-
-
NVIDIA, 2nd ed. NVIDIA Corporation, Santa Clara, California, October
-
NVIDIA, NVIDIA CUDA SDK 2.1, 2nd ed., NVIDIA Corporation, Santa Clara, California, October 2008.
-
(2008)
NVIDIA CUDA SDK 2.1
-
-
-
2
-
-
70649109789
-
-
K. O. W. Group, December, [Online]. Available
-
K. O. W. Group, The OpenCL Specication, December 2008. [Online]. Available: http://www.khronos.org/registry/cl/specs/opencl-1.0.29.pdf.
-
(2008)
The OpenCL Specication
-
-
-
3
-
-
70449873244
-
Ct: A flexible parallel programming model for tera-scale architectures
-
October
-
A. Ghuloum, E. Sprangle, J. Fang, G. Wu, and X. Zhou, "Ct: A flexible parallel programming model for tera-scale architectures, " Intel Technology Journal, vol. 11, no. 4, October 2007.
-
(2007)
Intel Technology Journal
, vol.11
, Issue.4
-
-
Ghuloum, A.1
Sprangle, E.2
Fang, J.3
Wu, G.4
Zhou, X.5
-
4
-
-
70649102016
-
-
NVIDIA, NVIDIA Corporation, Santa Clara, California, October
-
NVIDIA, NVIDIA Compute PTX: Parallel Thread Execution, 1st ed., NVIDIA Corporation, Santa Clara, California, October 2008.
-
(2008)
NVIDIA Compute PTX: Parallel Thread Execution, 1st Ed
-
-
-
5
-
-
67650668134
-
VSIPL 1.3 api
-
D. Schwartz, R. Judd, W. Harrod, and D. Manley, "VSIPL 1.3 api, " VSIPL Forum, Tech. Rep., 2008.
-
(2008)
VSIPL Forum, Tech. Rep
-
-
Schwartz, D.1
Judd, R.2
Harrod, W.3
Manley, D.4
-
6
-
-
54749089017
-
Relational joins on graphics processors
-
New York, NY, USA: ACM
-
B. He, K. Yang, R. Fang, M. Lu, N. Govindaraju, Q. Luo, and P. Sander, "Relational joins on graphics processors, " in SIGMOD '08: Proceedings of the 2008 ACM SIGMOD international conference on Management of data. New York, NY, USA: ACM, 2008, pp. 511-524.
-
(2008)
SIGMOD '08: Proceedings of the 2008 ACM SIGMOD International Conference on Management of Data
, pp. 511-524
-
-
He, B.1
Yang, K.2
Fang, R.3
Lu, M.4
Govindaraju, N.5
Luo, Q.6
Sander, P.7
-
7
-
-
85088003777
-
Gpu computing with nvidia CUDA
-
New York, NY, USA: ACM
-
I. Buck, "Gpu computing with nvidia CUDA, " in SIGGRAPH '07: ACM SIGGRAPH 2007 courses. New York, NY, USA: ACM, 2007, p. 6.
-
(2007)
SIGGRAPH '07: ACM SIGGRAPH 2007 Courses
, pp. 6
-
-
Buck, I.1
-
9
-
-
70649096881
-
-
Tech. Rep. hal-00359342
-
S. Collange, D. Defour, and D. Parello, "Barra, a modular functional gpu simulator for gpgpu, " Tech. Rep. hal-00359342, 2009.
-
(2009)
Barra, A Modular Functional Gpu Simulator for Gpgpu
-
-
Collange, S.1
Defour, D.2
Parello, D.3
-
10
-
-
70349169075
-
Analyzing CUDA workloads using a detailed GPU simulator
-
Boston, MA, USA, April
-
A. Bakhoda, G. Yuan, W. W. L. Fung, H. Wong, and T. M. Aamodt, "Analyzing CUDA workloads using a detailed GPU simulator, " in IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), Boston, MA, USA, April 2009.
-
(2009)
IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS)
-
-
Bakhoda, A.1
Yuan, G.2
Fung, W.W.L.3
Wong, H.4
Aamodt, T.M.5
-
12
-
-
53749087821
-
Mcuda: An efficient implementation of CUDA kernels on multi-cores
-
March, [Online]. Available
-
J. Stratton, S. Stone, and W. mei Hwu, "Mcuda: An efficient implementation of CUDA kernels on multi-cores, " University of Illinois at Urbana-Champaign, Tech. Rep. IMPACT-08-01, March 2008. [Online]. Available: http://www.gigascale.org/pubs/1278.html.
-
(2008)
University of Illinois at Urbana-Champaign, Tech. Rep. IMPACT-08-01
-
-
Stratton, J.1
Stone, S.2
Hwu, W.M.3
-
13
-
-
70649094184
-
-
Tech. Rep. 0901, January, [Online]. Available
-
G. Diamos, A. Kerr, and M. Kesavan, "Translating gpu binaries to tiered simd architectures with ocelot, " Tech. Rep. 0901, January 2009. [Online]. Availablehttp://www.cercs.gatech.edu/tech-reports/tr2009/abstracts/01. html.
-
(2009)
Translating Gpu Binaries to Tiered Simd Architectures with Ocelot
-
-
Diamos, G.1
Kerr, A.2
Kesavan, M.3
-
15
-
-
30744459395
-
Rpu: A programmable ray processing unit for realtime ray tracing
-
July, [Online]. Available
-
J. S. Sven Woop and P. Slusallek, "Rpu: A programmable ray processing unit for realtime ray tracing, " in Proceedings of ACM SIGGRAPH 2005, July 2005. [Online]. Available: http://www.saarcor.de/.
-
(2005)
Proceedings of ACM SIGGRAPH 2005
-
-
Woop, J.S.S.1
Slusallek, P.2
-
16
-
-
67650692011
-
-
[Online]. Available
-
IMPACT, "The parboil benchmark suite, " 2007. [Online]. Available: http://www.crhc.uiuc.edu/IMPACT/parboil.php.
-
(2007)
The Parboil Benchmark Suite
-
-
-
18
-
-
67650699933
-
GPU vsipl: High-performance vsipl implementation for GPUs
-
Lexington, MA, USA
-
A. Kerr, D. Campbell, and M. Richards, "GPU vsipl: High-performance vsipl implementation for GPUs, " in HPEC'08: High Performance Em- bedded Computing Workshop, Lexington, MA, USA, 2008.
-
(2008)
HPEC'08: High Performance Em- Bedded Computing Workshop
-
-
Kerr, A.1
Campbell, D.2
Richards, M.3
-
19
-
-
47349104432
-
Dynamic warp formation and scheduling for efficient GPU control flow
-
Washington, DC, USA: IEEE Computer Society
-
W. W. L. Fung, I. Sham, G. Yuan, and T. M. Aamodt, "Dynamic warp formation and scheduling for efficient GPU control flow, " in MICRO '07: Proceedings of the 40th Annual IEEE/ACM International Symposium on Microarchitecture. Washington, DC, USA: IEEE Computer Society, 2007, pp. 407-420.
-
(2007)
MICRO '07: Proceedings of the 40th Annual IEEE/ACM International Symposium on Microarchitecture
, pp. 407-420
-
-
Fung, W.W.L.1
Sham, I.2
Yuan, G.3
Aamodt, T.M.4
-
20
-
-
64549155924
-
-
NVIDIA, 2nd ed. NVIDIA Corporation, Santa Clara, California, October
-
NVIDIA, NVIDIA CUDA Compute Unied Device Architecture, 2nd ed., NVIDIA Corporation, Santa Clara, California, October 2008.
-
(2008)
NVIDIA CUDA Compute Unied Device Architecture
-
-
-
21
-
-
56449089553
-
Characterizing and improving the performance of the intel threading building blocks runtime system
-
September, [Online]. Available
-
G. Contreras and M. Martonosi, "Characterizing and improving the performance of the intel threading building blocks runtime system, " in International Symposium on Workload Characterization (IISWC 2008), September 2008. [Online]. Available: http://www.gigascale.org/pubs/1350.html.
-
(2008)
International Symposium on Workload Characterization (IISWC 2008)
-
-
Contreras, G.1
Martonosi, M.2
|