-
1
-
-
78651550268
-
Scalable Parallel Programming with CUDA
-
Nickolls, J., Buck, I., Garland, M., Skadron, K.: Scalable Parallel Programming with CUDA. Queue 6(2) (2008) 40-53
-
(2008)
Queue
, vol.6
, Issue.2
, pp. 40-53
-
-
Nickolls, J.1
Buck, I.2
Garland, M.3
Skadron, K.4
-
2
-
-
36148965527
-
JaMP: An Implementation of OpenMP for a Java DSM
-
Klemm, M., Bezold, M., Veldema, R., Philippsen, M.: JaMP: An Implementation of OpenMP for a Java DSM. Concurrency and Computation: Practice and Experience 18(19) (2007) 2333-2352
-
(2007)
Concurrency and Computation: Practice and Experience
, vol.18
, Issue.19
, pp. 2333-2352
-
-
Klemm, M.1
Bezold, M.2
Veldema, R.3
Philippsen, M.4
-
3
-
-
77949526061
-
-
Prentice Hall PTR, Upper Saddle River, NJ
-
Scarpino, M.: Programming the Cell Processor: For Games, Graphics, and Computation. Prentice Hall PTR, Upper Saddle River, NJ (2008)
-
(2008)
Programming the Cell Processor: For Games, Graphics, and Computation
-
-
Scarpino, M.1
-
4
-
-
10644248153
-
Brook for GPUs: Stream computing on graphics hardware
-
Los Angeles, CA
-
Buck, I., Foley, T., Horn, D., Sugerman, J., Fatahalian, K., Houston, M., Hanrahan, P.: Brook for GPUs: stream computing on graphics hardware. In: SIGGRAPH '04, Los Angeles, CA (2004) 777-786
-
(2004)
SIGGRAPH '04
, pp. 777-786
-
-
Buck, I.1
Foley, T.2
Horn, D.3
Sugerman, J.4
Fatahalian, K.5
Houston, M.6
Hanrahan, P.7
-
5
-
-
63149128672
-
Larrabee: A Many-Core x86 Architecture for Visual Computing
-
Seiler, L., Carmean, D., Sprangle, E., Forsyth, T., Dubey, P., Junkins, S., Lake, A., Cavin, R., Espasa, R., Grochowski, E., Juan, T., Abrash, M., Sugerman, J., Hanrahan, P.: Larrabee: A Many-Core x86 Architecture for Visual Computing. IEEE Micro 29(1) (2009) 10-21
-
(2009)
IEEE Micro
, vol.29
, Issue.1
, pp. 10-21
-
-
Seiler, L.1
Carmean, D.2
Sprangle, E.3
Forsyth, T.4
Dubey, P.5
Junkins, S.6
Lake, A.7
Cavin, R.8
Espasa, R.9
Grochowski, E.10
Juan, T.11
Abrash, M.12
Sugerman, J.13
Hanrahan, P.14
-
6
-
-
67650081010
-
OpenMP to GPGPU: A compiler framework for automatic translation and optimization
-
Lee, S., Min, S.J., Eigenmann, R.: OpenMP to GPGPU: a compiler framework for automatic translation and optimization. In: Symp. on Principles and Practice of Parallel Programming, Raleigh, NC (2008) 101-110
-
Symp. on Principles and Practice of Parallel Programming, Raleigh, NC (2008)
, pp. 101-110
-
-
Lee, S.1
Min, S.J.2
Eigenmann, R.3
-
7
-
-
23944431967
-
Automatic scoping of variables in parallel regions of an openmp program
-
Chapman, B.M., ed.: WOMPAT. Springer
-
Lin, Y., Terboven, C., an Mey, D., Copty, N.: Automatic scoping of variables in parallel regions of an openmp program. In Chapman, B.M., ed.: WOMPAT. Volume 3349 of Lecture Notes in Computer Science., Springer (2004) 83-97
-
(2004)
Lecture Notes in Computer Science.
, vol.3349
, pp. 83-97
-
-
Lin, Y.1
Terboven, C.2
An Mey, D.3
Copty, N.4
-
8
-
-
78049344975
-
Java for Numerically Intensive Computing: From Flops to Gigaflops
-
Midkiff, S., Moreira, J., Snir, M.: Java For Numerically Intensive Computing: From Flops To Gigaflops. In: Symp. on the Frontiers of Massively Parallel Computation, Annapolis, MA (1999) 251-261
-
Symp. on the Frontiers of Massively Parallel Computation, Annapolis, MA (1999)
, pp. 251-261
-
-
Midkiff, S.1
Moreira, J.2
Snir, M.3
-
9
-
-
85015692260
-
The pricing of options and corporate liabilities
-
Black, F., Scholes, M.: The pricing of options and corporate liabilities. Journal of Political Economy 81(3) (1973) 637-54
-
(1973)
Journal of Political Economy
, vol.81
, Issue.3
, pp. 637-654
-
-
Black, F.1
Scholes, M.2
-
10
-
-
0000126237
-
Lattice-Gas Cellular Automata and Lattice Boltzmann Models
-
Springer
-
Wolf-Gladrow, D.: Lattice-Gas Cellular Automata and Lattice Boltzmann Models. Number 1725 in Lecture Notes in Mathematics. Springer (2000)
-
(2000)
Lecture Notes in Mathematics
, vol.1725
-
-
Wolf-Gladrow, D.1
-
11
-
-
0031599142
-
Mersenne Twister: A 623-dimensionally Equidistributed Uniform Pseudo-random Number Generator
-
Matsumoto, M., Nishimura, T.: Mersenne Twister: a 623-dimensionally Equidistributed Uniform Pseudo-random Number Generator. ACM Trans. Model. Comput. Simul. 8(1) (1998) 3-30
-
(1998)
ACM Trans. Model. Comput. Simul.
, vol.8
, Issue.1
, pp. 3-30
-
-
Matsumoto, M.1
Nishimura, T.2
-
12
-
-
77954601187
-
-
JCuda. http://www.jcuda.org/
-
-
-
-
13
-
-
50949166640
-
Evaluation and tuning of the Level 3 CUBLAS for graphics processors
-
Barrachina, S., Castillo, M., Igual, F., Mayo, R., Quintana-Orti, E.: Evaluation and tuning of the Level 3 CUBLAS for graphics processors. In: Intl. Parallel and Distributed Processing Symp., Miami, FL (2008) 1-8
-
Intl. Parallel and Distributed Processing Symp., Miami, FL (2008)
, pp. 1-8
-
-
Barrachina, S.1
Castillo, M.2
Igual, F.3
Mayo, R.4
Quintana-Orti, E.5
-
14
-
-
58449109179
-
-
Edmonton, Canada
-
Stratton., J., Stone., S., Hwu, W.M.W.: MCUDA: An Efficient Implementation of CUDA Kernels for Multi-core CPUs, Edmonton, Canada (2008) 16-30
-
(2008)
MCUDA: An Efficient Implementation of CUDA Kernels for Multi-core CPUs
, pp. 16-30
-
-
Stratton, J.1
Stone, S.2
Hwu, W.M.W.3
-
17
-
-
58449127539
-
CUDA-Lite: Reducing GPU Programming Complexity
-
Edmonton, Canada
-
Ueng, S.Z., Lathara, M., Baghsorkhi, S., Hwu, W.M.W.: CUDA-Lite: Reducing GPU Programming Complexity. In: Languages and Compilers for Parallel Computing, Edmonton, Canada (2008) 1-15
-
(2008)
Languages and Compilers for Parallel Computing
, pp. 1-15
-
-
Ueng, S.Z.1
Lathara, M.2
Baghsorkhi, S.3
Hwu, W.M.W.4
-
18
-
-
77954594951
-
-
Khronos. http://www.khronos.org/opencl/
-
-
-
-
20
-
-
79959486752
-
Programming with Tiles
-
Guo, J., Bikshandi, G., Fraguela, B.B., Garzaran, M.J., Padua, D.: Programming with Tiles. In: PPoPP '08: Proc. of the 13th ACM SIGPLAN Symp. on Principles and Practice of Parallel Programming, Salt Lake City, UT (2008) 111-122
-
PPoPP '08: Proc. of the 13th ACM SIGPLAN Symp. on Principles and Practice of Parallel Programming, Salt Lake City, UT (2008)
, pp. 111-122
-
-
Guo, J.1
Bikshandi, G.2
Fraguela, B.B.3
Garzaran, M.J.4
Padua, D.5
|