-
1
-
-
77954006904
-
-
ATI Stream
-
AMD. ATI Stream. http://www.amd.com.
-
-
-
-
2
-
-
70350641505
-
StarPU: A unified platform for task scheduling on heterogeneous multicore architectures
-
Delft, Netherlands
-
C. Augonnet, S. Thibault, R. Namyst, and P.-A. Wacrenier. StarPU: A Unified Platform for Task Scheduling on Heterogeneous Multicore Architectures. In Euro-Par 2009, pages 863-874, Delft, Netherlands, 2009.
-
Euro-Par 2009
, vol.2009
, pp. 863-874
-
-
Augonnet, C.1
Thibault, S.2
Namyst, R.3
Wacrenier, P.-A.4
-
4
-
-
70450059008
-
Accelerating leukocyte tracking using CUDA: A case study in leveraging manycore coprocessors
-
M. Boyer, D. T., S. A., and K. S. Accelerating leukocyte tracking using CUDA: A case study in leveraging manycore coprocessors. In IPDPS 2009, pages 1-12, 2009.
-
(2009)
IPDPS
, vol.2009
, pp. 1-12
-
-
Boyer, M.1
T, D.2
A, S.3
S, K.4
-
5
-
-
0001782767
-
Parallelization of CHARMM for MIMD machines
-
B. Brooks and H. M. Parallelization of Charmm for MIMD Machines. Chemical Design Automation News, 7(16):16-22, 1992.
-
(1992)
Chemical Design Automation News
, vol.7
, Issue.16
, pp. 16-22
-
-
Brooks, B.1
M, H.2
-
6
-
-
77954018826
-
On dynamic load balancing on graphics processors
-
D. Cederman and P. T. On Dynamic Load Balancing on Graphics Processors. In GH 2008, pages 57-64, 2008.
-
(2008)
GH 2008
, pp. 57-64
-
-
Cederman, D.1
T, P.2
-
7
-
-
0029179685
-
Modeling the benefits of mixed data and task parallelism
-
New York, NY, USA, ACM
-
S. Chakrabarti, J. Demmel, and K. Yelick. Modeling the benefits of mixed data and task parallelism. In SPAA'95, pages 74-83, New York, NY, USA, 1995. ACM.
-
(1995)
SPAA'95
, pp. 74-83
-
-
Chakrabarti, S.1
Demmel, J.2
Yelick, K.3
-
8
-
-
77953970436
-
Parallel molecular dynamics
-
March
-
T. Clark, M. J.A., and S. L.R. Parallel Molecular Dynamics. In SIAMPP'91, pages 338-344, March 1991.
-
(1991)
SIAMPP'91
, pp. 338-344
-
-
Clark, T.1
J, A.M.2
L, R.S.3
-
10
-
-
33750913667
-
KD-tree acceleration structures for a GPU raytracer
-
New York, NY, USA
-
T. Foley and J. Sugerman. KD-tree acceleration structures for a GPU raytracer. In HWWS'05, pages 15-22, New York, NY, USA, 2005.
-
(2005)
HWWS'05
, pp. 15-22
-
-
Foley, T.1
Sugerman, J.2
-
11
-
-
84870726202
-
-
D. Frenkel and B. Smit, editors, Academic Press, Inc., Orlando, FL, USA
-
D. Frenkel and B. Smit, editors. Understanding Molecular Simulation: From Algorithms to Applications. Academic Press, Inc., Orlando, FL, USA, 1996.
-
(1996)
Understanding Molecular Simulation: From Algorithms to Applications
-
-
-
12
-
-
77955990292
-
Enabling task parallelism in the CUDA scheduler
-
M. Guevara, C. Gregg, and S. K. Enabling task parallelism in the CUDA scheduler. In PEMA 2009, 2009.
-
PEMA 2009
, vol.2009
-
-
Guevara, M.1
Gregg, C.2
K, S.3
-
13
-
-
38349041620
-
Accelerating large graph algorithms on the GPU using CUDA
-
P. Harish and N. P.J. Accelerating large graph algorithms on the GPU using CUDA. In HiPC, pages 197-208, 2007.
-
(2007)
HiPC
, pp. 197-208
-
-
Harish, P.1
P, J.N.2
-
14
-
-
0025917643
-
Wait-free synchronization
-
M. Herlihy. Wait-free synchronization. ACM TPLS, 13(1):124-149, 1991.
-
(1991)
ACM TPLS
, vol.13
, Issue.1
, pp. 124-149
-
-
Herlihy, M.1
-
15
-
-
77954019183
-
-
OpenCL
-
Khronos. OpenCL. http://www.khronos.org.
-
-
-
-
16
-
-
67650046428
-
Merge: A programming model for heterogeneous multi-core systems
-
M. D. Linderman, J. D. Collins, H. Wang, and T. H. M. Merge: a programming model for heterogeneous multi-core systems. SIGPLAN Not., 43(3):287-296, 2008.
-
(2008)
SIGPLAN Not.
, vol.43
, Issue.3
, pp. 287-296
-
-
Linderman, M.D.1
Collins, J.D.2
Wang, H.3
H, M.T.4
-
18
-
-
34249052630
-
Adaptive load balancing for raycasting of non-uniformly bricked volumes
-
Parallel Graphics and Visualization
-
C. Müller, M. Strengert, and T. Ertl. Adaptive load balancing for raycasting of non-uniformly bricked volumes. Parallel Computing, 33(6):406-419, 2007. Parallel Graphics and Visualization.
-
(2007)
Parallel Computing
, vol.33
, Issue.6
, pp. 406-419
-
-
Müller, C.1
Strengert, M.2
Ertl, T.3
-
19
-
-
78651550268
-
Scalable parallel programming with CUDA
-
J. Nickolls, I. Buck, M. G., and K. S. Scalable Parallel Programming with CUDA. Queue, 6(2):40-53, 2008.
-
(2008)
Queue
, vol.6
, Issue.2
, pp. 40-53
-
-
Nickolls, J.1
Buck, I.2
G, M.3
S, K.4
-
20
-
-
77953976782
-
-
CUDA
-
Nvidia. CUDA. http://www.nvidia.com.
-
-
-
-
22
-
-
60649087529
-
A task parallel algorithm for computing the costs of all-pairs shortest paths on the CUDA-compatible GPU
-
T. Okuyama, F. I., and K. H. A task parallel algorithm for computing the costs of all-pairs shortest paths on the CUDA-compatible GPU. In ISPA'08, pages 284-291, 2008.
-
(2008)
ISPA'08
, pp. 284-291
-
-
Okuyama, T.1
I, F.2
H, K.3
-
23
-
-
79959466764
-
Optimization principles and application performance evaluation of a multithreaded GPU using CUDA
-
S. Ryoo, C. I. Rodrigues, S. S. Baghsorkhi, S. S. Stone, D. B. Kirk, and W. M. Hwu. Optimization principles and application performance evaluation of a multithreaded GPU using CUDA. In PPoPP'08, pages 73-82, 2008.
-
(2008)
PPoPP'08
, pp. 73-82
-
-
Ryoo, S.1
Rodrigues, C.I.2
Baghsorkhi, S.S.3
Stone, S.S.4
Kirk, D.B.5
Hwu, W.M.6
-
24
-
-
0026120011
-
Molecular dynamics on hypercube parallel computers
-
W. Smith. Molecular dynamics on hypercube parallel computers. Computer Physics Communications, 62:229-248, 1991.
-
(1991)
Computer Physics Communications
, vol.62
, pp. 229-248
-
-
Smith, W.1
-
25
-
-
70350771131
-
Benchmarking GPUs to tune dense linear algebra
-
V. Volkov and J. W. Demmel. Benchmarking GPUs to tune dense linear algebra. In SC 2008, pages 1-11, 2008.
-
(2008)
SC
, vol.2008
, pp. 1-11
-
-
Volkov, V.1
Demmel, J.W.2