-
1
-
-
0025256474
-
A Simple Randomized Parallel Algorithm for List-Ranking
-
ANDERSON, R. J., AND MILLER, G. L. A Simple Randomized Parallel Algorithm for List-Ranking. Information Processing Letters 33, 5 (1990), 269-273.
-
(1990)
Information Processing Letters
, vol.33
, Issue.5
, pp. 269-273
-
-
Anderson, R.J.1
Miller, G.L.2
-
2
-
-
34548718683
-
On the Design and Analysis of Irregular Algorithms on the Cell Processor: A Case Study of List Ranking
-
BADER, D. A., AGARWAL, V., AND MADDURI, K. On the Design and Analysis of Irregular Algorithms on the Cell Processor: A Case Study of List Ranking. In Proc. of IEEE IPDPS (2007), pp. 1-10.
-
Proc. of IEEE IPDPS (2007)
, pp. 1-10
-
-
Bader, D.A.1
Agarwal, V.2
Madduri, K.3
-
3
-
-
0024684158
-
Faster Optimal Parallel Prefix sums and List Ranking
-
COLE, R., AND VISHKIN, U. Faster Optimal Parallel Prefix sums and List Ranking. Information and Computation 81, 3 (1989), 334-352.
-
(1989)
Information and Computation
, vol.81
, Issue.3
, pp. 334-352
-
-
Cole, R.1
Vishkin, U.2
-
4
-
-
0009346826
-
LogP: Towards a Realistic Model of Parallel Computation
-
CULLER, D., KARP, R., PATTERSON, D., A. SAHAY, K. E. S., SANTOS, E., SUBRAMONIAN, R., AND VON EICKEN, T. LogP: Towards a Realistic Model of Parallel Computation. In Proc. ACM PPoPP (1993), pp. 1-12.
-
Proc. ACM PPoPP (1993)
, pp. 1-12
-
-
Culler, D.1
Karp, R.2
Patterson, D.3
A Sahay, K.E.S.4
Santos, E.5
Subramonian, R.6
Von Eicken, T.7
-
7
-
-
0032107941
-
The Queue-Read Queue-Write PRAM Model: Accounting for Contention in Parallel Algorithms
-
GIBBONS, P. B., MATIAS, Y., AND RAMACHANDRAN, V. The Queue-Read Queue-Write PRAM Model: Accounting for Contention in Parallel Algorithms. SIAM J. Comp. 28, 2 (1999), 733-769.
-
(1999)
SIAM J. Comp.
, vol.28
, Issue.2
, pp. 733-769
-
-
Gibbons, P.B.1
Matias, Y.2
Ramachandran, V.3
-
8
-
-
35948931417
-
Cache-efficient numerical algorithms using graphics hardware
-
DOI 10.1016/j.parco.2007.09.006, PII S0167819107001056, High-Performance Computing Using Accelerators
-
GOVINDARAJU, N., AND MANOCHA, D. Cache-efficient Numerical Algorithms using Graphics Hardware. Parallel Computing 33, 10-11 (2007), 663-684. (Pubitemid 350064315)
-
(2007)
Parallel Computing
, vol.33
, Issue.10-11
, pp. 663-684
-
-
Govindaraju, N.K.1
Manocha, D.2
-
9
-
-
58349086140
-
Memory Locality Exploitation Strategies for FFT on the CUDA Architecture
-
GUTIERREZ, E., ROMERO, S., TRENAS, M. A., AND ZAPATA, E. L. Memory Locality Exploitation Strategies for FFT on the CUDA Architecture. In Proc. of High Perf. Comp. for Comp. Sci. (2008), pp. 430-443.
-
Proc. of High Perf. Comp. for Comp. Sci. (2008)
, pp. 430-443
-
-
Gutierrez, E.1
Romero, S.2
Trenas, M.A.3
Zapata, E.L.4
-
11
-
-
84979025439
-
Designing Practical Efficient Algorithms for Symmetric Multiprocessors
-
HELMAN, D. R., AND JÀ JÀ, J. Designing Practical Efficient Algorithms for Symmetric Multiprocessors. In Proc. ALENEX (1999), pp. 37-56.
-
Proc. ALENEX (1999)
, pp. 37-56
-
-
Helman, D.R.1
Jàjà, J.2
-
12
-
-
70450231944
-
An Analytical Model for a GPU Architecture with Memory-Level and Thread-Level Parallelism Awareness
-
ACM
-
HONG, S., AND KIM, H. An Analytical Model for a GPU Architecture with Memory-Level and Thread-Level Parallelism Awareness. In ISCA '09: Proceedings of the 36th Annual International Symposium on Computer Architecture (New York, NY, USA, 2009), ACM, pp. 152-163.
-
ISCA '09: Proceedings of the 36th Annual International Symposium on Computer Architecture (New York, NY, USA, 2009)
, pp. 152-163
-
-
Hong, S.1
Kim, H.2
-
16
-
-
70449723385
-
Performance Modeling and Automatic Ghost Zone Optimization for Iterative Stencil Loops on GPUs
-
ACM
-
MENG, J., AND SKADRON, K. Performance Modeling and Automatic Ghost Zone Optimization for Iterative Stencil Loops on GPUs. In ICS '09: Proceedings of the 23rd international conference on Supercomputing (New York, NY, USA, 2009), ACM, pp. 256-265.
-
ICS '09: Proceedings of the 23rd International Conference on Supercomputing (New York, NY, USA, 2009)
, pp. 256-265
-
-
Meng, J.1
Skadron, K.2
-
17
-
-
55649109070
-
-
Addison-Wesley Professional
-
NGUYEN, H. GPU Gems 3. Addison-Wesley Professional, 2007.
-
(2007)
GPU Gems 3
-
-
Nguyen, H.1
-
18
-
-
78651550268
-
Scalable Parallel Programming with CUDA
-
NICKOLLS, J., BUCK, I., GARLAND, M., AND SKADRON, K. Scalable Parallel Programming with CUDA. ACM Queue 6, 2 (2008), 40-53.
-
(2008)
ACM Queue
, vol.6
, Issue.2
, pp. 40-53
-
-
Nickolls, J.1
Buck, I.2
Garland, M.3
Skadron, K.4
-
21
-
-
70449700267
-
Fast and Scalable List Ranking on the GPU
-
ACM
-
REHMAN, M. S., KOTHAPALLI, K., AND NARAYANAN, P. J. Fast and Scalable List Ranking on the GPU. In ICS '09: Proceedings of the 23rd International Conference on Supercomputing (New York, NY, USA, 2009), ACM, pp. 235-243.
-
ICS '09: Proceedings of the 23rd International Conference on Supercomputing (New York, NY, USA, 2009)
, pp. 235-243
-
-
Rehman, M.S.1
Kothapalli, K.2
Narayanan, P.J.3
-
22
-
-
43449094719
-
Program Optimization Space Pruning for a Multithreaded GPU
-
RYOO, S., RODRIGUES, C. I., STONE, S., BAGHSORKHI, S. S., UENG, S.-Z., STRATTON, J. A., AND HWU, W. W. Program Optimization Space Pruning for a Multithreaded GPU. In Proc. the Intl. Symp. Code Gen. and Opt. (2008), pp. 195-204.
-
Proc. the Intl. Symp. Code Gen. and Opt. (2008)
, pp. 195-204
-
-
Ryoo, S.1
Rodrigues, C.I.2
Stone, S.3
Baghsorkhi, S.S.4
Ueng, S.-Z.5
Stratton, J.A.6
Hwu, W.W.7
-
23
-
-
70449793037
-
Exploring the Multiple-GPU Design Space
-
IEEE Computer Society
-
SCHAA, D., AND KAELI, D. Exploring the Multiple-GPU Design Space. In IPDPS '09: Proceedings of the 2009 IEEE International Symposium on Parallel & Distributed Processing (Washington, DC, USA, 2009), IEEE Computer Society, pp. 1-12.
-
IPDPS '09: Proceedings of the 2009 IEEE International Symposium on Parallel & Distributed Processing (Washington, DC, USA, 2009)
, pp. 1-12
-
-
Schaa, D.1
Kaeli, D.2
-
24
-
-
78651284120
-
Scan Primitives for GPU Computing
-
SENGUPTA, S., HARRIS, M., ZHANG, Y., AND OWENS, J. D. Scan Primitives for GPU Computing. In Proc. ACM Symp. Graphics Hardware (2007), pp. 97-106.
-
Proc. ACM Symp. Graphics Hardware (2007)
, pp. 97-106
-
-
Sengupta, S.1
Harris, M.2
Zhang, Y.3
Owens, J.D.4
-
25
-
-
0025467711
-
A Bridging Model for Parallel Computation
-
VALIANT, L. G. A Bridging Model for Parallel Computation. Comm. ACM 33, 8 (1990), 103-111.
-
(1990)
Comm. ACM
, vol.33
, Issue.8
, pp. 103-111
-
-
Valiant, L.G.1
-
27
-
-
0242424254
-
Hardware-Based Nonlinear Filtering and Segmentation using High-Level Shading Languages
-
VIOLA, I., KANITSAR, A., AND GROLLER, E. Hardware-Based Nonlinear Filtering and Segmentation using High-Level Shading Languages. In Proc. IEEE Visualization (2003), pp. 309-316.
-
Proc. IEEE Visualization (2003)
, pp. 309-316
-
-
Viola, I.1
Kanitsar, A.2
Groller, E.3
|