SCOPUS Information Retrieval Platform

Proceedings of the International Conference on Supercomputing

2009, Pages 235-243

Fast and scalable list ranking on the GPU

Rehman, M. Suhail (a); Kothapalli, Kishore (a, b); Narayanan, P. J. (a, c)

a International Institute of Information Technology (India)

b Center for Security Theory and Algorithmic Research International (India)

c Center for Visual Information Technology (India)

Author keywords

GPGPU; Irregular algorithm; List ranking; Many core; Parallel algorithm

Indexed keywords

BALANCED LOADS; CELL BROADBAND ENGINE; GENERAL PURPOSE PROGRAMMING; GRAPHICS PROCESSING UNIT; IRREGULAR ALGORITHM; LIST RANKING; MANY-CORE; MEMORY ACCESS; MULTITHREADED; MULTITHREADED ARCHITECTURE; PARALLEL COMPUTING; PRACTICAL ISSUES;

COMPUTER GRAPHICS EQUIPMENT; INTELLIGENT CONTROL; PARALLEL ARCHITECTURES; PROGRAM PROCESSORS;

PARALLEL ALGORITHMS;

EID: 70449700267
PISSN: None
EISSN: None
Source Type: Conference Proceeding
DOI: 10.1145/1542275.1542311
Document Type: Conference Paper

Times cited: 20

References (21)

1
- 0025256474
- A Simple Randomized Parallel Algorithm for List-Ranking
- R. J. Anderson and G. L. Miller. A Simple Randomized Parallel Algorithm for List-Ranking. Information Processing Letters, 33(5):269-273, 1990.
- (1990) Information Processing Letters , vol.33 , Issue.5 , pp. 269-273
- Anderson, R.J.¹ Miller, G.L.²

2
- 33745125067
- On the Architectural Requirements for Efficient Execution of Graph Algorithms
- D. Bader, G. Cong, and J. Feo. On the Architectural Requirements for Efficient Execution of Graph Algorithms. International Conference on Parallel Processing (ICPP), 2005, pages 547-556, June 2005.
- (2005) International Conference on Parallel Processing (ICPP) , pp. 547-556
- Bader, D.¹ Cong, G.² Feo, J.³

3
- 34548718683
- On the Design and Analysis of Irregular Algorithms on the Cell Processor: A Case Study of List Ranking
- IEEE
- D. A. Bader, V. Agarwal, and K. Madduri. On the Design and Analysis of Irregular Algorithms on the Cell Processor: A Case Study of List Ranking. In 21st IEEE International Parallel and Distributed Processing Symposium (IPDPS), pages 1-10. IEEE, 2007.
- (2007) 21st IEEE International Parallel and Distributed Processing Symposium (IPDPS) , pp. 1-10
- Bader, D.A.¹ Agarwal, V.² Madduri, K.³

4
- 23744452484
- A Fast, Parallel Spanning Tree Algorithm for Symmetric Multiprocessors (SMPs)
- D. A. Bader and G. Cong. A Fast, Parallel Spanning Tree Algorithm for Symmetric Multiprocessors (SMPs). Journal of Parallel and Distributed Computing, 65(9):994-1006, 2005.

5
- 84945304588
- Evaluating Arithmetic Expressions Using Tree Contraction: A Fast and Scalable Parallel Implementation for Symmetric Multiprocessors (SMPs)
- HiPC '02: Proceedings of the 9th International Conference on High Performance Computing, London, UK, Springer-Verlag
- D. A. Bader, S. Sreshta, and N. R. Weisse-Bernstein. Evaluating Arithmetic Expressions Using Tree Contraction: A Fast and Scalable Parallel Implementation for Symmetric Multiprocessors (SMPs). In HiPC '02: Proceedings of the 9th International Conference on High Performance Computing, Lecture Notes in Computer Science, pages 63-78, London, UK, 2002. Springer-Verlag.
- (2002) Lecture Notes in Computer Science , pp. 63-78
- Bader, D.A.¹ Sreshta, S.² Weisse-Bernstein, N.R.³

6
- 85088003777
- GPU Computing with NVIDIA CUDA
- New York, NY, USA, ACM
- I. Buck. GPU Computing with NVIDIA CUDA. In SIGGRAPH '07: ACM SIGGRAPH 2007 courses, page 6, New York, NY, USA, 2007. ACM.
- (2007) SIGGRAPH '07: ACM SIGGRAPH 2007 courses , pp. 6
- Buck, I.¹

7
- 0024684158
- Faster Optimal Parallel Prefix Sums and List Ranking
- R. Cole and U. Vishkin. Faster Optimal Parallel Prefix Sums and List Ranking. Information and Computation, 81(3):334-352, 1989.
- (1989) Information and Computation , vol.81 , Issue.3 , pp. 334-352
- Cole, R.¹ Vishkin, U.²

8
- 57349184047
- Fast Scan Algorithms on Graphics Processors
- ACM New York, NY, USA
- Y. Dotsenko, N. Govindaraju, P. Sloan, C. Boyd, and J. Manferdelli. Fast Scan Algorithms on Graphics Processors. In Proceedings of the 22nd Annual International Conference on Supercomputing (ICS), pages 205-213. ACM New York, NY, USA, 2008.
- (2008) Proceedings of the 22nd Annual International Conference on Supercomputing (ICS) , pp. 205-213
- Dotsenko, Y.¹ Govindaraju, N.² Sloan, P.³ Boyd, C.⁴ Manferdelli, J.⁵

9
- 35948931417
- Cache-Efficient Numerical Algorithms using Graphics Hardware
- N. Govindaraju and D. Manocha. Cache-Efficient Numerical Algorithms using Graphics Hardware. Parallel Computing, 33(10-11):663-684, 2007.
- (2007) Parallel Computing , vol.33 , Issue.10-11 , pp. 663-684
- Govindaraju, N.¹ Manocha, D.²

10
- 34548292052
- A Memory Model for Scientific Algorithms on Graphics Processors
- New York, NY, USA, ACM
- N. K. Govindaraju, S. Larsen, J. Gray, and D. Manocha. A Memory Model for Scientific Algorithms on Graphics Processors. In SC '06: Proceedings of the 2006 ACM/IEEE conference on Supercomputing, page 89, New York, NY, USA, 2006. ACM.
- (2006) SC '06: Proceedings of the 2006 ACM/IEEE conference on Supercomputing , pp. 89
- Govindaraju, N.K.¹ Larsen, S.² Gray, J.³ Manocha, D.⁴

11
- 70449710961
- CUDPP: CUDA Data Parallel Primitives Library
- M. Harris, J. Owens, S. Sengupta, Y. Zhang, and A. Davidson. CUDPP: CUDA Data Parallel Primitives Library. http://gpgpu.org/developer/cudpp.
- Harris, M.¹ Owens, J.² Sengupta, S.³ Zhang, Y.⁴ Davidson, A.⁵

12
- 84979025439
- Designing Practical Efficient Algorithms for Symmetric Multiprocessors
- ALENEX '99: Selected papers from the International Workshop on Algorithm Engineering and Experimentation, Springer-Verlag
- D. R. Helman and J. JáJá. Designing Practical Efficient Algorithms for Symmetric Multiprocessors. In ALENEX '99: Selected papers from the International Workshop on Algorithm Engineering and Experimentation, Lecture Notes in Computer Science, pages 37-56. Springer-Verlag, 1999.
- (1999) Lecture Notes in Computer Science , pp. 37-56
- Helman, D.R.¹ JáJá, J.²

13
- 0003860991
- Addison Wesley Longman Publishing Co, Inc, Redwood City, CA, USA
- J. JáJá. An Introduction to Parallel Algorithms. Addison Wesley Longman Publishing Co., Inc., Redwood City, CA, USA, 1992.
- (1992) An Introduction to Parallel Algorithms
- JáJá, J.¹

14
- 0022246544
- Parallel Tree Contraction and its Application
- G. L. Miller and J. H. Reif. Parallel Tree Contraction and its Application. pages 478-489, Oct. 1985.
- (1985) Parallel Tree Contraction and its Application , pp. 478-489
- Miller, G.L.¹ Reif, J.H.²

15
- 77952342828
- OpenCL Specification V1.0
- Technical report, Khronos OpenCL Working Group
- A. Munshi. OpenCL Specification V1.0. Technical report, Khronos OpenCL Working Group, 2008.
- (2008)
- Munshi, A.¹

16
- 41649101136
- CUDA: Compute Unified Device Architecture Programming Guide
- NVIDIA Corporation, Technical report, NVIDIA, 2007
- NVIDIA Corporation. CUDA: Compute Unified Device Architecture Programming Guide. Technical report, NVIDIA, 2007.

17
- 85032450738
- List Ranking and List Scan on the Cray C-90
- New York, NY, USA, ACM
- M. Reid-Miller. List Ranking and List Scan on the Cray C-90. In SPAA '94: Proceedings of the Sixth Annual ACM Symposium on Parallel Algorithms and Architectures, pages 104-113, New York, NY, USA, 1994. ACM.
- (1994) SPAA '94: Proceedings of the Sixth Annual ACM Symposium on Parallel Algorithms and Architectures , pp. 104-113
- Reid-Miller, M.¹

18
- 43449094719
- Program Optimization Space Pruning for a Multithreaded GPU
- ACM New York, NY, USA
- S. Ryoo, C. Rodrigues, S. Stone, S. Baghsorkhi, S. Ueng, J. Stratton, and W. W. Hwu. Program Optimization Space Pruning for a Multithreaded GPU. In Proceedings of the Sixth Annual IEEE/ACM International Symposium on Code Generation and Optimization, pages 195-204. ACM New York, NY, USA, 2008.
- (2008) Proceedings of the Sixth Annual IEEE/ACM International Symposium on Code Generation and Optimization , pp. 195-204
- Ryoo, S.¹ Rodrigues, C.² Stone, S.³ Baghsorkhi, S.⁴ Ueng, S.⁵ Stratton, J.⁶ Hwu, W.W.⁷

19
- 78651284120
- Scan Primitives for GPU Computing
- Switzerland, Eurographics Association
- S. Sengupta, M. Harris, Y. Zhang, and J. D. Owens. Scan Primitives for GPU Computing. In GH '07: Proceedings of the 22nd ACM SIGGRAPH/EUROGRAPHICS Symposium on Graphics Hardware, pages 97-106, Switzerland, 2007. Eurographics Association.
- (2007) GH '07: Proceedings of the 22nd ACM SIGGRAPH/EUROGRAPHICS Symposium on Graphics Hardware , pp. 97-106
- Sengupta, S.¹ Harris, M.² Zhang, Y.³ Owens, J.D.⁴

20
- 35048813593
- Minimizing Global Communication in Parallel List Ranking
- Euro-Par Parallel Processing, Springer
- J. Sibeyn. Minimizing Global Communication in Parallel List Ranking. In Euro-Par Parallel Processing, Lecture Notes in Computer Science, pages 894-902. Springer, 2003.
- (2003) Lecture Notes in Computer Science , pp. 894-902
- Sibeyn, J.¹

21
- 0004129817
- PhD thesis, Cornell University, Ithaca, NY, USA
- J. C. Wyllie. The Complexity of Parallel Computations. PhD thesis, Cornell University, Ithaca, NY, USA, 1979.
- (1979) The Complexity of Parallel Computations
- Wyllie, J.C.¹

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.