메뉴 건너뛰기




Volumn , Issue , 2009, Pages 235-243

Fast and scalable list ranking on the GPU

Author keywords

GPGPU; Irregular algorithm; List ranking; Many core; Parallel algorithm

Indexed keywords

BALANCED LOADS; CELL BROADBAND ENGINE; GENERAL PURPOSE PROGRAMMING; GRAPHICS PROCESSING UNIT; IRREGULAR ALGORITHM; LIST RANKING; MANY-CORE; MEMORY ACCESS; MULTITHREADED; MULTITHREADED ARCHITECTURE; PARALLEL COMPUTING; PRACTICAL ISSUES;

EID: 70449700267     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1145/1542275.1542311     Document Type: Conference Paper
Times cited : (20)

References (21)
  • 1
    • 0025256474 scopus 로고
    • A Simple Randomized Parallel Algorithm for List-Ranking
    • R. J. Anderson and G. L. Miller. A Simple Randomized Parallel Algorithm for List-Ranking. Information Processing Letters, 33(5):269-273, 1990.
    • (1990) Information Processing Letters , vol.33 , Issue.5 , pp. 269-273
    • Anderson, R.J.1    Miller, G.L.2
  • 2
    • 33745125067 scopus 로고    scopus 로고
    • On the Architectural Requirements for Efficient Execution of Graph Algorithms
    • June
    • D. Bader, G. Cong, and J. Feo. On the Architectural Requirements for Efficient Execution of Graph Algorithms. International Conference on Parallel Processing (ICPP), 2005, pages 547-556, June 2005.
    • (2005) International Conference on Parallel Processing (ICPP) , pp. 547-556
    • Bader, D.1    Cong, G.2    Feo, J.3
  • 4
    • 23744452484 scopus 로고    scopus 로고
    • D. A. Bader and G. Cong. A Fast, Parallel Spanning Tree Algorithm for Symmetric Multiprocessors (SMPs). Journal of Parallel and Distributed Computing, 65(9):994-1006, 2005.
    • D. A. Bader and G. Cong. A Fast, Parallel Spanning Tree Algorithm for Symmetric Multiprocessors (SMPs). Journal of Parallel and Distributed Computing, 65(9):994-1006, 2005.
  • 5
    • 84945304588 scopus 로고    scopus 로고
    • Evaluating Arithmetic Expressions Using Tree Contraction: A Fast and Scalable Parallel Implementation for Symmetric Multiprocessors (SMPs)
    • HiPC '02: Proceedings of the 9th International Conference on High Performance Computing, London, UK, Springer-Verlag
    • D. A. Bader, S. Sreshta, and N. R. Weisse-Bernstein. Evaluating Arithmetic Expressions Using Tree Contraction: A Fast and Scalable Parallel Implementation for Symmetric Multiprocessors (SMPs). In HiPC '02: Proceedings of the 9th International Conference on High Performance Computing, Lecture Notes in Computer Science, pages 63-78, London, UK, 2002. Springer-Verlag.
    • (2002) Lecture Notes in Computer Science , pp. 63-78
    • Bader, D.A.1    Sreshta, S.2    Weisse-Bernstein, N.R.3
  • 6
    • 85088003777 scopus 로고    scopus 로고
    • GPU Computing with NVIDIA CUDA
    • New York, NY, USA, ACM
    • I. Buck. GPU Computing with NVIDIA CUDA. In SIGGRAPH '07: ACM SIGGRAPH 2007 courses, page 6, New York, NY, USA, 2007. ACM.
    • (2007) SIGGRAPH '07: ACM SIGGRAPH 2007 courses , pp. 6
    • Buck, I.1
  • 7
    • 0024684158 scopus 로고
    • Faster Optimal Parallel Prefix Sums and List Ranking
    • R. Cole and U. Vishkin. Faster Optimal Parallel Prefix Sums and List Ranking. Information and Computation, 81(3):334-352, 1989.
    • (1989) Information and Computation , vol.81 , Issue.3 , pp. 334-352
    • Cole, R.1    Vishkin, U.2
  • 9
    • 35948931417 scopus 로고    scopus 로고
    • Cache-Efficient Numerical Algorithms using Graphics Hardware
    • N. Govindaraju and D. Manocha. Cache-Efficient Numerical Algorithms using Graphics Hardware. Parallel Computing, 33(10-11):663-684, 2007.
    • (2007) Parallel Computing , vol.33 , Issue.10-11 , pp. 663-684
    • Govindaraju, N.1    Manocha, D.2
  • 12
    • 84979025439 scopus 로고    scopus 로고
    • Designing Practical Efficient Algorithms for Symmetric Multiprocessors
    • ALENEX '99: Selected papers from the International Workshop on Algorithm Engineering and Experimentation, Springer-Verlag
    • D. R. Helman and J. JáJá. Designing Practical Efficient Algorithms for Symmetric Multiprocessors. In ALENEX '99: Selected papers from the International Workshop on Algorithm Engineering and Experimentation, Lecture Notes in Computer Science, pages 37-56. Springer-Verlag, 1999.
    • (1999) Lecture Notes in Computer Science , pp. 37-56
    • Helman, D.R.1    JáJá, J.2
  • 13
    • 0003860991 scopus 로고
    • Addison Wesley Longman Publishing Co, Inc, Redwood City, CA, USA
    • J. JáJá. An Introduction to Parallel Algorithms. Addison Wesley Longman Publishing Co., Inc., Redwood City, CA, USA, 1992.
    • (1992) An Introduction to Parallel Algorithms
    • JáJá, J.1
  • 15
    • 77952342828 scopus 로고    scopus 로고
    • OpenCL Specification V1.0
    • Technical report, Khronos OpenCL Working Group
    • A. Munshi. OpenCL Specification V1.0. Technical report, Khronos OpenCL Working Group, 2008.
    • (2008)
    • Munshi, A.1
  • 16
    • 41649101136 scopus 로고    scopus 로고
    • CUDA: Compute Unified Device Architecture Programming Guide
    • NVIDIA Corporation, Technical report, NVIDIA, 2007
    • NVIDIA Corporation. CUDA: Compute Unified Device Architecture Programming Guide. Technical report, NVIDIA, 2007.
  • 20
    • 35048813593 scopus 로고    scopus 로고
    • Minimizing Global Communication in Parallel List Ranking
    • Euro-Par Parallel Processing, Springer
    • J. Sibeyn. Minimizing Global Communication in Parallel List Ranking. In Euro-Par Parallel Processing, Lecture Notes in Computer Science, pages 894-902. Springer, 2003.
    • (2003) Lecture Notes in Computer Science , pp. 894-902
    • Sibeyn, J.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.