SCOPUS 정보 검색 플랫폼

ACM SIGPLAN Notices

Volumn 47, Issue 8, 2012, Pages 117-127

Scalable gpu graph traversal

(3) Merrill, Duane a Garland, Michael b Grimshaw, Andrew a

a UNIVERSITY OF VIRGINIA (United States)

b NVIDIA (United States)

Author keywords

Breadth first search; GPU; Graph algorithms; Graph traversal; Parallel algorithms; Prefix sum; Sparse graph

Indexed keywords

BREADTH-FIRST SEARCH; GPU; GRAPH ALGORITHMS; GRAPH TRAVERSALS; PREFIX SUM; SPARSE GRAPHS;

ALGORITHMS; PARALLEL ALGORITHMS;

GRAPH THEORY;

EID: 84878544432 PISSN: 15232867 EISSN: None Source Type: Journal
DOI: 10.1145/2370036.2145832 Document Type: Conference Paper

Times cited : (241)

References (33)

1
- 84878572675
- Accessed: 07-11
- 10th DIMACS Implementation Challenge: http://www.cc.gatech.edu/dimacs10/ index.shtml. Accessed: 2011-07-11.
- (2011) 10th DIMACS Implementation Challenge

2
- 84878525321
- Accessed: 07-11
- 9th DIMACS Implementation Challenge: http://www.dis.uniroma1.it/ ~challenge9/download.shtml. Accessed: 2011-07-11.
- (2011) 9th DIMACS Implementation Challenge

3
- 78650848150
- Scalable graph exploration on multicore processors
- New Orleans, LA, USA, Nov. 2010
- Agarwal, V. et al. 2010. Scalable Graph Exploration on Multicore Processors. 2010 ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis (New Orleans, LA, USA, Nov. 2010), 1-11.
- (2010) 2010 ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis , pp. 1-11
- Agarwal, V.¹

4
- 34547399946
- Designing multithreaded algorithms for breadth-first search and st-connectivity on the cray MTA-2
- Columbus, OH, USA
- Bader, D.A. and Madduri, K. Designing Multithreaded Algorithms for Breadth-First Search and st-connectivity on the Cray MTA-2. 2006 International Conference on Parallel Processing (ICPP'06) (Columbus, OH, USA), 523-530.
- 2006 International Conference on Parallel Processing (ICPP'06) , pp. 523-530
- Bader, D.A.¹ Madduri, K.²

5
- 33745125067
- On the architectural requirements for efficient execution of graph algorithms
- Oslo, Norway
- Bader, D.A. et al. On the Architectural Requirements for Efficient Execution of Graph Algorithms. 2005 International Conference on Parallel Processing (ICPP'05) (Oslo, Norway), 547-556.
- (2005) International Conference on Parallel Processing (ICPP'05) , pp. 547-556
- Bader, D.A.¹

6
- 74049143158
- Implementing sparse matrix-vector multiplication on throughput-oriented processors
- New York, NY, USA, 2009
- Bell, N. and Garland, M. 2009. Implementing sparse matrix-vector multiplication on throughput-oriented processors. Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis (New York, NY, USA, 2009), 18:1-18:11.
- (2009) Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis , pp. 1-11
- Bell, N.¹ Garland, M.²

7
- 0002924004
- Synthesis of Parallel Algorithms
- Blelloch, G.E. 1990. Prefix Sums and Their Applications. Synthesis of Parallel Algorithms.
- (1990) Prefix Sums and Their Applications
- Blelloch, G.E.¹

8
- 0024770039
- Scans as primitive parallel operations
- Nov. 1989
- Blelloch, G.E. 1989. Scans as primitive parallel operations. IEEE Transactions on Computers. 38, 11 (Nov. 1989), 1526-1538.
- (1989) IEEE Transactions on Computers , vol.38 , Issue.11 , pp. 1526-1538
- Blelloch, G.E.¹

9
- 0025550099
- Scan primitives for vector computers
- Los Alamitos, CA, USA, 1990
- Chatterjee, S. et al. 1990. Scan primitives for vector computers. Proceedings of the 1990 ACM/IEEE conference on Supercomputing (Los Alamitos, CA, USA, 1990), 666-675.
- (1990) Proceedings of the 1990 ACM/IEEE Conference on Supercomputing , pp. 666-675
- Chatterjee, S.¹

10
- 70649092154
- Rodinia: A benchmark suite for heterogeneous computing
- Austin, TX, USA, Oct. 2009
- Che, S. et al. 2009. Rodinia: A benchmark suite for heterogeneous computing. 2009 IEEE International Symposium on Workload Characterization (IISWC) (Austin, TX, USA, Oct. 2009), 44-54.
- (2009) 2009 IEEE International Symposium on Workload Characterization (IISWC) , pp. 44-54
- Che, S.¹

11
- 0004116989
- MIT Press
- Cormen, T.H. et al. 2001. Introduction to Algorithms. MIT Press.
- (2001) Introduction to Algorithms
- Cormen, T.H.¹

12
- 76349105923
- Taming irregular EDA applications on GPUs
- New York, NY, USA, 2009
- Deng, Y. (Steve) et al. 2009. Taming irregular EDA applications on GPUs. Proceedings of the 2009 International Conference on Computer-Aided Design (New York, NY, USA, 2009), 539-546.
- (2009) Proceedings of the 2009 International Conference on Computer-Aided Design , pp. 539-546
- Deng, Y.¹

13
- 57349184047
- Fast scan algorithms on graphics processors
- New York, NY, USA, 2008
- Dotsenko, Y. et al. 2008. Fast scan algorithms on graphics processors. Proceedings of the 22nd annual international conference on Supercomputing (New York, NY, USA, 2008), 205-213.
- (2008) Proceedings of the 22nd Annual International Conference on Supercomputing , pp. 205-213
- Dotsenko, Y.¹

14
- 51549093017
- Sparse matrix computations on manycore GPU's
- New York, NY, USA, 2008
- Garland, M. 2008. Sparse matrix computations on manycore GPU's. Proceedings of the 45th annual Design Automation Conference (New York, NY, USA, 2008), 2-6.
- (2008) Proceedings of the 45th Annual Design Automation Conference , pp. 2-6
- Garland, M.¹

15
- 84878570817
- Accessed: 07-11
- GTgraph: A suite of synthetic random graph generators: https://sdm.lbl.gov/~kamesh/software/GTgraph/. Accessed: 2011-07-11.
- (2011) GTgraph: A Suite of Synthetic Random Graph Generators

16
- 38349041620
- Accelerating large graph algorithms on the GPU using CUDA
- Berlin, Heidelberg, 2007
- Harish, P. and Narayanan, P.J. 2007. Accelerating large graph algorithms on the GPU using CUDA. Proceedings of the 14th international conference on High performance computing (Berlin, Heidelberg, 2007), 197-208.
- (2007) Proceedings of the 14th International Conference on High Performance Computing , pp. 197-208
- Harish, P.¹ Narayanan, P.J.²

17
- 0022882379
- Data parallel algorithms
- Dec. 1986
- Hillis, W.D. and Steele, G.L. 1986. Data parallel algorithms. Communications of the ACM. 29, 12 (Dec. 1986), 1170-1183.
- (1986) Communications of the ACM , vol.29 , Issue.12 , pp. 1170-1183
- Hillis, W.D.¹ Steele, G.L.²

18
- 79952811127
- Accelerating CUDA graph algorithms at maximum warp
- New York, NY, USA, 2011
- Hong, S. et al. 2011. Accelerating CUDA graph algorithms at maximum warp. Proceedings of the 16th ACM symposium on Principles and practice of parallel programming (New York, NY, USA, 2011), 267-276.
- (2011) Proceedings of the 16th ACM Symposium on Principles and Practice of Parallel Programming , pp. 267-276
- Hong, S.¹

19
- 84858417648
- (New York, NY, USA, 2011), to appear
- Hong, S. et al. 2011. Efficient Parallel Graph Exploration for Multi-Core CPU and GPU. (New York, NY, USA, 2011), to appear.
- (2011) Efficient Parallel Graph Exploration for Multi-Core CPU and GPU
- Hong, S.¹

20
- 51849094710
- On implementing graph cuts on CUDA
- Boston, MA, Oct. 2007
- Hussein, M. et al. 2007. On Implementing Graph Cuts on CUDA. First Workshop on General Purpose Processing on Graphics Processing Units (Boston, MA, Oct. 2007).
- (2007) First Workshop on General Purpose Processing on Graphics Processing Units
- Hussein, M.¹

21
- 77954929696
- A work-efficient parallel breadth-first search algorithm (or how to cope with the nondeterminism of reducers)
- New York, NY, USA, 2010
- Leiserson, C.E. and Schardl, T.B. 2010. A work-efficient parallel breadth-first search algorithm (or how to cope with the nondeterminism of reducers). Proceedings of the 22nd ACM symposium on Parallelism in algorithms and architectures (New York, NY, USA, 2010), 303-314.
- (2010) Proceedings of the 22nd ACM Symposium on Parallelism in Algorithms and Architectures , pp. 303-314
- Leiserson, C.E.¹ Schardl, T.B.²

22
- 77956200064
- An effective GPU implementation of breadthfirst search
- New York, NY, USA, 2010
- Luo, L. et al. 2010. An effective GPU implementation of breadthfirst search. Proceedings of the 47th Design Automation Conference (New York, NY, USA, 2010), 52-55.
- (2010) Proceedings of the 47th Design Automation Conference , pp. 52-55
- Luo, L.¹

23
- 79959718248
- High performance and scalable radix sorting: A case study of implementing dynamic parallelism for GPU computing
- 2011
- Merrill, D. and Grimshaw, A. 2011. High Performance and Scalable Radix Sorting: A case study of implementing dynamic parallelism for GPU computing. Parallel Processing Letters. 21, 02 (2011), 245-272.
- (2011) Parallel Processing Letters , vol.21 , Issue.2 , pp. 245-272
- Merrill, D.¹ Grimshaw, A.²

24
- 78149268496
- Parallel scan for stream architectures
- Department of Computer Science, University of Virginia
- Merrill, D. and Grimshaw, A. 2009. Parallel Scan for Stream Architectures. Technical Report #CS2009-14. Department of Computer Science, University of Virginia.
- (2009) Technical Report #CS2009-14
- Merrill, D.¹ Grimshaw, A.²

25
- 84858400990
- High performance and scalable GPU graph traversal
- Department of Computer Science, University of Virginia
- Merrill, D. et al. 2011. High Performance and Scalable GPU Graph Traversal. Technical Report #CS2011-05. Department of Computer Science, University of Virginia.
- (2011) Technical Report #CS2011-05
- Merrill, D.¹

26
- 80053212330
- Accessed: -07-11
- Parboil Benchmark suite: http://impact.crhc.illinois.edu/parboil.php. Accessed: 2011-07-11.
- (2011) Parboil Benchmark Suite

27
- 51649124194
- Efficient breadth-first search on the Cell/BE processor
- Oct. 2008
- Scarpazza, D.P. et al. 2008. Efficient Breadth-First Search on the Cell/BE Processor. IEEE Transactions on Parallel and Distributed Systems. 19, 10 (Oct. 2008), 1381-1395.
- (2008) IEEE Transactions on Parallel and Distributed Systems , vol.19 , Issue.10 , pp. 1381-1395
- Scarpazza, D.P.¹

28
- 77952833958
- Efficient parallel scan algorithms for GPUs
- NVIDIA
- Sengupta, S. et al. 2008. Efficient parallel scan algorithms for GPUs. Technical Report #NVR-2008-003. NVIDIA.
- (2008) Technical Report #NVR-2008-003
- Sengupta, S.¹

29
- 84878557233
- Accessed: 07-11
- The Graph 500 List: http://www.graph500.org/. Accessed: 2011-07-11.
- (2011) The Graph 500 List

30
- 0025588012
- High-probability parallel transitive closure algorithms
- Island of Crete, Greece, 1990
- Ullman, J. and Yannakakis, M. 1990. High-probability parallel transitive closure algorithms. Proceedings of the second annual ACM symposium on Parallel algorithms and architectures - SPAA '90 (Island of Crete, Greece, 1990), 200-209.
- (1990) Proceedings of the Second Annual ACM Symposium on Parallel Algorithms and Architectures - SPAA '90 , pp. 200-209
- Ullman, J.¹ Yannakakis, M.²

31
- 0012453312
- Accessed: 07-11
- University of Florida Sparse Matrix Collection: http://www.cise.ufl.edu/ research/sparse/matrices/. Accessed: 2011-07-11.
- (2011) University of Florida Sparse Matrix Collection

32
- 77952348205
- Topologically adaptive parallel breadth-first search on multicore processors
- Nov. 2009
- Xia, Y. and Prasanna, V.K. 2009. Topologically Adaptive Parallel Breadth-first Search on Multicore Processors. 21st International Conference on Parallel and Distributed Computing and Systems (PDCS'09) (Nov. 2009).
- (2009) 21st International Conference on Parallel and Distributed Computing and Systems (PDCS'09)
- Xia, Y.¹ Prasanna, V.K.²

33
- 33845388971
- A Scalable distributed parallel breadth-first search algorithm on blue Gene/L
- Seattle, WA, USA
- Yoo, A. et al. A Scalable Distributed Parallel Breadth-First Search Algorithm on Blue Gene/L. ACM/IEEE SC 2005 Conference (SC'05) (Seattle, WA, USA), 25-25.
- ACM/IEEE SC 2005 Conference (SC'05) , pp. 25-25
- Yoo, A.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.