2013, Pages 1013-1024

Deploying graph algorithms on GPUs: An adaptive solution

Author keywords

GPU; Graph algorithms; Runtime system

Indexed keywords

COMPUTATIONAL PATTERNS; ELECTRICAL CIRCUIT MODELS; GPU; GRAPH ALGORITHMS; GRAPHICS PROCESSING UNITS; IRREGULAR DATA STRUCTURES; RUNTIME SYSTEMS; SINGLE SOURCE SHORTEST PATHS

EID: 84884875169     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/IPDPS.2013.101     Document Type: Conference Paper
Times cited: 32

References (33)
  • 2
    • 67650076853
    • How much parallelism is there in irregular applications
    • M. Kulkarni, M. Burtscher, et al., "How much parallelism is there in irregular applications," in Proc. of PPoPP 2009.
    • Proc. of PPoPP 2009
    • Kulkarni, M.; Burtscher, M.
  • 3
    • 79952783409
    • Ordered vs. unordered: A comparison of parallelism and work-efficiency in irregular algorithms
    • M. A. Hassaan, M. Burtscher, and K. Pingali, "Ordered vs. unordered: a comparison of parallelism and work-efficiency in irregular algorithms," in Proc. of PPoPP 2011.
    • Proc. of PPoPP 2011
    • Hassaan, M.A.; Burtscher, M.; Pingali, K.
  • 6
    • 79959869566
    • STAPL: An adaptive, generic parallel C++ library
    • P. An, A. Jula, S. Rus, et al., "STAPL: an adaptive, generic parallel C++ library," in Proc. of LCPC 2003.
    • Proc. of LCPC 2003
    • An, P.; Jula, A.; Rus, S.
  • 7
    • 60649099910
    • Accelerating large graph algorithms on the GPU using CUDA
    • P. Harish and P. J. Narayanan, "Accelerating large graph algorithms on the GPU using CUDA," in Proc. of HiPC 2007.
    • Proc. of HiPC 2007
    • Harish, P.; Narayanan, P.J.
  • 8
    • 77956200064
    • An Effective GPU Implementation of Breadth-first Search
    • L. Luo, M. Wong, and W.-M. Hwu, "An Effective GPU Implementation of Breadth-first Search," in Proc. of DAC 2010.
    • Proc. of DAC 2010
    • Luo, L.; Wong, M.; Hwu, W.-M.
  • 13
    • 84856541553
    • Efficient Parallel Graph Exploration on Multi-Core CPU and GPU
    • S. Hong, T. Oguntebi, and K. Olukotun, "Efficient Parallel Graph Exploration on Multi-Core CPU and GPU," in Proc. of PACT 2011.
    • Proc. of PACT 2011
    • Hong, S.; Oguntebi, T.; Olukotun, K.
  • 14
    • 70649092154
    • Rodinia: A benchmark suite for heterogeneous computing
    • S. Che, M. Boyer, J. Meng, et al., "Rodinia: A benchmark suite for heterogeneous computing," in Proc. of IISWC 2009.
    • Proc. of IISWC 2009
    • Che, S.; Boyer, M.; Meng, J.
  • 15
    • 77954995885
    • Debunking the 100X GPU vs. CPU myth: An evaluation of throughput computing on CPU and GPU
    • V. W. Lee, C. Kim, J. Chhugani, et al., "Debunking the 100X GPU vs. CPU myth: an evaluation of throughput computing on CPU and GPU," in Proc. of ISCA 2010.
    • Proc. of ISCA 2010
    • Lee, V.W.; Kim, C.; Chhugani, J.
  • 19
    • 34547399946
    • Designing Multithreaded Algorithms for Breadth-First Search and st-connectivity on the Cray MTA-2
    • D. A. Bader and K. Madduri, "Designing Multithreaded Algorithms for Breadth-First Search and st-connectivity on the Cray MTA-2," in Proc. of ICPP 2006.
    • Proc. of ICPP 2006
    • Bader, D.A.; Madduri, K.
  • 20
    • 77954929696
    • A work-efficient parallel breadth-first search algorithm (or how to cope with the nondeterminism of reducers)
    • C. E. Leiserson and T. B. Schardl, "A work-efficient parallel breadth-first search algorithm (or how to cope with the nondeterminism of reducers)," in Proc. of SPAA 2010.
    • Proc. of SPAA 2010
    • Leiserson, C.E.; Schardl, T.B.
  • 23
    • 84886400902
    • Parallel Bidirectional Dijkstra's Shortest Path Algorithm
    • G. Vaira and O. Kurasova, "Parallel Bidirectional Dijkstra's Shortest Path Algorithm," in Proc. of DB&IS 2010, 2011.
    • (2011) Proc. of DB&IS 2010
    • Vaira, G.; Kurasova, O.
  • 24
    • 0026986470
    • A parallel algorithm for computing minimum spanning trees
    • D. B. Johnson and P. Metaxas, "A parallel algorithm for computing minimum spanning trees," in Proc. of SPAA 1992.
    • Proc. of SPAA 1992
    • Johnson, D.B.; Metaxas, P.
  • 25
    • 0032313730
    • Practical Parallel Algorithms for Minimum Spanning Trees
    • F. Dehne and S. Gotz, "Practical Parallel Algorithms for Minimum Spanning Trees," in Proc. of SRDS 1998.
    • Proc. of SRDS 1998
    • Dehne, F.; Gotz, S.
  • 26
    • 33749508859
    • Fast shared-memory algorithms for computing the minimum spanning forest of sparse graphs
    • DOI 10.1016/j.jpdc.2006.06.001, PII S0743731506001262
    • D. A. Bader and G. Cong, "Fast shared-memory algorithms for computing the minimum spanning forest of sparse graphs," J. Parallel Distrib. Comput., vol. 66, pp. 1366-1378, 2006. (Pubitemid 44528211)
    • (2006) Journal of Parallel and Distributed Computing, vol. 66, issue 11, pp. 1366-1378
    • Bader, D.A.; Cong, G.
  • 28
    • 0004799247
    • An optimal randomized parallel algorithm for finding connected components in a graph
    • H. Gazit, "An optimal randomized parallel algorithm for finding connected components in a graph," in Proc. of FOCS 1986.
    • Proc. of FOCS 1986
    • Gazit, H.
  • 29
    • 84886381218
    • Finding strongly connected components in parallel in particle transport sweeps
    • W. McLendon III, B. Hendrickson, et al., "Finding strongly connected components in parallel in particle transport sweeps," in Proc. of SPAA 2001.
    • Proc. of SPAA 2001
    • McLendon III, W.; Hendrickson, B.
  • 33
    • 84866465427
    • CUDA Threads and Atomics
    • J. Balfour. (Aug 5). CUDA Threads and Atomics. mc.stanford.edu/cgi-bin/images/3/34/Darve-cme343-cuda-3.pdf
    • Balfour, J.


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.