2014, Pages 239-251

CuSha: Vertex-centric graph processing on GPUs

Author keywords

Coalesced memory accesses; Concatenated Windows; G-Shards; GPU; Graph representation

Indexed keywords

ALGORITHMS; PROGRAM PROCESSORS

EID: 84904412583     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1145/2600212.2600227     Document Type: Conference Paper
Times cited: 249

References (30)
  • 1
    • 78650848150
    • Scalable graph exploration on multicore processors
    • V. Agarwal, F. Petrini, D. Pasetto, and D. A. Bader. Scalable graph exploration on multicore processors. In SC, pages 1-11, 2010.
    • (2010) SC , pp. 1-11
    • Agarwal, V.1    Petrini, F.2    Pasetto, D.3    Bader, D.A.4
  • 2
    • 51049088410
    • SNAP, small-world network analysis and partitioning: An open-source parallel graph framework for the exploration of large-scale networks
    • D. A. Bader and K. Madduri. SNAP, small-world network analysis and partitioning: An open-source parallel graph framework for the exploration of large-scale networks. In IPDPS, pages 1-12, 2008.
    • (2008) IPDPS , pp. 1-12
    • Bader, D.A.1    Madduri, K.2
  • 3
    • 70349169075
    • Analyzing CUDA workloads using a detailed GPU simulator
    • A. Bakhoda, G. Yuan, W. Fung, H. Wong, and T. Aamodt. Analyzing CUDA workloads using a detailed GPU simulator. In ISPASS, pages 163-174, 2009.
    • (2009) ISPASS , pp. 163-174
    • Bakhoda, A.1    Yuan, G.2    Fung, W.3    Wong, H.4    Aamodt, T.5
  • 4
    • 80053287330
    • Computing strongly connected components in parallel on CUDA
    • J. Barnat, P. Bauch, L. Brim, and M. Ceska. Computing strongly connected components in parallel on CUDA. In IPDPS, pages 544-555, 2011.
    • (2011) IPDPS , pp. 544-555
    • Barnat, J.1    Bauch, P.2    Brim, L.3    Ceska, M.4
  • 5
    • 32344433867
    • R-MAT: A recursive model for graph mining
    • D. Chakrabarti, Y. Zhan, and C. Faloutsos. R-MAT: A recursive model for graph mining. In SDM, 2004.
    • (2004) SDM
    • Chakrabarti, D.1    Zhan, Y.2    Faloutsos, C.3
  • 6
    • 83155184570
    • Dymaxion: Optimizing memory access patterns for heterogeneous systems
    • S. Che, J. Sheaffer, and K. Skadron. Dymaxion: Optimizing memory access patterns for heterogeneous systems. In SC, pages 1-11, 2011.
    • (2011) SC , pp. 1-11
    • Che, S.1    Sheaffer, J.2    Skadron, K.3
  • 8
    • 84867570222
    • A yoke of oxen and a thousand chickens for heavy lifting graph processing
    • A. Gharaibeh, L. Beltrao Costa, E. Santos-Neto, and M. Ripeanu. A yoke of oxen and a thousand chickens for heavy lifting graph processing. In PACT, pages 345-354, 2012.
    • (2012) PACT , pp. 345-354
    • Gharaibeh, A.1    Costa, L.B.2    Santos-Neto, E.3    Ripeanu, M.4
  • 9
    • 84904415300
    • Efficient large-scale graph processing on hybrid CPU and GPU Systems
    • A. Gharaibeh, L. Beltrao Costa, E. Santos-Neto, and M. Ripeanu. Efficient Large-Scale Graph Processing on Hybrid CPU and GPU Systems. In CoRR, 2013.
    • (2013) CoRR
    • Gharaibeh, A.1    Costa, L.B.2    Santos-Neto, E.3    Ripeanu, M.4
  • 10
    • 38349041620
    • Accelerating large graph algorithms on the GPU using CUDA
    • P. Harish and P. J. Narayanan. Accelerating large graph algorithms on the GPU using CUDA. In HiPC, pages 197-208, 2007.
    • (2007) HiPC , pp. 197-208
    • Harish, P.1    Narayanan, P.J.2
  • 11
    • 54049101457
    • Optimizing parallel reduction in CUDA
    • M. Harris et al. Optimizing parallel reduction in CUDA. NVIDIA Developer Technology, 2, 2007.
    • (2007) NVIDIA Developer Technology , pp. 2
    • Harris, M.1
  • 12
    • 79952811127
    • Accelerating CUDA graph algorithms at maximum warp
    • S. Hong, S. K. Kim, T. Oguntebi, and K. Olukotun. Accelerating CUDA graph algorithms at maximum warp. In PPoPP, pages 267-276, 2011.
    • (2011) PPoPP , pp. 267-276
    • Hong, S.1    Kim, S.K.2    Oguntebi, T.3    Olukotun, K.4
  • 13
    • 84856541553
    • Efficient parallel graph exploration on multi-core CPU and GPU
    • S. Hong, T. Oguntebi, and K. Olukotun. Efficient parallel graph exploration on multi-core CPU and GPU. In PACT, 2011.
    • (2011) PACT
    • Hong, S.1    Oguntebi, T.2    Olukotun, K.3
  • 14
    • 85042632297
    • GraphChi: Large-scale graph computation on just a PC
    • A. Kyrola, G. Blelloch, and C. Guestrin. GraphChi: Large-scale graph computation on just a PC. In OSDI, pages 31-46, 2012.
    • (2012) OSDI , pp. 31-46
    • Kyrola, A.1    Blelloch, G.2    Guestrin, C.3
  • 17
    • 70350647041
    • Community structure in large networks: Natural cluster sizes and the absence of large well-defined clusters
    • J. Leskovec, K. J. Lang, A. Dasgupta, and M. W. Mahoney. Community structure in large networks: Natural cluster sizes and the absence of large well-defined clusters. CoRR, abs/0810.1355, 2008.
    • (2008) CoRR
    • Leskovec, J.1    Lang, K.J.2    Dasgupta, A.3    Mahoney, M.W.4
  • 19
    • 77956200064
    • An effective GPU implementation of breadth-first search
    • L. Luo, M. Wong, and W.-M. Hwu. An effective GPU implementation of breadth-first search. In DAC, pages 52-55, 2010.
    • (2010) DAC , pp. 52-55
    • Luo, L.1    Wong, M.2    Hwu, W.-M.3
  • 20
    • 84878605997
    • A GPU implementation of inclusion-based points-to analysis
    • M. Mendez-Lojo, M. Burtscher, and K. Pingali. A GPU implementation of inclusion-based points-to analysis. In PPoPP, pages 107-116, 2012.
    • (2012) PPoPP , pp. 107-116
    • Mendez-Lojo, M.1    Burtscher, M.2    Pingali, K.3
  • 21
    • 84878544432
    • Scalable GPU graph traversal
    • D. Merrill, M. Garland, and A. Grimshaw. Scalable GPU graph traversal. In PPoPP, pages 117-128, 2012.
    • (2012) PPoPP , pp. 117-128
    • Merrill, D.1    Garland, M.2    Grimshaw, A.3
  • 22
    • 84885201942
    • Morph algorithms on GPUs
    • R. Nasre, M. Burtscher, and K. Pingali. Morph algorithms on GPUs. In PPoPP, pages 147-156, 2013.
    • (2013) PPoPP , pp. 147-156
    • Nasre, R.1    Burtscher, M.2    Pingali, K.3
  • 24
    • 84863443917
    • Adaptive input-aware compilation for graphics engines
    • M. Samadi, A. Hormati, M. Mehrara, J. Lee, and S. Mahlke. Adaptive input-aware compilation for graphics engines. In PLDI, pages 13-22, 2012.
    • (2012) PLDI , pp. 13-22
    • Samadi, M.1    Hormati, A.2    Mehrara, M.3    Lee, J.4    Mahlke, S.5
  • 26
    • 0025467711
    • A bridging model for parallel computation
    • L. G. Valiant. A bridging model for parallel computation. Commun. ACM, 33(8):103-111, 1990.
    • (1990) Commun. ACM , vol.33 , Issue.8 , pp. 103-111
    • Valiant, L.G.1
  • 27
    • 84885201786
    • Complexity analysis and algorithm design for reorganizing data to minimize non-coalesced memory accesses on GPU
    • B. Wu, Z. Zhao, E. Z. Zhang, Y. Jiang, and X. Shen. Complexity analysis and algorithm design for reorganizing data to minimize non-coalesced memory accesses on GPU. In PPoPP, pages 57-68, 2013.
    • (2013) PPoPP , pp. 57-68
    • Wu, B.1    Zhao, Z.2    Zhang, E.Z.3    Jiang, Y.4    Shen, X.5
  • 28
    • 78649564670
    • Efficient PageRank and SpMV computation on AMD GPUs
    • T. Wu, B. Wang, Y. Shan, F. Yan, Y. Wang, and N. Xu. Efficient PageRank and SpMV computation on AMD GPUs. In ICPP, pages 81-89, 2010.
    • (2010) ICPP , pp. 81-89
    • Wu, T.1    Wang, B.2    Shan, Y.3    Yan, F.4    Wang, Y.5    Xu, N.6
  • 29
    • 79953126288
    • On-the-fly elimination of dynamic irregularities for GPU computing
    • E. Z. Zhang, Y. Jiang, Z. Guo, K. Tian, and X. Shen. On-the-fly elimination of dynamic irregularities for GPU computing. In ASPLOS, pages 369-380, 2011.
    • (2011) ASPLOS , pp. 369-380
    • Zhang, E.Z.1    Jiang, Y.2    Guo, Z.3    Tian, K.4    Shen, X.5


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.