SCOPUS 정보 검색 플랫폼

Proceedings - 2013 IEEE International Symposium on Workload Characterization, IISWC 2013

Volumn , Issue , 2013, Pages 185-195

Pannotia: Understanding irregular GPGPU graph applications

(4) Che, Shuai a Beckmann, Bradford M a Reinhardt, Steven K a Skadron, Kevin b

a AMD Research and Computer Science (United States)

b UNIVERSITY OF VIRGINIA (United States)

Author keywords

[No Author keywords available]

Indexed keywords

ACCESS PATTERNS; DATA-PARALLEL APPLICATIONS;

PROGRAM PROCESSORS;

EID: 84893628986 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/IISWC.2013.6704684 Document Type: Conference Paper

Times cited : (165)

References (43)

1
- 84893594127
- AMD Radeon HD 7000 series counters. Web resource
- AMD Radeon HD 7000 series counters. Web resource. http: //developer.amd.com/tools/heterogeneous-computing/amd-app-profiler/ user- guide/app- profiler- settings/.

2
- 84893553919
- The 10th DIMACS Implementation Challenge Graph Partitioning and Graph Clustering. Web resource
- The 10th DIMACS Implementation Challenge Graph Partitioning and Graph Clustering. Web resource. http://www.cc.gatech.edu/dimacslO/.

3
- 84893617546
- The 9th DIMACS Implementation Challenge Shortest Paths. Web resource
- The 9th DIMACS Implementation Challenge Shortest Paths. Web resource. http://www.dis.uniromal.it/challenge9/.

4
- 84893549371
- AMD Accelerated Parallel Processing: OpenCL Programming Guide. Web resource
- AMD Accelerated Parallel Processing: OpenCL Programming Guide. Web resource.

5
- 84893538088
- AMD Graphics Core Next Architecture. Web resource
- http://developer.amd.com/download/AMD-Accelerated- Parallel-Processing- OpenCL-Programming-Guide.pdf AMD Graphics Core Next Architecture. Web resource. http://www.amd. com/us/products/technologies/gcn/Pages/gcn-architecture.aspx.

6
- 85166916238
- Gephi: An open source software for exploring and manipulating networks
- May
- M. Bastian, S. Heymann, and M. Jacomy. Gephi: An open source software for exploring and manipulating networks. In ICWSM, May 2009.
- (2009) ICWSM
- Bastian, M.¹ Heymann, S.² Jacomy, M.³

7
- 56449124998
- Splash-2: A quantitative comparison of two multithreaded benchmark suites on chip-multiprocessors
- Sep
- C. Bienia, S. Kumar, and K. Li. PARSEC vs. SPLASH-2: A quantitative comparison of two multithreaded benchmark suites on chip-multiprocessors. In IISWC, Sep 2008.
- (2008) IISWC
- Bienia, C.¹ Kumar, S.² Li. Parsec Vs, K.³

8
- 78751484931
- Fidelity and scaling of the parsec benchmark inputs
- Dec
- C. Bienia and K. Li. Fidelity and scaling of the PARSEC benchmark inputs. In IISWC, Dec 2010
- (2010) IISWC
- Bienia, C.¹ Li, K.²

9
- 0035648637
- A faster algorithm for betweenness centrality
- Ulrik Brandes. A faster algorithm for betweenness centrality. 1. Math. Social., 25:163-177, 2001.
- (2001) 1. Math. Social , vol.25 , pp. 163-177
- Brandes, U.¹

10
- 84873458159
- A quantitative study ofirregular programs on gpus
- Nov
- M. Burtscher, R. Nasre, and K. Pingali. A quantitative study ofirregular programs on GPUs. In IISWC, Nov 2012.
- (2012) IISWC
- Burtscher, M.¹ Nasre, R.² Pingali, K.³

11
- 84858427151
- An efficient cuda implementation of the tree-based barnes hut n-body algorithm
- Morgan Kaufmann
- M. Burtscher and K. Pingali. An efficient CUDA implementation of the tree-based Barnes Hut n-body algorithm. In GPU Computing Gems, pages 75-92. Morgan Kaufmann, 2011.
- (2011) GPU Computing Gems , pp. 75-92
- Burtscher, M.¹ Pingali, K.²

12
- 70649092154
- Rodinia: A benchmark suite for heterogeneous computing
- Oct
- S. Che, M. Boyer, J. Meng, D. Tarjan, J. W. Sheaffer, S-H. Lee, and K. Skadron. Rodinia: A benchmark suite for heterogeneous computing. In IISWC, Oct 2009.
- (2009) IISWC
- Che, S.¹ Boyer, M.² Meng, J.³ Tarjan, D.⁴ Sheaffer, J.W.⁵ Lee, S.-H.⁶ Skadron, K.⁷

13
- 78751505898
- A characterization of the rodinia benchmark suite with comparison to contemporary cmp workloads
- Dec
- S. Che, J. W. Sheaffer, M. Boyer, L. G. Szafaryn, L. Wang, and K. Skadron. A characterization of the Rodinia benchmark suite with comparison to contemporary CMP workloads. In IISWC, Dec 2010.
- (2010) IISWC
- Che, S.¹ Sheaffer, J.W.² Boyer, M.³ Szafaryn, L.G.⁴ Wang, L.⁵ Skadron, K.⁶

14
- 0004116989
- McGraw-Hill, 2nd edition
- T. H. Cormen, C. Stein, R. L. Rivest, and C. E. Leiserson. Introduction to Algorithms. McGraw-Hill, 2nd edition, 2001.
- (2001) Introduction to Algorithms
- Cormen, T.H.¹ Stein, C.² Rivest, R.L.³ Leiserson, C.E.⁴

15
- 77954719557
- The scalable heterogeneous computing (shoc) benchmark suite
- Mar
- A. Danalis, G. Marin, C. McCurdy, J. S. Meredith, P. C. Roth, K. Spafford, V. Tipparaju, and J. S. Vetter. The scalable HeterOgeneous computing (SHOC) benchmark suite. In GPGPU, Mar 2010.
- (2010) GPGPU
- Danalis, A.¹ Marin, G.² McCurdy, C.³ Meredith, J.S.⁴ Roth, P.C.⁵ Spafford, K.⁶ Tipparaju, V.⁷ Vetter, J.S.⁸

16
- 84948749348
- Workload design: Selecting representative program-input pairs
- Sept
- L. Eeckhout, H. Vandierendonck, and K. D. Bosschere. Workload design: Selecting representative program-input pairs. In PACT, Sept 2002.
- (2002) PACT
- Eeckhout, L.¹ Vandierendonck, H.² Bosschere, K.D.³

17
- 78751477137
- Exploring gpgpu workloads: Characterization methodology, analysis and microarchitecture evaluation implication
- Dec
- N. Goswami, R. Shankar, M. Joshi, and Tao Li. Exploring gpgpu workloads: Characterization methodology, analysis and microarchitecture evaluation implication. In IISWC, Dec 2010.
- (2010) IISWC
- Goswami, N.¹ Shankar, R.² Joshi, M.³ Li, T.⁴

18
- 84893608095
- NVIDIA CUDA Programming Guide. Web resource
- NVIDIA CUDA Programming Guide. Web resource. http://developer. nvidia.com/object/gpucomputing.html.

19
- 60649099910
- Accelerating large graph algorithms on the gpu using cuda
- Dec
- P. Harish and P. Narayanan. Accelerating large graph algorithms on the GPU using CUDA. In HiPC, Dec 2007.
- (2007) HiPC
- Harish, P.¹ Narayanan, P.²

20
- 34548329985
- Microarchitecture-independent workload characterization
- K. Hoste and L. Eeckhout. Microarchitecture-independent workload characterization. IEEE Micro, 27(3):63-72, 2007.
- (2007) IEEE Micro , vol.27 , Issue.3 , pp. 63-72
- Hoste, K.¹ Eeckhout, L.²

21
- 84893609423
- J. Cohen and P. Castonguay. Efficient graph matching and coloring on the gpu. http://developer.download.nvidia.com/GTC/PDF/GTC2012/ PresentationPDF/ S0332-GTC2012-Graph-Coloring-GPU.pdf.
- Efficient Graph Matching and Coloring on the Gpu
- Cohen, J.¹ Castonguay, P.²

22
- 33646486530
- Measuring benchmark similarity using inherent program characteristics
- A. Joshi, A. Phansalkar, L. Eeckhout, and L. K. John. Measuring benchmark similarity using inherent program characteristics. IEEE Trans. Comp, 55(6):769-782, 2006.
- (2006) IEEE Trans. Comp , vol.55 , Issue.6 , pp. 769-782
- Joshi, A.¹ Phansalkar, A.² Eeckhout, L.³ John, L.K.⁴

23
- 79952042632
- Connected component labeling on a 2d grid using cuda
- O. Kalentev, A. Rai, S. Kemnitz, and R. Schneider. Connected component labeling on a 2D grid using CUDA. J. Parallel and Dist. Comp., 71(4):615-620, 2011.
- (2011) J. Parallel and Dist. Comp , vol.71 , Issue.4 , pp. 615-620
- Kalentev, O.¹ Rai, A.² Kemnitz, S.³ Schneider, R.⁴

24
- 70649104826
- A characterization and analysis of ptx kernels
- Oct
- A. Kerr, G. Diamos, and S. Yalamanchili. A characterization and analysis of PTX kernels. In IISWC, Oct 2009.
- (2009) IISWC
- Kerr, A.¹ Diamos, G.² Yalamanchili, S.³

25
- 85042632297
- Graphchi: Large-scale graph computation on just a pc
- Oct
- A. Kyrola, G. Blelloch, and C. Guestrin. Graphchi: Large-scale graph computation on just a pc. In OSDI, Oct 2012.
- (2012) OSDI
- Kyrola, A.¹ Blelloch, G.² Guestrin, C.³

26
- 80052875653
- Graphlab: A new parallel framework for machine learning
- July
- Y. Low, J. Gonzalez, A. Kyrola, D. Bickson, C. Guestrin, and J. M. Hellerstein. Graphlab: A new parallel framework for machine learning. In UAI, July 2010.
- (2010) UAI
- Low, Y.¹ Gonzalez, J.² Kyrola, A.³ Bickson, D.⁴ Guestrin, C.⁵ Hellerstein, J.M.⁶

27
- 0021946709
- A simple parallel algorithm for the maximal independent set problem
- May
- M. Luby. A simple parallel algorithm for the maximal independent set problem. In STOC, May 1985.
- (1985) STOC
- Luby, M.¹

28
- 77954723629
- Pregel: A system for large-scale graph processing
- June
- G. Malewicz, M. H. Austern, A. J.C Bik, J. C. Dehnert, I. Horn, N. Leiser, and G. Czajkowski. Pregel: a system for large-scale graph processing. In SIGMOD, June 2010.
- (2010) SIGMOD
- Malewicz, G.¹ Austern, M.H.² Bik, A.J.C.³ Dehnert, J.C.⁴ Horn, I.⁵ Leiser, N.⁶ Czajkowski, G.⁷

29
- 84893592799
- Matrix Market Format. Web resource
- Matrix Market Format. Web resouce. http://math.nist.gov/MatrixMarket/ formats.html.

30
- 84866862121
- Robust simd: Dynamically adapted simd width and multi-threading depth
- May
- J. Meng, J. W. Sheaffer, and K. Skadron. Robust SIMD: Dynamically adapted simd width and multi-threading depth. In IPDPS, May 2012.
- (2012) IPDPS
- Meng, J.¹ Sheaffer, J.W.² Skadron, K.³

31
- 77954976292
- Dynamic warp subdivision for integrated branch and memory divergence tolerance
- June
- J. Meng, D. Tarjan, and K. Skadron. Dynamic warp subdivision for integrated branch and memory divergence tolerance. In ISCA, June 2010
- (2010) ISCA
- Meng, J.¹ Tarjan, D.² Skadron, K.³

32
- 84858391043
- Scalable gpu graph traversal
- Feb
- D. G. Merrill, M. Garland, and A. S. Grimshaw. Scalable GPU graph traversal. In PPoPP, Feb 2012.
- (2012) PPoPP
- Merrill, D.G.¹ Garland, M.² Grimshaw, A.S.³

33
- 84893573741
- METIS File Format. Web resource
- METIS File Format. Web resource. http://people.sc.fsu.edu/~jburkardt/ data/metis-graph/metis-graph.html.

34
- 84893573144
- GTGraph: A Suite of Synthetic Random Graph Generators. Web resource
- GTGraph: A Suite of Synthetic Random Graph Generators. Web resource. http://www.cse.psu.edu/~madduri/software/GTgraph/index. html.

35
- 84879777611
- A study on connected components labeling algorithms using gpus
- Aug
- V. M. A. Oliveira and R. A. Lotufo. A study on connected components labeling algorithms using GPUs. In SIBGRAFI, Aug 2010.
- (2010) SIBGRAFI
- Oliveira, V.M.A.¹ Lotufo, R.A.²

36
- 0003780986
- Technical Report SIDL-WP-1999- 01204, Stanford Univerisity
- L. Page, S. Brin, R. Motwani, and T. Winograd. The pagerank citation ranking: Bringing order to the web. Technical Report SIDL-WP-1999- 01204, Stanford Univerisity, 1999
- (1999) The Pagerank Citation Ranking: Bringing Order to the Web
- Page, L.¹ Brin, S.² Motwani, R.³ Winograd, T.⁴

37
- 35348913704
- Analysis of redundancy and application balance in the spec cpu2006 benchmark suite
- June
- A. Phansalkar, A. Joshi, and L. K. John. Analysis of redundancy and application balance in the SPEC CPU2006 benchmark suite. In 1SCA, June 2007.
- (2007) 1SCA
- Phansalkar, A.¹ Joshi, A.² John, L.K.³

38
- 84893527877
- AFDS 2012 Phil Rogers Keynote: The programmer's guide to a universe of possibility. Web resource
- AFDS 2012 Phil Rogers Keynote: The programmer's guide to a universe of possibility. Web resource. http://hsafoundation.com/ publications/.

39
- 78651284120
- Scan primitives for gpu computing
- Aug
- S. Sengupta, M. Harris, Y. Zhang, and J. D. Owens. Scan primitives for GPU computing. In GH, Aug 2007
- (2007) GH
- Sengupta, S.¹ Harris, M.² Zhang, Y.³ Owens, J.D.⁴

40
- 79955809263
- Fast network centrality analysis using gpus
- Z. Shi and B. Zhang. Fast network centrality analysis using gpus. BMC Bioinformatics, 12(140), 2011
- (2011) BMC Bioinformatics , vol.12 , Issue.140
- Shi, Z.¹ Zhang, B.²

41
- 84893557304
- Parboil Benchmark suite. Web resource
- Parboil Benchmark suite. Web resource. http://impact.crhc.illinois.edu/ parboil.php.

42
- 84893579755
- The University of Florida Sparse Matrix Collection. Web resource
- The University of Florida Sparse Matrix Collection. Web resource. http: IIwww.cise.u8.edu/research/sparse/matrices/.

43
- 70450194802
- Fast minimum spanning tree for large graphs on the gpu
- Jul
- V. Vineet, P. Harish, S. Patidar, and P. J. Narayanan. Fast minimum spanning tree for large graphs on the GPU. In HPG, Jul 2009.
- (2009) HPG
- Vineet, V.¹ Harish, P.² Patidar, S.³ Narayanan, P.J.⁴

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.