SCOPUS Information Retrieval Platform

Proceedings of the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPOPP

Volume , Issue , 2012, Pages 107-116

A GPU implementation of inclusion-based points-to analysis

(3) Méndez-Lojo, Mario a; Burtscher, Martin b; Pingali, Keshav a,c

a University of Texas at Austin (United States)

b Texas State University (United States)

c University of Texas at Austin (United States)

Author keywords

CUDA; GPU; Graph algorithms; Inclusion based points to analysis; Irregular programs

Indexed keywords

AS GRAPH; AVERAGE SPEED; BREADTH-FIRST SEARCH; CPU CORES; CUDA; DENSE ARRAYS; GPU; GPU IMPLEMENTATION; GRAPH ALGORITHMS; GRAPH ANALYSIS; GRAPHICS PROCESSING UNITS; MULTI CORE; PARALLEL IMPLEMENTATIONS; PARALLEL PROGRAM; POINTER-BASED DATA STRUCTURES; POINTS-TO ANALYSIS; UNDERLYING GRAPHS;

COMPUTER PROGRAMMING LANGUAGES; PARALLEL ARCHITECTURES; PARALLEL PROGRAMMING; PROGRAM PROCESSORS;

ALGORITHMS;

EID: 84858374841 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1145/2145816.2145831 Document Type: Conference Paper

Times cited : (42)

References (36)

1
- 78449295608
- NVIDIA's Next Generation CUDA Compute Architecture: Fermi. http://www.nvidia.com/content/PDF/fermi-white-papers/NVIDIA-Fermi-Compute-Architecture-Whitepaper.pdf, 2010.
- (2010) NVIDIA's Next Generation CUDA Compute Architecture: Fermi

2
- 84858404631
- CUDA C Programming Guide 4.0. NVIDIA, 2011.
- (2011) CUDA C Programming Guide 4.0. NVIDIA

3
- 0004273497
- Program analysis and specialization for the C programming language
- L. O. Andersen. Program Analysis and Specialization for the C Programming Language. PhD thesis, DIKU, University of Copenhagen, May 1994. (DIKU report 94/19).
- (1994) Program Analysis and Specialization for the C Programming Language
- Andersen, L.O.¹

4
- 34547399946
- Designing multithreaded algorithms for Breadth-First Search and st-connectivity on the Cray MTA-2
- DOI 10.1109/ICPP.2006.34, 1690657, ICPP 2006: Proceedings of the 2006 International Conference on Parallel Processing
- David A. Bader and Kamesh Madduri. Designing multithreaded algorithms for breadth-first search and st-connectivity on the Cray MTA-2. In Proceedings of the 2006 International Conference on Parallel Processing, ICPP'06, pages 523-530, Washington, DC, USA, 2006. IEEE Computer Society. (Pubitemid 47159081)
- (2006) Proceedings of the International Conference on Parallel Processing , pp. 523-530
- Bader, D.A.¹ Madduri, K.²

5
- 80053287330
- Computing strongly connected components in parallel on CUDA
- IEEE Computer Society
- J. Barnat, P. Bauch, L. Brim, and M. Češka. Computing Strongly Connected Components in Parallel on CUDA. In Proceedings of the 25th IEEE International Parallel & Distributed Processing Symposium (IPDPS'11), pages 541-552. IEEE Computer Society, 2011.
- (2011) Proceedings of the 25th IEEE International Parallel & Distributed Processing Symposium (IPDPS'11) , pp. 541-552
- Barnat, J.¹ Bauch, P.² Brim, L.³ Češka, M.⁴

6
- 0038716510
- Points-to analysis using BDDs
- New York, NY, USA, ACM
- Marc Berndl, Ondřej Lhoták, Feng Qian, Laurie Hendren, and Navindra Umanee. Points-to analysis using BDDs. In Proc. Conf. on Programming Language Design and Implementation (PLDI), pages 103-114, New York, NY, USA, 2003. ACM.
- (2003) Proc. Conf. on Programming Language Design and Implementation (PLDI) , pp. 103-114
- Berndl, M.¹ Lhoták, O.² Qian, F.³ Hendren, L.⁴ Umanee, N.⁵

7
- 33646563056
- Network Analysis: Methodological Foundations
- Springer-Verlag
- Ulrik Brandes and Thomas Erlebach, editors. Network Analysis: Methodological Foundations. Springer-Verlag, 2005.
- (2005) Network Analysis: Methodological Foundations
- Brandes, U.¹ Erlebach, T.²

8
- 0022769976
- Graph-based algorithms for boolean function manipulation
- Randal E. Bryant. Graph-based algorithms for boolean function manipulation. IEEE Transactions on Computers, 35:677-691, 1986.
- (1986) IEEE Transactions on Computers , vol.35 , pp. 677-691
- Bryant, R.E.¹

9
- 84858427151
- An efficient CUDA implementation of the tree-based Barnes Hut n-body algorithm
- Morgan Kaufmann
- Martin Burtscher and Keshav Pingali. An efficient CUDA implementation of the tree-based Barnes Hut n-body algorithm. In GPU Computing Gems Emerald Edition, pages 75-92. Morgan Kaufmann, 2011.
- (2011) GPU Computing Gems Emerald Edition , pp. 75-92
- Burtscher, M.¹ Pingali, K.²

10
- 51449118065
- A performance study of general-purpose applications on graphics processors using cuda
- October
- Shuai Che, Michael Boyer, Jiayuan Meng, David Tarjan, Jeremy W. Sheaffer, and Kevin Skadron. A performance study of general-purpose applications on graphics processors using cuda. J. Parallel Distrib. Comput., 68:1370-1380, October 2008.
- (2008) J. Parallel Distrib. Comput. , vol.68 , pp. 1370-1380
- Che, S.¹ Boyer, M.² Meng, J.³ Tarjan, D.⁴ Sheaffer, J.W.⁵ Skadron, K.⁶

11
- 0027803996
- Guaranteed-quality mesh generation for curved surfaces
- L. Paul Chew. Guaranteed-quality mesh generation for curved surfaces. In Proc. Symp. on Computational Geometry (SCG), 1993.
- (1993) Proc. Symp. on Computational Geometry (SCG)
- Chew, L.P.¹

12
- 0031630370
- Partial online cycle elimination in inclusion constraint graphs
- Manuel Fähndrich, Jeffrey S. Foster, Zhendong Su, and Alexander Aiken. Partial online cycle elimination in inclusion constraint graphs. In Proc. Conf. on Programming Language Design and Implementation (PLDI), pages 85-96, New York, NY, USA, 1998. ACM. (Pubitemid 128454787)
- (1998) SIGPLAN Notices (ACM Special Interest Group on Programming Languages) , vol.33 , Issue.5 , pp. 85-96
- Fähndrich, M.¹ Foster, J.S.² Su, Z.³ Aiken, A.⁴

13
- 35448946037
- The ant and the grasshopper: Fast and accurate pointer analysis for millions of lines of code
- Ben Hardekopf and Calvin Lin. The ant and the grasshopper: fast and accurate pointer analysis for millions of lines of code. In Proc. Conf. on Programming Language Design and Implementation (PLDI), 2007.
- (2007) Proc. Conf. on Programming Language Design and Implementation (PLDI)
- Hardekopf, B.¹ Lin, C.²

14
- 38349041620
- Accelerating large graph algorithms on the gpu using cuda
- Berlin, Heidelberg, Springer-Verlag
- Pawan Harish and P. J. Narayanan. Accelerating large graph algorithms on the gpu using cuda. In HiPC'07: Proceedings of the 14th international conference on High performance computing, pages 197-208, Berlin, Heidelberg, 2007. Springer-Verlag.
- (2007) HiPC'07: Proceedings of the 14th International Conference on High Performance Computing , pp. 197-208
- Harish, P.¹ Narayanan, P.J.²

15
- 18844428084
- Ultra-fast aliasing analysis using cla: A million lines of c code in a second
- Nevin Heintze and Olivier Tardieu. Ultra-fast aliasing analysis using cla: a million lines of c code in a second. SIGPLAN Not., 36(5):254-263, 2001.
- (2001) SIGPLAN Not. , vol.36 , Issue.5 , pp. 254-263
- Heintze, N.¹ Tardieu, O.²

16
- 0008525753
- Type inference and semi-unification
- New York, NY, USA, ACM
- Fritz Henglein. Type inference and semi-unification. In Proceedings of the 1988 ACM conference on LISP and functional programming, LFP'88, pages 184-197, New York, NY, USA, 1988. ACM.
- (1988) Proceedings of the 1988 ACM Conference on LISP and Functional Programming, LFP'88 , pp. 184-197
- Henglein, F.¹

17
- 0034825842
- Pointer analysis: Haven't we solved this problem yet?
- Michael Hind. Pointer analysis: haven't we solved this problem yet? In PASTE'01: Proceedings of the 2001 ACM SIGPLAN-SIGSOFT workshop on Program analysis for software tools and engineering, pages 54-61, New York, NY, USA, 2001. ACM. (Pubitemid 32861392)
- (2001) ACM SIGPLAN/SIGSOFT Workshop on Program Analysis for Software Tools and Engineering , pp. 54-61
- Hind, M.¹

18
- 79952811127
- Accelerating cuda graph algorithms at maximum warp
- New York, NY, USA, ACM
- Sungpack Hong, Sang Kyun Kim, Tayo Oguntebi, and Kunle Olukotun. Accelerating cuda graph algorithms at maximum warp. In Proceedings of the 16th ACM symposium on Principles and practice of parallel programming, PPoPP'11, pages 267-276, New York, NY, USA, 2011. ACM.
- (2011) Proceedings of the 16th ACM Symposium on Principles and Practice of Parallel Programming, PPoPP'11 , pp. 267-276
- Hong, S.¹ Kim, S.K.² Oguntebi, T.³ Olukotun, K.⁴

19
- 84856541553
- Efficient parallel graph exploration on multi-core cpu and gpu
- Sungpack Hong, Tayo Oguntebi, and Kunle Olukotun. Efficient parallel graph exploration on multi-core cpu and gpu. In 20th International Conference on Parallel Architectures and Compilation Techniques, PACT'11, 2011.
- (2011) 20th International Conference on Parallel Architectures and Compilation Techniques, PACT'11
- Hong, S.¹ Oguntebi, T.² Olukotun, K.³

20
- 70449914192
- On the energy efficiency of graphics processing units for scientific computing
- Song Huang, Shucai Xiao, and Wu chun Feng. On the energy efficiency of graphics processing units for scientific computing. In IPDPS, pages 1-8, 2009.
- (2009) IPDPS , pp. 1-8
- Huang, S.¹ Xiao, S.² Feng, W.C.³

21
- 35448941890
- Optimistic parallelism requires abstractions
- DOI 10.1145/1250734.1250759, PLDI'07: Proceedings of the 2007 ACM SIGPLAN Conference on Programming Language Design and Implementation
- Milind Kulkarni, Keshav Pingali, Bruce Walter, Ganesh Ramanarayanan, Kavita Bala, and L. Paul Chew. Optimistic parallelism requires abstractions. SIGPLAN Not. (Proceedings of PLDI), 42(6):211-222, 2007. (Pubitemid 47630689)
- (2007) Proceedings of the ACM SIGPLAN Conference on Programming Language Design and Implementation (PLDI) , pp. 211-222
- Kulkarni, M.¹ Pingali, K.² Walter, B.³ Ramanarayanan, G.⁴ Bala, K.⁵ Chew, L.P.⁶

22
- 77954995885
- Debunking the 100x gpu vs. cpu myth: An evaluation of throughput computing on cpu and gpu
- New York, NY, USA, ACM
- Victor W. Lee, Changkyu Kim, Jatin Chhugani, Michael Deisher, Daehyun Kim, Anthony D. Nguyen, Nadathur Satish, Mikhail Smelyanskiy, Srinivas Chennupaty, Per Hammarlund, Ronak Singhal, and Pradeep Dubey. Debunking the 100x gpu vs. cpu myth: an evaluation of throughput computing on cpu and gpu. In Proceedings of the 37th annual international symposium on Computer architecture, ISCA'10, pages 451-460, New York, NY, USA, 2010. ACM.
- (2010) Proceedings of the 37th Annual International Symposium on Computer Architecture, ISCA'10 , pp. 451-460
- Lee, V.W.¹ Kim, C.² Chhugani, J.³ Deisher, M.⁴ Kim, D.⁵ Nguyen, A.D.⁶ Satish, N.⁷ Smelyanskiy, M.⁸ Chennupaty, S.⁹ Hammarlund, P.¹⁰ Singhal, R.¹¹ Dubey, P.¹²

23
- 35248842644
- Scaling Java points-to analysis using Spark
- volume 2622 of LNCS, Warsaw, Poland, April, Springer
- Ondřej Lhoták and Laurie Hendren. Scaling Java points-to analysis using Spark. In G. Hedin, editor, Compiler Construction, 12th International Conference, volume 2622 of LNCS, pages 153-169, Warsaw, Poland, April 2003. Springer.
- (2003) G. Hedin, Editor, Compiler Construction, 12th International Conference , pp. 153-169
- Lhoták, O.¹ Hendren, L.²

24
- 77956200064
- An effective gpu implementation of breadth-first search
- New York, NY, USA, ACM
- Lijuan Luo, Martin Wong, and Wen-mei Hwu. An effective gpu implementation of breadth-first search. In Proceedings of the 47th Design Automation Conference, DAC'10, pages 52-55, New York, NY, USA, 2010. ACM.
- (2010) Proceedings of the 47th Design Automation Conference, DAC'10 , pp. 52-55
- Luo, L.¹ Wong, M.² Hwu, W.-M.³

25
- 79551677007
- Parallel inclusion-based points-to analysis
- October
- Mario Méndez-Lojo, Augustine Mathew, and Keshav Pingali. Parallel inclusion-based points-to analysis. In Proceedings of the 24th Annual ACM SIGPLAN Conference on Object-Oriented Programming, Systems, Languages, and Applications (OOPSLA'10), October 2010.
- (2010) Proceedings of the 24th Annual ACM SIGPLAN Conference on Object-Oriented Programming, Systems, Languages, and Applications (OOPSLA'10)
- Méndez-Lojo, M.¹ Mathew, A.² Pingali, K.³

26
- 84858391043
- Scalable gpu graph traversal
- Duane G. Merrill, Michael Garland, and Andrew S. Grimshaw. Scalable gpu graph traversal. In 17th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPoPP'12, 2012.
- (2012) 17th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPoPP'12
- Merrill, D.G.¹ Garland, M.² Grimshaw, A.S.³

27
- 79953089159
- Synthesizing concurrent schedulers for irregular algorithms
- Donald Nguyen and Keshav Pingali. Synthesizing concurrent schedulers for irregular algorithms. In ASPLOS'11: Proceedings of International Conference on Architectural Support for Programming Languages and Operating Systems, 2011.
- (2011) ASPLOS'11: Proceedings of International Conference on Architectural Support for Programming Languages and Operating Systems
- Nguyen, D.¹ Pingali, K.²

28
- 84878562399
- NVIDIA
- NVIDIA. Thrust library version 1.4.0. http://code.google.com/p/thrust/.
- Thrust Library Version 1.4.0.

29
- 79959878035
- The tao of parallelism in algorithms
- New York, NY, USA, ACM
- Keshav Pingali, Donald Nguyen, Milind Kulkarni, Martin Burtscher, M. Amber Hassaan, Rashid Kaleem, Tsung-Hsien Lee, Andrew Lenharth, Roman Manevich, Mario Méndez-Lojo, Dimitrios Prountzos, and Xin Sui. The tao of parallelism in algorithms. In Proceedings of the 32nd ACM SIGPLAN conference on Programming language design and implementation, PLDI'11, pages 12-25, New York, NY, USA, 2011. ACM.
- (2011) Proceedings of the 32nd ACM SIGPLAN Conference on Programming Language Design and Implementation, PLDI'11 , pp. 12-25
- Pingali, K.¹ Nguyen, D.² Kulkarni, M.³ Burtscher, M.⁴ Hassaan, M.A.⁵ Kaleem, R.⁶ Lee, T.-H.⁷ Lenharth, A.⁸ Manevich, R.⁹ Méndez-Lojo, M.¹⁰ Prountzos, D.¹¹ Sui, X.¹²

30
- 79251566519
- Eigencfa: Accelerating flow analysis with gpus
- New York, NY, USA, ACM
- Tarun Prabhu, Shreyas Ramalingam, Matthew Might, and Mary Hall. Eigencfa: accelerating flow analysis with gpus. In Proceedings of the 38th annual ACM SIGPLAN-SIGACT symposium on Principles of programming languages, POPL'11, pages 511-522, New York, NY, USA, 2011. ACM.
- (2011) Proceedings of the 38th Annual ACM SIGPLAN-SIGACT Symposium on Principles of Programming Languages, POPL'11 , pp. 511-522
- Prabhu, T.¹ Ramalingam, S.² Might, M.³ Hall, M.⁴

31
- 30544444823
- Program analysis via graph reachability
- Thomas W. Reps. Program analysis via graph reachability. Technical Report Number 1386, University of Wisconsin, 1998.
- (1998) Technical Report Number 1386, University of Wisconsin
- Reps, T.W.¹

32
- 17144372619
- Off-line variable substitution for scaling points-to analysis
- Atanas Rountev and Satish Chandra. Off-line variable substitution for scaling points-to analysis. In Proc. Conf. on Programming Language Design and Implementation (PLDI), pages 47-56, New York, NY, USA, 2000. ACM. (Pubitemid 32394082)
- (2000) Proceedings of the ACM SIGPLAN Conference on Programming Language Design and Implementation (PLDI) , pp. 47-56
- Rountev, A.¹ Chandra, S.²

33
- 0029717388
- Points-to analysis in almost linear time
- New York, NY, USA, ACM
- Bjarne Steensgaard. Points-to analysis in almost linear time. In POPL'96: Proceedings of the 23rd ACM SIGPLAN-SIGACT symposium on Principles of programming languages, pages 32-41, New York, NY, USA, 1996. ACM.
- (1996) POPL'96: Proceedings of the 23rd ACM SIGPLAN-SIGACT Symposium on Principles of Programming Languages , pp. 32-41
- Steensgaard, B.¹

34
- 85092761228
- On the limits of gpu acceleration
- Berkeley, CA, USA, USENIX Association
- Richard Vuduc, Aparna Chandramowlishwaran, Jee Choi, Murat Guney, and Aashay Shringarpure. On the limits of gpu acceleration. In Proceedings of the 2nd USENIX conference on Hot topics in parallelism, HotPar'10, pages 13-13, Berkeley, CA, USA, 2010. USENIX Association.
- (2010) Proceedings of the 2nd USENIX Conference on Hot Topics in Parallelism, HotPar'10 , pp. 13-13
- Vuduc, R.¹ Chandramowlishwaran, A.² Choi, J.³ Guney, M.⁴ Shringarpure, A.⁵

35
- 8344251741
- Cloning-based context-sensitive pointer alias analysis using binary decision diagrams
- New York, NY, USA, ACM
- John Whaley and Monica S. Lam. Cloning-based context-sensitive pointer alias analysis using binary decision diagrams. In Proc. Conf. on Programming Language Design and Implementation (PLDI), pages 131-144, New York, NY, USA, 2004. ACM.
- (2004) Proc. Conf. on Programming Language Design and Implementation (PLDI) , pp. 131-144
- Whaley, J.¹ Lam, M.S.²

36
- 33845388971
- A scalable distributed parallel breadth-first search algorithm on bluegene/l
- Washington, DC, USA, IEEE Computer Society
- Andy Yoo, Edmond Chow, Keith Henderson, William McLendon, Bruce Hendrickson, and Umit Catalyurek. A scalable distributed parallel breadth-first search algorithm on bluegene/l. In Proceedings of the 2005 ACM/IEEE conference on Supercomputing, SC'05, pages 25-, Washington, DC, USA, 2005. IEEE Computer Society.
- (2005) Proceedings of the 2005 ACM/IEEE Conference on Supercomputing, SC'05 , pp. 25
- Yoo, A.¹ Chow, E.² Henderson, K.³ Mclendon, W.⁴ Hendrickson, B.⁵ Catalyurek, U.⁶

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.