SCOPUS 정보 검색 플랫폼

Proceedings - IEEE 27th International Parallel and Distributed Processing Symposium, IPDPS 2013

Volumn , Issue , 2013, Pages 463-474

Data-driven versus topology-driven irregular computations on GPUs

(3) Nasre, Rupesh a Burtscher, Martin b Pingali, Keshav a

a UNIVERSITY OF TEXAS AT AUSTIN (United States)

b TEXAS STATE UNIVERSITY (United States)

Author keywords

algorithmic properties; data driven; GPGPU; irregular algorithms; topology driven

Indexed keywords

ALGORITHMIC PROPERTIES; DATA-DRIVEN; GPGPU; GPU IMPLEMENTATION; GRAPH ALGORITHMS; IRREGULAR COMPUTATIONS; TOPOLOGY-DRIVEN; UNDIRECTED GRAPH;

ALGORITHMS; DIRECTED GRAPHS; DISTRIBUTED PARAMETER NETWORKS; OPTIMIZATION; PROGRAM PROCESSORS; TREES (MATHEMATICS);

TOPOLOGY;

EID: 84875980776 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/IPDPS.2013.28 Document Type: Conference Paper

Times cited : (107)

References (27)

1
- 0004072686
- Addison Wesley
- A. Aho, R. Sethi, and J. Ullman. Compilers: principles, techniques, and tools. Addison Wesley, 1986.
- (1986) Compilers: Principles, Techniques, and Tools
- Aho, A.¹ Sethi, R.² Ullman, J.³

2
- 33846349887
- A hierarchical O(N log N) force-calculation algorithm
- December
- J. Barnes and P. Hut. A hierarchical O(N log N) force-calculation algorithm. Nature, 324(4), December 1986.
- (1986) Nature , vol.324 , Issue.4
- Barnes, J.¹ Hut, P.²

3
- 0000269759
- Scheduling multithreaded computations by work stealing
- Robert D. Blumofe and Charles E. Leiserson. Scheduling multithreaded computations by work stealing. J. ACM, 46(5):720-748, 1999.
- (1999) J. ACM , vol.46 , Issue.5 , pp. 720-748
- Blumofe, R.D.¹ Leiserson, C.E.²

4
- 26944443478
- Survey propagation: An algorithm for satisfiability
- DOI 10.1002/rsa.20057
- A. Braunstein, M. Mèzard, and R. Zecchina. Survey propagation: An algorithm for satisfiability. Random Structures and Algorithms, 27(2):201-226, 2005. (Pubitemid 41482546)
- (2005) Random Structures and Algorithms , vol.27 , Issue.2 , pp. 201-226
- Braunstein, A.¹ Mezard, M.² Zecchina, R.³

5
- 84884834081
- CMU 15-418 Spring Final Project Report
- Bruant, Hugues. Parallel simulation of cellular automata, CMU 15-418 (Spring 2012) Final Project Report. http://www.andrew.cmu.edu/user/hbruant/ 15418/finalreport.html.
- (2012) Parallel Simulation of Cellular Automata
- Bruant, H.¹

6
- 84858427151
- An efficient CUDA implementation of the tree-based barnes hut n-body algorithm
- Morgan Kaufmann
- Martin Burtscher and Keshav Pingali. An efficient CUDA implementation of the tree-based barnes hut n-body algorithm. In GPU Computing Gems Emerald Edition, pages 75-92. Morgan Kaufmann, 2011.
- (2011) GPU Computing Gems Emerald Edition , pp. 75-92
- Burtscher, M.¹ Pingali, K.²

7
- 51449118065
- A performance study of general-purpose applications on graphics processors using CUDA
- October
- Shuai Che, Michael Boyer, Jiayuan Meng, David Tarjan, Jeremy W. Sheaffer, and Kevin Skadron. A performance study of general-purpose applications on graphics processors using CUDA. Journal of Parallel and Distributing Computing, 68:1370-1380, October 2008.
- (2008) Journal of Parallel and Distributing Computing , vol.68 , pp. 1370-1380
- Che, S.¹ Boyer, M.² Meng, J.³ Tarjan, D.⁴ Sheaffer, J.W.⁵ Skadron, K.⁶

8
- 0027803996
- Guaranteed-quality mesh generation for curved surfaces
- L. Paul Chew. Guaranteed-quality mesh generation for curved surfaces. In Proceedings of the Symposium on Computational Geometry (SCG), 1993.
- Proceedings of the Symposium on Computational Geometry (SCG), 1993
- Chew, L.P.¹

9
- 0004116989
- McGraw Hill
- Thomas H. Cormen, Charles E. Leiserson, and Ronald L. Rivest. Introduction to Algorithms, McGraw Hill, 2001.
- (2001) Introduction to Algorithms
- Cormen, T.H.¹ Leiserson, C.E.² Rivest, R.L.³

10
- 84870690379
- A Study of Persistent Threads Style GPU Programming for GPGPU Workloads
- may
- Kshitij Gupta, Jeff A. Stuart, and John D. Owens. A Study of Persistent Threads Style GPU Programming for GPGPU Workloads. In Innovative Parallel Computing, page 14, may 2012.
- (2012) Innovative Parallel Computing , pp. 14
- Gupta, K.¹ Stuart, J.A.² Owens, J.D.³

11
- 38349041620
- Accelerating large graph algorithms on the GPU using CUDA
- Berlin, Heidelberg, Springer-Verlag
- Pawan Harish and P. J. Narayanan. Accelerating large graph algorithms on the GPU using CUDA. In HiPC'07: Proceedings of the 14th international conference on High performance computing, pages 197-208, Berlin, Heidelberg, 2007. Springer-Verlag.
- (2007) HiPC'07: Proceedings of the 14th International Conference on High Performance Computing , pp. 197-208
- Harish, P.¹ Narayanan, P.J.²

12
- 70450200605
- Technical Report Technical Report Number IIIT/TR/2009/74, International Institute of Information Technology Hyderabad
- Pawan Harish, Vibhav Vineet, and P. J. Narayanan. Large Graph Algorithms for Massively Multithreaded Architectures. Technical Report Technical Report Number IIIT/TR/2009/74, International Institute of Information Technology Hyderabad, 2009.
- (2009) Large Graph Algorithms for Massively Multithreaded Architectures
- Harish, P.¹ Vineet, V.² Narayanan, P.J.³

13
- 58549112478
- Transactional boosting: A methodology for highly-concurrent transactional objects
- New York, NY, USA, ACM
- Maurice Herlihy and Eric Koskinen. Transactional boosting: a methodology for highly-concurrent transactional objects. In PPoPP '08: Proceedings of the 13th ACM SIGPLAN Symposium on Principles and practice of parallel programming, pages 207-216, New York, NY, USA, 2008. ACM.
- (2008) PPoPP '08: Proceedings of the 13th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming , pp. 207-216
- Herlihy, M.¹ Koskinen, E.²

14
- 34548587507
- Focused Community Discovery
- Kirsten Hildrum and Philip S. Yu. Focused Community Discovery. In International Conference on Data Mining, 2005.
- International Conference on Data Mining, 2005
- Hildrum, K.¹ Yu, P.S.²

15
- 79952811127
- Accelerating CUDA graph algorithms at maximum warp
- New York, NY, USA, ACM
- Sungpack Hong, Sang Kyun Kim, Tayo Oguntebi, and Kunle Olukotun. Accelerating CUDA graph algorithms at maximum warp. In Proceedings of the 16th ACM Symposium on Principles and Practice of Parallel Programming, pages 267-276, New York, NY, USA, 2011. ACM.
- (2011) Proceedings of the 16th ACM Symposium on Principles and Practice of Parallel Programming , pp. 267-276
- Hong, S.¹ Kim, S.K.² Oguntebi, T.³ Olukotun, K.⁴

16
- 84863423999
- Dynamically managed data for CPU-GPU architectures
- New York, NY, USA, ACM
- Thomas B. Jablin, James A. Jablin, Prakash Prabhu, Feng Liu, and David I. August. Dynamically managed data for CPU-GPU architectures. In Proceedings of the Tenth International Symposium on Code Generation and Optimization, pages 165-174, New York, NY, USA, 2012. ACM.
- (2012) Proceedings of the Tenth International Symposium on Code Generation and Optimization , pp. 165-174
- Jablin, T.B.¹ Jablin, J.A.² Prabhu, P.³ Liu, F.⁴ August, D.I.⁵

17
- 35448941890
- Optimistic parallelism requires abstractions
- Milind Kulkarni, Keshav Pingali, Bruce Walter, Ganesh Ramanarayanan, Kavita Bala, and L. Paul Chew. Optimistic parallelism requires abstractions. SIGPLAN Notices (Proceedings of PLDI), 42(6):211-222, 2007.
- (2007) SIGPLAN Notices (Proceedings of PLDI) , vol.42 , Issue.6 , pp. 211-222
- Kulkarni, M.¹ Pingali, K.² Walter, B.³ Ramanarayanan, G.⁴ Bala, K.⁵ Chew, L.P.⁶

18
- 68849093624
- CUDA Solutions for the SSSP Problem
- Berlin, Heidelberg, Springer-Verlag
- Pedro J. Martín, Roberto Torres, and Antonio Gavilanes. CUDA Solutions for the SSSP Problem. In Proceedings of the 9th International Conference on Computational Science: Part I, pages 904-913, Berlin, Heidelberg, 2009. Springer-Verlag.
- (2009) Proceedings of the 9th International Conference on Computational Science: Part I , pp. 904-913
- Martín, P.J.¹ Torres, R.² Gavilanes, A.³

19
- 84878605997
- A GPU implementation of inclusion-based points-to analysis
- New York, NY, USA, ACM
- Mario Mendez-Lojo, Martin Burtscher, and Keshav Pingali. A GPU implementation of inclusion-based points-to analysis. In Proceedings of the 17th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, pages 107-116, New York, NY, USA, 2012. ACM.
- (2012) Proceedings of the 17th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming , pp. 107-116
- Mendez-Lojo, M.¹ Burtscher, M.² Pingali, K.³

20
- 84858391043
- Scalable GPU Graph Traversal
- Duane G. Merrill, Michael Garland, and Andrew S. Grimshaw. Scalable GPU Graph Traversal. In 17th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2012.
- 17th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2012
- Merrill, D.G.¹ Garland, M.² Grimshaw, A.S.³

21
- 0022678067
- Distributed discrete-event simulation
- Jayadev Misra. Distributed discrete-event simulation. ACM Computing Surveys, 18(1):39-65, 1986.
- (1986) ACM Computing Surveys , vol.18 , Issue.1 , pp. 39-65
- Misra, J.¹

22
- 84875132103
- Morph Algorithms on GPUs
- Rupesh Nasre, Martin Burtscher, and Keshav Pingali. Morph Algorithms on GPUs. In Proceedings of the 18th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPoPP '13, 2013.
- Proceedings of the 18th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPoPP '13, 2013
- Nasre, R.¹ Burtscher, M.² Pingali, K.³

23
- 79953089159
- Synthesizing concurrent schedulers for irregular algorithms
- Donald Nguyen and Keshav Pingali. Synthesizing concurrent schedulers for irregular algorithms. In ASPLOS '11: Proceedings of International Conference on Architectural Support for Programming Languages and Operating Systems, 2011.
- ASPLOS '11: Proceedings of International Conference on Architectural Support for Programming Languages and Operating Systems, 2011
- Nguyen, D.¹ Pingali, K.²

24
- 79959878035
- The tao of parallelism in algorithms
- New York, NY, USA, ACM
- Keshav Pingali, Donald Nguyen, Milind Kulkarni, Martin Burtscher, M. Amber Hassaan, Rashid Kaleem, Tsung-Hsien Lee, Andrew Lenharth, Roman Manevich, Mario Méndez-Lojo, Dimitrios Prountzos, and Xin Sui. The tao of parallelism in algorithms. In Proceedings of the 32nd ACM SIGPLAN conference on Programming language design and implementation, pages 12-25, New York, NY, USA, 2011. ACM.
- (2011) Proceedings of the 32nd ACM SIGPLAN Conference on Programming Language Design and Implementation , pp. 12-25
- Pingali, K.¹ Nguyen, D.² Kulkarni, M.³ Burtscher, M.⁴ Hassaan, M.A.⁵ Kaleem, R.⁶ Lee, T.-H.⁷ Lenharth, A.⁸ Manevich, R.⁹ Méndez-Lojo, M.¹⁰ Prountzos, D.¹¹ Sui, X.¹²

25
- 25144439604
- Pearson Addison Wesley
- Pang-Ning Tan, Michael Steinbach, and Vipin Kumar, editors. Introduction to Data Mining. Pearson Addison Wesley, 2005.
- (2005) Introduction to Data Mining
- Tan, P.-N.¹ Steinbach, M.² Kumar, V.³

26
- 84934313374
- Task management for irregular-parallel workloads on the gpu
- Aire-la-Ville, Switzerland, Switzerland, Eurographics Association
- Stanley Tzeng, Anjul Patney, and John D. Owens. Task management for irregular-parallel workloads on the gpu. In Proceedings of the Conference on High Performance Graphics, HPG '10, pages 29-37, Aire-la-Ville, Switzerland, Switzerland, 2010. Eurographics Association.
- (2010) Proceedings of the Conference on High Performance Graphics, HPG '10 , pp. 29-37
- Tzeng, S.¹ Patney, A.² Owens, J.D.³

27
- 79953126288
- On-the-fly elimination of dynamic irregularities for GPU computing
- New York, NY, USA, ACM
- Eddy Z. Zhang, Yunlian Jiang, Ziyu Guo, Kai Tian, and Xipeng Shen. On-the-fly elimination of dynamic irregularities for GPU computing. In Proceedings of the Sixteenth International Conference on Architectural Support for Programming Languages and Operating Systems, pages 369-380, New York, NY, USA, 2011. ACM.
- (2011) Proceedings of the Sixteenth International Conference on Architectural Support for Programming Languages and Operating Systems , pp. 369-380
- Zhang, E.Z.¹ Jiang, Y.² Guo, Z.³ Tian, K.⁴ Shen, X.⁵

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.