-
4
-
-
34547399946
-
Designing multithreaded algorithms for Breadth-First Search and st-connectivity on the Cray MTA-2
-
DOI 10.1109/ICPP.2006.34, 1690657, ICPP 2006: Proceedings of the 2006 International Conference on Parallel Processing
-
David A. Bader and Kamesh Madduri. Designing multithreaded algorithms for breadth-first search and st-connectivity on the Cray MTA-2. In Proceedings of the 2006 International Conference on Parallel Processing, ICPP'06, pages 523-530, Washington, DC, USA, 2006. IEEE Computer Society. (Pubitemid 47159081)
-
(2006)
Proceedings of the International Conference on Parallel Processing
, pp. 523-530
-
-
Bader, D.A.1
Madduri, K.2
-
5
-
-
80053287330
-
Computing strongly connected components in parallel on CUDA
-
IEEE Computer Society
-
J. Barnat, P. Bauch, L. Brim, and M. Češka. Computing Strongly Connected Components in Parallel on CUDA. In Proceedings of the 25th IEEE International Parallel & Distributed Processing Symposium (IPDPS'11), pages 541-552. IEEE Computer Society, 2011.
-
(2011)
Proceedings of the 25th IEEE International Parallel & Distributed Processing Symposium (IPDPS'11)
, pp. 541-552
-
-
Barnat, J.1
Bauch, P.2
Brim, L.3
Češka, M.4
-
6
-
-
0038716510
-
Points-to analysis using BDDs
-
New York, NY, USA, ACM
-
Marc Berndl, Ondřej Lhoták, Feng Qian, Laurie Hendren, and Navindra Umanee. Points-to analysis using BDDs. In Proc. Conf. on Programming Language Design and Implementation (PLDI), pages 103-114, New York, NY, USA, 2003. ACM.
-
(2003)
Proc. Conf. on Programming Language Design and Implementation (PLDI)
, pp. 103-114
-
-
Berndl, M.1
Lhoták, O.2
Qian, F.3
Hendren, L.4
Umanee, N.5
-
8
-
-
0022769976
-
Graph-based algorithms for boolean function manipulation
-
Randal E. Bryant. Graph-based algorithms for boolean function manipulation. IEEE Transactions on Computers, 35:677-691, 1986.
-
(1986)
IEEE Transactions on Computers
, vol.35
, pp. 677-691
-
-
Bryant, R.E.1
-
9
-
-
84858427151
-
An efficient CUDA implementation of the tree-based barnes hut n-body algorithm
-
Morgan Kaufmann
-
Martin Burtscher and Keshav Pingali. An efficient CUDA implementation of the tree-based barnes hut n-body algorithm. In GPU Computing Gems Emerald Edition, pages 75-92. Morgan Kaufmann, 2011.
-
(2011)
GPU Computing Gems Emerald Edition
, pp. 75-92
-
-
Burtscher, M.1
Pingali, K.2
-
10
-
-
51449118065
-
A performance study of general-purpose applications on graphics processors using cuda
-
October
-
Shuai Che, Michael Boyer, Jiayuan Meng, David Tarjan, Jeremy W. Sheaffer, and Kevin Skadron. A performance study of general-purpose applications on graphics processors using cuda. J. Parallel Distrib. Comput., 68:1370-1380, October 2008.
-
(2008)
J. Parallel Distrib. Comput.
, vol.68
, pp. 1370-1380
-
-
Che, S.1
Boyer, M.2
Meng, J.3
Tarjan, D.4
Sheaffer, J.W.5
Skadron, K.6
-
12
-
-
0031630370
-
Partial online cycle elimination in inclusion constraint graphs
-
Manuel Fähndrich, Jeffrey S. Foster, Zhendong Su, and Alexander Aiken. Partial online cycle elimination in inclusion constraint graphs. In Proc. Conf. on Programming Language Design and Implementation (PLDI), pages 85-96, New York, NY, USA, 1998. ACM. (Pubitemid 128454787)
-
(1998)
SIGPLAN Notices (ACM Special Interest Group on Programming Languages)
, vol.33
, Issue.5
, pp. 85-96
-
-
Fähndrich, M.1
Foster, J.S.2
Su, Z.3
Aiken, A.4
-
15
-
-
18844428084
-
Ultra-fast aliasing analysis using cla: A million lines of c code in a second
-
Nevin Heintze and Olivier Tardieu. Ultra-fast aliasing analysis using cla: a million lines of c code in a second. SIGPLAN Not., 36(5):254-263, 2001.
-
(2001)
SIGPLAN Not.
, vol.36
, Issue.5
, pp. 254-263
-
-
Heintze, N.1
Tardieu, O.2
-
16
-
-
0008525753
-
Type inference and semi-unification
-
New York, NY, USA, ACM
-
Fritz Henglein. Type inference and semi-unification. In Proceedings of the 1988 ACM conference on LISP and functional programming, LFP'88, pages 184-197, New York, NY, USA, 1988. ACM.
-
(1988)
Proceedings of the 1988 ACM Conference on LISP and Functional Programming, LFP'88
, pp. 184-197
-
-
Henglein, F.1
-
18
-
-
79952811127
-
Accelerating cuda graph algorithms at maximum warp
-
New York, NY, USA, ACM
-
Sungpack Hong, Sang Kyun Kim, Tayo Oguntebi, and Kunle Olukotun. Accelerating cuda graph algorithms at maximum warp. In Proceedings of the 16th ACM symposium on Principles and practice of parallel programming, PPoPP'11, pages 267-276, New York, NY, USA, 2011. ACM.
-
(2011)
Proceedings of the 16th ACM Symposium on Principles and Practice of Parallel Programming, PPoPP'11
, pp. 267-276
-
-
Hong, S.1
Kim, S.K.2
Oguntebi, T.3
Olukotun, K.4
-
20
-
-
70449914192
-
On the energy efficiency of graphics processing units for scientific computing
-
Song Huang, Shucai Xiao, and Wu-chun Feng. On the energy efficiency of graphics processing units for scientific computing. In IPDPS, pages 1-8, 2009.
-
(2009)
IPDPS
, pp. 1-8
-
-
Huang, S.1
Xiao, S.2
Feng, W.-C.3
-
21
-
-
35448941890
-
Optimistic parallelism requires abstractions
-
DOI 10.1145/1250734.1250759, PLDI'07: Proceedings of the 2007 ACM SIGPLAN Conference on Programming Language Design and Implementation
-
Milind Kulkarni, Keshav Pingali, Bruce Walter, Ganesh Ramanarayanan, Kavita Bala, and L. Paul Chew. Optimistic parallelism requires abstractions. SIGPLAN Not. (Proceedings of PLDI), 42(6):211-222, 2007. (Pubitemid 47630689)
-
(2007)
Proceedings of the ACM SIGPLAN Conference on Programming Language Design and Implementation (PLDI)
, pp. 211-222
-
-
Kulkarni, M.1
Pingali, K.2
Walter, B.3
Ramanarayanan, G.4
Bala, K.5
Chew, L.P.6
-
22
-
-
77954995885
-
Debunking the 100x gpu vs. cpu myth: An evaluation of throughput computing on cpu and gpu
-
New York, NY, USA, ACM
-
Victor W. Lee, Changkyu Kim, Jatin Chhugani, Michael Deisher, Daehyun Kim, Anthony D. Nguyen, Nadathur Satish, Mikhail Smelyanskiy, Srinivas Chennupaty, Per Hammarlund, Ronak Singhal, and Pradeep Dubey. Debunking the 100x gpu vs. cpu myth: an evaluation of throughput computing on cpu and gpu. In Proceedings of the 37th annual international symposium on Computer architecture, ISCA'10, pages 451-460, New York, NY, USA, 2010. ACM.
-
(2010)
Proceedings of the 37th Annual International Symposium on Computer Architecture, ISCA'10
, pp. 451-460
-
-
Lee, V.W.1
Kim, C.2
Chhugani, J.3
Deisher, M.4
Kim, D.5
Nguyen, A.D.6
Satish, N.7
Smelyanskiy, M.8
Chennupaty, S.9
Hammarlund, P.10
Singhal, R.11
Dubey, P.12
-
23
-
-
35248842644
-
Scaling Java points-to analysis using Spark
-
volume 2622 of LNCS, Warsaw, Poland, April, Springer
-
Ondřej Lhoták and Laurie Hendren. Scaling Java points-to analysis using Spark. In G. Hedin, editor, Compiler Construction, 12th International Conference, volume 2622 of LNCS, pages 153-169, Warsaw, Poland, April 2003. Springer.
-
(2003)
G. Hedin, Editor, Compiler Construction, 12th International Conference
, pp. 153-169
-
-
Lhoták, O.1
Hendren, L.2
-
24
-
-
77956200064
-
An effective gpu implementation of breadth-first search
-
New York, NY, USA, ACM
-
Lijuan Luo, Martin Wong, and Wen-mei Hwu. An effective gpu implementation of breadth-first search. In Proceedings of the 47th Design Automation Conference, DAC'10, pages 52-55, New York, NY, USA, 2010. ACM.
-
(2010)
Proceedings of the 47th Design Automation Conference, DAC'10
, pp. 52-55
-
-
Luo, L.1
Wong, M.2
Hwu, W.-M.3
-
25
-
-
79551677007
-
Parallel inclusion-based points-to analysis
-
October
-
Mario Méndez-Lojo, Augustine Mathew, and Keshav Pingali. Parallel inclusion-based points-to analysis. In Proceedings of the 24th Annual ACM SIGPLAN Conference on Object-Oriented Programming, Systems, Languages, and Applications (OOPSLA'10), October 2010.
-
(2010)
Proceedings of the 24th Annual ACM SIGPLAN Conference on Object-Oriented Programming, Systems, Languages, and Applications (OOPSLA'10)
-
-
Méndez-Lojo, M.1
Mathew, A.2
Pingali, K.3
-
29
-
-
79959878035
-
The tao of parallelism in algorithms
-
New York, NY, USA, ACM
-
Keshav Pingali, Donald Nguyen, Milind Kulkarni, Martin Burtscher, M. Amber Hassaan, Rashid Kaleem, Tsung-Hsien Lee, Andrew Lenharth, Roman Manevich, Mario Méndez-Lojo, Dimitrios Prountzos, and Xin Sui. The tao of parallelism in algorithms. In Proceedings of the 32nd ACM SIGPLAN conference on Programming language design and implementation, PLDI'11, pages 12-25, New York, NY, USA, 2011. ACM.
-
(2011)
Proceedings of the 32nd ACM SIGPLAN Conference on Programming Language Design and Implementation, PLDI'11
, pp. 12-25
-
-
Pingali, K.1
Nguyen, D.2
Kulkarni, M.3
Burtscher, M.4
Hassaan, M.A.5
Kaleem, R.6
Lee, T.-H.7
Lenharth, A.8
Manevich, R.9
Méndez-Lojo, M.10
Prountzos, D.11
Sui, X.12
-
30
-
-
79251566519
-
Eigencfa: Accelerating flow analysis with gpus
-
New York, NY, USA, ACM
-
Tarun Prabhu, Shreyas Ramalingam, Matthew Might, and Mary Hall. Eigencfa: accelerating flow analysis with gpus. In Proceedings of the 38th annual ACM SIGPLAN-SIGACT symposium on Principles of programming languages, POPL'11, pages 511-522, New York, NY, USA, 2011. ACM.
-
(2011)
Proceedings of the 38th Annual ACM SIGPLAN-SIGACT Symposium on Principles of Programming Languages, POPL'11
, pp. 511-522
-
-
Prabhu, T.1
Ramalingam, S.2
Might, M.3
Hall, M.4
-
33
-
-
0029717388
-
Points-to analysis in almost linear time
-
New York, NY, USA, ACM
-
Bjarne Steensgaard. Points-to analysis in almost linear time. In POPL'96: Proceedings of the 23rd ACM SIGPLAN-SIGACT symposium on Principles of programming languages, pages 32-41, New York, NY, USA, 1996. ACM.
-
(1996)
POPL'96: Proceedings of the 23rd ACM SIGPLAN-SIGACT Symposium on Principles of Programming Languages
, pp. 32-41
-
-
Steensgaard, B.1
-
34
-
-
85092761228
-
On the limits of gpu acceleration
-
Berkeley, CA, USA, USENIX Association
-
Richard Vuduc, Aparna Chandramowlishwaran, Jee Choi, Murat Guney, and Aashay Shringarpure. On the limits of gpu acceleration. In Proceedings of the 2nd USENIX conference on Hot topics in parallelism, HotPar'10, pages 13-13, Berkeley, CA, USA, 2010. USENIX Association.
-
(2010)
Proceedings of the 2nd USENIX Conference on Hot Topics in Parallelism, HotPar'10
, pp. 13-13
-
-
Vuduc, R.1
Chandramowlishwaran, A.2
Choi, J.3
Guney, M.4
Shringarpure, A.5
-
35
-
-
8344251741
-
Cloning-based context-sensitive pointer alias analysis using binary decision diagrams
-
New York, NY, USA, ACM
-
John Whaley and Monica S. Lam. Cloning-based context-sensitive pointer alias analysis using binary decision diagrams. In Proc. Conf. on Programming Language Design and Implementation (PLDI), pages 131-144, New York, NY, USA, 2004. ACM.
-
(2004)
Proc. Conf. on Programming Language Design and Implementation (PLDI)
, pp. 131-144
-
-
Whaley, J.1
Lam, M.S.2
-
36
-
-
33845388971
-
A scalable distributed parallel breadth-first search algorithm on bluegene/l
-
Washington, DC, USA, IEEE Computer Society
-
Andy Yoo, Edmond Chow, Keith Henderson, William McLendon, Bruce Hendrickson, and Umit Catalyurek. A scalable distributed parallel breadth-first search algorithm on bluegene/l. In Proceedings of the 2005 ACM/IEEE conference on Supercomputing, SC'05, pages 25-, Washington, DC, USA, 2005. IEEE Computer Society.
-
(2005)
Proceedings of the 2005 ACM/IEEE Conference on Supercomputing, SC'05
, pp. 25
-
-
Yoo, A.1
Chow, E.2
Henderson, K.3
McLendon, W.4
Hendrickson, B.5
Catalyurek, U.6