-
1
-
-
0004072686
-
-
Addison Wesley
-
A. Aho, R. Sethi, and J. Ullman. Compilers: principles, techniques, and tools. Addison Wesley, 1986.
-
(1986)
Compilers: Principles, Techniques, and Tools
-
-
Aho, A.1
Sethi, R.2
Ullman, J.3
-
2
-
-
33846349887
-
A hierarchical O(N log N) force-calculation algorithm
-
December
-
J. Barnes and P. Hut. A hierarchical O(N log N) force-calculation algorithm. Nature, 324(4), December 1986.
-
(1986)
Nature
, vol.324
, Issue.4
-
-
Barnes, J.1
Hut, P.2
-
3
-
-
0000269759
-
Scheduling multithreaded computations by work stealing
-
Robert D. Blumofe and Charles E. Leiserson. Scheduling multithreaded computations by work stealing. J. ACM, 46(5):720-748, 1999.
-
(1999)
J. ACM
, vol.46
, Issue.5
, pp. 720-748
-
-
Blumofe, R.D.1
Leiserson, C.E.2
-
4
-
-
26944443478
-
Survey propagation: An algorithm for satisfiability
-
DOI 10.1002/rsa.20057
-
A. Braunstein, M. Mèzard, and R. Zecchina. Survey propagation: An algorithm for satisfiability. Random Structures and Algorithms, 27(2):201-226, 2005. (Pubitemid 41482546)
-
(2005)
Random Structures and Algorithms
, vol.27
, Issue.2
, pp. 201-226
-
-
Braunstein, A.1
Mezard, M.2
Zecchina, R.3
-
5
-
-
84884834081
-
-
CMU 15-418 Spring Final Project Report
-
Bruant, Hugues. Parallel simulation of cellular automata, CMU 15-418 (Spring 2012) Final Project Report. http://www.andrew.cmu.edu/user/hbruant/ 15418/finalreport.html.
-
(2012)
Parallel Simulation of Cellular Automata
-
-
Bruant, H.1
-
6
-
-
84858427151
-
An efficient CUDA implementation of the tree-based barnes hut n-body algorithm
-
Morgan Kaufmann
-
Martin Burtscher and Keshav Pingali. An efficient CUDA implementation of the tree-based barnes hut n-body algorithm. In GPU Computing Gems Emerald Edition, pages 75-92. Morgan Kaufmann, 2011.
-
(2011)
GPU Computing Gems Emerald Edition
, pp. 75-92
-
-
Burtscher, M.1
Pingali, K.2
-
7
-
-
51449118065
-
A performance study of general-purpose applications on graphics processors using CUDA
-
October
-
Shuai Che, Michael Boyer, Jiayuan Meng, David Tarjan, Jeremy W. Sheaffer, and Kevin Skadron. A performance study of general-purpose applications on graphics processors using CUDA. Journal of Parallel and Distributing Computing, 68:1370-1380, October 2008.
-
(2008)
Journal of Parallel and Distributing Computing
, vol.68
, pp. 1370-1380
-
-
Che, S.1
Boyer, M.2
Meng, J.3
Tarjan, D.4
Sheaffer, J.W.5
Skadron, K.6
-
10
-
-
84870690379
-
A Study of Persistent Threads Style GPU Programming for GPGPU Workloads
-
may
-
Kshitij Gupta, Jeff A. Stuart, and John D. Owens. A Study of Persistent Threads Style GPU Programming for GPGPU Workloads. In Innovative Parallel Computing, page 14, may 2012.
-
(2012)
Innovative Parallel Computing
, pp. 14
-
-
Gupta, K.1
Stuart, J.A.2
Owens, J.D.3
-
13
-
-
58549112478
-
Transactional boosting: A methodology for highly-concurrent transactional objects
-
New York, NY, USA, ACM
-
Maurice Herlihy and Eric Koskinen. Transactional boosting: a methodology for highly-concurrent transactional objects. In PPoPP '08: Proceedings of the 13th ACM SIGPLAN Symposium on Principles and practice of parallel programming, pages 207-216, New York, NY, USA, 2008. ACM.
-
(2008)
PPoPP '08: Proceedings of the 13th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming
, pp. 207-216
-
-
Herlihy, M.1
Koskinen, E.2
-
15
-
-
79952811127
-
Accelerating CUDA graph algorithms at maximum warp
-
New York, NY, USA, ACM
-
Sungpack Hong, Sang Kyun Kim, Tayo Oguntebi, and Kunle Olukotun. Accelerating CUDA graph algorithms at maximum warp. In Proceedings of the 16th ACM Symposium on Principles and Practice of Parallel Programming, pages 267-276, New York, NY, USA, 2011. ACM.
-
(2011)
Proceedings of the 16th ACM Symposium on Principles and Practice of Parallel Programming
, pp. 267-276
-
-
Hong, S.1
Kim, S.K.2
Oguntebi, T.3
Olukotun, K.4
-
16
-
-
84863423999
-
Dynamically managed data for CPU-GPU architectures
-
New York, NY, USA, ACM
-
Thomas B. Jablin, James A. Jablin, Prakash Prabhu, Feng Liu, and David I. August. Dynamically managed data for CPU-GPU architectures. In Proceedings of the Tenth International Symposium on Code Generation and Optimization, pages 165-174, New York, NY, USA, 2012. ACM.
-
(2012)
Proceedings of the Tenth International Symposium on Code Generation and Optimization
, pp. 165-174
-
-
Jablin, T.B.1
Jablin, J.A.2
Prabhu, P.3
Liu, F.4
August, D.I.5
-
17
-
-
35448941890
-
Optimistic parallelism requires abstractions
-
Milind Kulkarni, Keshav Pingali, Bruce Walter, Ganesh Ramanarayanan, Kavita Bala, and L. Paul Chew. Optimistic parallelism requires abstractions. SIGPLAN Notices (Proceedings of PLDI), 42(6):211-222, 2007.
-
(2007)
SIGPLAN Notices (Proceedings of PLDI)
, vol.42
, Issue.6
, pp. 211-222
-
-
Kulkarni, M.1
Pingali, K.2
Walter, B.3
Ramanarayanan, G.4
Bala, K.5
Chew, L.P.6
-
18
-
-
68849093624
-
CUDA Solutions for the SSSP Problem
-
Berlin, Heidelberg, Springer-Verlag
-
Pedro J. Martín, Roberto Torres, and Antonio Gavilanes. CUDA Solutions for the SSSP Problem. In Proceedings of the 9th International Conference on Computational Science: Part I, pages 904-913, Berlin, Heidelberg, 2009. Springer-Verlag.
-
(2009)
Proceedings of the 9th International Conference on Computational Science: Part I
, pp. 904-913
-
-
Martín, P.J.1
Torres, R.2
Gavilanes, A.3
-
19
-
-
84878605997
-
A GPU implementation of inclusion-based points-to analysis
-
New York, NY, USA, ACM
-
Mario Mendez-Lojo, Martin Burtscher, and Keshav Pingali. A GPU implementation of inclusion-based points-to analysis. In Proceedings of the 17th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, pages 107-116, New York, NY, USA, 2012. ACM.
-
(2012)
Proceedings of the 17th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming
, pp. 107-116
-
-
Mendez-Lojo, M.1
Burtscher, M.2
Pingali, K.3
-
21
-
-
0022678067
-
Distributed discrete-event simulation
-
Jayadev Misra. Distributed discrete-event simulation. ACM Computing Surveys, 18(1):39-65, 1986.
-
(1986)
ACM Computing Surveys
, vol.18
, Issue.1
, pp. 39-65
-
-
Misra, J.1
-
24
-
-
79959878035
-
The tao of parallelism in algorithms
-
New York, NY, USA, ACM
-
Keshav Pingali, Donald Nguyen, Milind Kulkarni, Martin Burtscher, M. Amber Hassaan, Rashid Kaleem, Tsung-Hsien Lee, Andrew Lenharth, Roman Manevich, Mario Méndez-Lojo, Dimitrios Prountzos, and Xin Sui. The tao of parallelism in algorithms. In Proceedings of the 32nd ACM SIGPLAN conference on Programming language design and implementation, pages 12-25, New York, NY, USA, 2011. ACM.
-
(2011)
Proceedings of the 32nd ACM SIGPLAN Conference on Programming Language Design and Implementation
, pp. 12-25
-
-
Pingali, K.1
Nguyen, D.2
Kulkarni, M.3
Burtscher, M.4
Hassaan, M.A.5
Kaleem, R.6
Lee, T.-H.7
Lenharth, A.8
Manevich, R.9
Méndez-Lojo, M.10
Prountzos, D.11
Sui, X.12
-
26
-
-
84934313374
-
Task management for irregular-parallel workloads on the gpu
-
Aire-la-Ville, Switzerland, Switzerland, Eurographics Association
-
Stanley Tzeng, Anjul Patney, and John D. Owens. Task management for irregular-parallel workloads on the gpu. In Proceedings of the Conference on High Performance Graphics, HPG '10, pages 29-37, Aire-la-Ville, Switzerland, Switzerland, 2010. Eurographics Association.
-
(2010)
Proceedings of the Conference on High Performance Graphics, HPG '10
, pp. 29-37
-
-
Tzeng, S.1
Patney, A.2
Owens, J.D.3
-
27
-
-
79953126288
-
On-the-fly elimination of dynamic irregularities for GPU computing
-
New York, NY, USA, ACM
-
Eddy Z. Zhang, Yunlian Jiang, Ziyu Guo, Kai Tian, and Xipeng Shen. On-the-fly elimination of dynamic irregularities for GPU computing. In Proceedings of the Sixteenth International Conference on Architectural Support for Programming Languages and Operating Systems, pages 369-380, New York, NY, USA, 2011. ACM.
-
(2011)
Proceedings of the Sixteenth International Conference on Architectural Support for Programming Languages and Operating Systems
, pp. 369-380
-
-
Zhang, E.Z.1
Jiang, Y.2
Guo, Z.3
Tian, K.4
Shen, X.5
|