-
2
-
-
84878525321
-
-
Accessed: 07-11
-
9th DIMACS Implementation Challenge: http://www.dis.uniroma1.it/ ~challenge9/download.shtml. Accessed: 2011-07-11.
-
(2011)
9th DIMACS Implementation Challenge
-
-
-
3
-
-
78650848150
-
Scalable graph exploration on multicore processors
-
New Orleans, LA, USA, Nov. 2010
-
Agarwal, V. et al. 2010. Scalable Graph Exploration on Multicore Processors. 2010 ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis (New Orleans, LA, USA, Nov. 2010), 1-11.
-
(2010)
2010 ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis
, pp. 1-11
-
-
Agarwal, V.1
-
4
-
-
34547399946
-
Designing multithreaded algorithms for breadth-first search and st-connectivity on the cray MTA-2
-
Columbus, OH, USA
-
Bader, D.A. and Madduri, K. Designing Multithreaded Algorithms for Breadth-First Search and st-connectivity on the Cray MTA-2. 2006 International Conference on Parallel Processing (ICPP'06) (Columbus, OH, USA), 523-530.
-
2006 International Conference on Parallel Processing (ICPP'06)
, pp. 523-530
-
-
Bader, D.A.1
Madduri, K.2
-
5
-
-
33745125067
-
On the architectural requirements for efficient execution of graph algorithms
-
Oslo, Norway
-
Bader, D.A. et al. On the Architectural Requirements for Efficient Execution of Graph Algorithms. 2005 International Conference on Parallel Processing (ICPP'05) (Oslo, Norway), 547-556.
-
(2005)
International Conference on Parallel Processing (ICPP'05)
, pp. 547-556
-
-
Bader, D.A.1
-
6
-
-
74049143158
-
Implementing sparse matrix-vector multiplication on throughput-oriented processors
-
New York, NY, USA, 2009
-
Bell, N. and Garland, M. 2009. Implementing sparse matrix-vector multiplication on throughput-oriented processors. Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis (New York, NY, USA, 2009), 18:1-18:11.
-
(2009)
Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis
, pp. 1-11
-
-
Bell, N.1
Garland, M.2
-
8
-
-
0024770039
-
Scans as primitive parallel operations
-
Nov. 1989
-
Blelloch, G.E. 1989. Scans as primitive parallel operations. IEEE Transactions on Computers. 38, 11 (Nov. 1989), 1526-1538.
-
(1989)
IEEE Transactions on Computers
, vol.38
, Issue.11
, pp. 1526-1538
-
-
Blelloch, G.E.1
-
9
-
-
0025550099
-
Scan primitives for vector computers
-
Los Alamitos, CA, USA, 1990
-
Chatterjee, S. et al. 1990. Scan primitives for vector computers. Proceedings of the 1990 ACM/IEEE conference on Supercomputing (Los Alamitos, CA, USA, 1990), 666-675.
-
(1990)
Proceedings of the 1990 ACM/IEEE Conference on Supercomputing
, pp. 666-675
-
-
Chatterjee, S.1
-
10
-
-
70649092154
-
Rodinia: A benchmark suite for heterogeneous computing
-
Austin, TX, USA, Oct. 2009
-
Che, S. et al. 2009. Rodinia: A benchmark suite for heterogeneous computing. 2009 IEEE International Symposium on Workload Characterization (IISWC) (Austin, TX, USA, Oct. 2009), 44-54.
-
(2009)
2009 IEEE International Symposium on Workload Characterization (IISWC)
, pp. 44-54
-
-
Che, S.1
-
12
-
-
76349105923
-
Taming irregular EDA applications on GPUs
-
New York, NY, USA, 2009
-
Deng, Y. (Steve) et al. 2009. Taming irregular EDA applications on GPUs. Proceedings of the 2009 International Conference on Computer-Aided Design (New York, NY, USA, 2009), 539-546.
-
(2009)
Proceedings of the 2009 International Conference on Computer-Aided Design
, pp. 539-546
-
-
Deng, Y.1
-
13
-
-
57349184047
-
Fast scan algorithms on graphics processors
-
New York, NY, USA, 2008
-
Dotsenko, Y. et al. 2008. Fast scan algorithms on graphics processors. Proceedings of the 22nd annual international conference on Supercomputing (New York, NY, USA, 2008), 205-213.
-
(2008)
Proceedings of the 22nd Annual International Conference on Supercomputing
, pp. 205-213
-
-
Dotsenko, Y.1
-
14
-
-
51549093017
-
Sparse matrix computations on manycore GPU's
-
New York, NY, USA, 2008
-
Garland, M. 2008. Sparse matrix computations on manycore GPU's. Proceedings of the 45th annual Design Automation Conference (New York, NY, USA, 2008), 2-6.
-
(2008)
Proceedings of the 45th Annual Design Automation Conference
, pp. 2-6
-
-
Garland, M.1
-
16
-
-
38349041620
-
Accelerating large graph algorithms on the GPU using CUDA
-
Berlin, Heidelberg, 2007
-
Harish, P. and Narayanan, P.J. 2007. Accelerating large graph algorithms on the GPU using CUDA. Proceedings of the 14th international conference on High performance computing (Berlin, Heidelberg, 2007), 197-208.
-
(2007)
Proceedings of the 14th International Conference on High Performance Computing
, pp. 197-208
-
-
Harish, P.1
Narayanan, P.J.2
-
17
-
-
0022882379
-
Data parallel algorithms
-
Dec. 1986
-
Hillis, W.D. and Steele, G.L. 1986. Data parallel algorithms. Communications of the ACM. 29, 12 (Dec. 1986), 1170-1183.
-
(1986)
Communications of the ACM
, vol.29
, Issue.12
, pp. 1170-1183
-
-
Hillis, W.D.1
Steele, G.L.2
-
18
-
-
79952811127
-
Accelerating CUDA graph algorithms at maximum warp
-
New York, NY, USA, 2011
-
Hong, S. et al. 2011. Accelerating CUDA graph algorithms at maximum warp. Proceedings of the 16th ACM symposium on Principles and practice of parallel programming (New York, NY, USA, 2011), 267-276.
-
(2011)
Proceedings of the 16th ACM Symposium on Principles and Practice of Parallel Programming
, pp. 267-276
-
-
Hong, S.1
-
19
-
-
84858417648
-
-
(New York, NY, USA, 2011), to appear
-
Hong, S. et al. 2011. Efficient Parallel Graph Exploration for Multi-Core CPU and GPU. (New York, NY, USA, 2011), to appear.
-
(2011)
Efficient Parallel Graph Exploration for Multi-Core CPU and GPU
-
-
Hong, S.1
-
21
-
-
77954929696
-
A work-efficient parallel breadth-first search algorithm (or how to cope with the nondeterminism of reducers)
-
New York, NY, USA, 2010
-
Leiserson, C.E. and Schardl, T.B. 2010. A work-efficient parallel breadth-first search algorithm (or how to cope with the nondeterminism of reducers). Proceedings of the 22nd ACM symposium on Parallelism in algorithms and architectures (New York, NY, USA, 2010), 303-314.
-
(2010)
Proceedings of the 22nd ACM Symposium on Parallelism in Algorithms and Architectures
, pp. 303-314
-
-
Leiserson, C.E.1
Schardl, T.B.2
-
22
-
-
77956200064
-
An effective GPU implementation of breadthfirst search
-
New York, NY, USA, 2010
-
Luo, L. et al. 2010. An effective GPU implementation of breadthfirst search. Proceedings of the 47th Design Automation Conference (New York, NY, USA, 2010), 52-55.
-
(2010)
Proceedings of the 47th Design Automation Conference
, pp. 52-55
-
-
Luo, L.1
-
23
-
-
79959718248
-
High performance and scalable radix sorting: A case study of implementing dynamic parallelism for GPU computing
-
2011
-
Merrill, D. and Grimshaw, A. 2011. High Performance and Scalable Radix Sorting: A case study of implementing dynamic parallelism for GPU computing. Parallel Processing Letters. 21, 02 (2011), 245-272.
-
(2011)
Parallel Processing Letters
, vol.21
, Issue.2
, pp. 245-272
-
-
Merrill, D.1
Grimshaw, A.2
-
24
-
-
78149268496
-
Parallel scan for stream architectures
-
Department of Computer Science, University of Virginia
-
Merrill, D. and Grimshaw, A. 2009. Parallel Scan for Stream Architectures. Technical Report #CS2009-14. Department of Computer Science, University of Virginia.
-
(2009)
Technical Report #CS2009-14
-
-
Merrill, D.1
Grimshaw, A.2
-
25
-
-
84858400990
-
High performance and scalable GPU graph traversal
-
Department of Computer Science, University of Virginia
-
Merrill, D. et al. 2011. High Performance and Scalable GPU Graph Traversal. Technical Report #CS2011-05. Department of Computer Science, University of Virginia.
-
(2011)
Technical Report #CS2011-05
-
-
Merrill, D.1
-
26
-
-
80053212330
-
-
Accessed: -07-11
-
Parboil Benchmark suite: http://impact.crhc.illinois.edu/parboil.php. Accessed: 2011-07-11.
-
(2011)
Parboil Benchmark Suite
-
-
-
27
-
-
51649124194
-
Efficient breadth-first search on the Cell/BE processor
-
Oct. 2008
-
Scarpazza, D.P. et al. 2008. Efficient Breadth-First Search on the Cell/BE Processor. IEEE Transactions on Parallel and Distributed Systems. 19, 10 (Oct. 2008), 1381-1395.
-
(2008)
IEEE Transactions on Parallel and Distributed Systems
, vol.19
, Issue.10
, pp. 1381-1395
-
-
Scarpazza, D.P.1
-
28
-
-
77952833958
-
Efficient parallel scan algorithms for GPUs
-
NVIDIA
-
Sengupta, S. et al. 2008. Efficient parallel scan algorithms for GPUs. Technical Report #NVR-2008-003. NVIDIA.
-
(2008)
Technical Report #NVR-2008-003
-
-
Sengupta, S.1
-
29
-
-
84878557233
-
-
Accessed: 07-11
-
The Graph 500 List: http://www.graph500.org/. Accessed: 2011-07-11.
-
(2011)
The Graph 500 List
-
-
-
30
-
-
0025588012
-
High-probability parallel transitive closure algorithms
-
Island of Crete, Greece, 1990
-
Ullman, J. and Yannakakis, M. 1990. High-probability parallel transitive closure algorithms. Proceedings of the second annual ACM symposium on Parallel algorithms and architectures - SPAA '90 (Island of Crete, Greece, 1990), 200-209.
-
(1990)
Proceedings of the Second Annual ACM Symposium on Parallel Algorithms and Architectures - SPAA '90
, pp. 200-209
-
-
Ullman, J.1
Yannakakis, M.2
-
33
-
-
33845388971
-
A Scalable distributed parallel breadth-first search algorithm on blue Gene/L
-
Seattle, WA, USA
-
Yoo, A. et al. A Scalable Distributed Parallel Breadth-First Search Algorithm on Blue Gene/L. ACM/IEEE SC 2005 Conference (SC'05) (Seattle, WA, USA), 25-25.
-
ACM/IEEE SC 2005 Conference (SC'05)
, pp. 25-25
-
-
Yoo, A.1
|