-
2
-
-
80053975663
-
-
http://en.wikipedia.org/wiki/GeForce-200-Series, 2010.
-
(2010)
-
-
-
4
-
-
33244482870
-
A computational study of external-memory bfs algorithms
-
D. Ajwani, R. Dementiev, and U. Meyer. A computational study of external-memory bfs algorithms. In ACM-SIAM SODA, 2006.
-
(2006)
ACM-SIAM SODA
-
-
Ajwani, D.1
Dementiev, R.2
Meyer, U.3
-
5
-
-
79952783870
-
Designing multithreaded algorithms for breadth-first search and st-connectivity on the cray mta-2
-
D. Bader and K. Madduri. Designing multithreaded algorithms for breadth-first search and st-connectivity on the cray mta-2. In IEEE ICPP, 2006.
-
(2006)
IEEE ICPP
-
-
Bader, D.1
Madduri, K.2
-
6
-
-
79952794548
-
Snap, small-world network analysis and partitioning: An open-source parallel graph framework for the exploration of large-scale networks
-
D. Bader and K. Madduri. Snap, small-world network analysis and partitioning: An open-source parallel graph framework for the exploration of large-scale networks. In IEEE IPDPS, 2008.
-
(2008)
IEEE IPDPS
-
-
Bader, D.1
Madduri, K.2
-
8
-
-
70450275084
-
Analyzing cuda workloads using a detailed gpu simulator
-
A. Bakhoda, G. L. Yuan,W.W. L. Fung, H.Wong, and T. M. Aamodt. Analyzing cuda workloads using a detailed gpu simulator. In IEEE ISPASS, 2009.
-
(2009)
IEEE ISPASS
-
-
Bakhoda, A.1
Yuan, G.L.2
Fung, W.W.L.3
Wong, H.4
Aamodt, T.M.5
-
10
-
-
78650079065
-
Language virtualization for heterogeneous parallel computing
-
H. Chafi, Z. DeVito, A. Moors, T. Rompf, A. K. Sujeeth, P. Hanrahan, M. Odersky, and K. Olukotun. Language virtualization for heterogeneous parallel computing. In Proc. Conf. OOPSLA '10, 2010.
-
(2010)
Proc. Conf. OOPSLA '10
-
-
Chafi, H.1
DeVito, Z.2
Moors, A.3
Rompf, T.4
Sujeeth, A.K.5
Hanrahan, P.6
Odersky, M.7
Olukotun, K.8
-
12
-
-
70649092154
-
Rodinia: A benchmark suite for heterogeneous computing
-
S. Che, M. Boyer, J. Meng, D. Tarjan, J. Sheaffer, S.-H. Lee, and K. Skadron. Rodinia: A benchmark suite for heterogeneous computing. In IEEE IISWC, 2009.
-
(2009)
IEEE IISWC
-
-
Che, S.1
Boyer, M.2
Meng, J.3
Tarjan, D.4
Sheaffer, J.5
Lee, S.-H.6
Skadron, K.7
-
13
-
-
80053941100
-
-
Cray Inc. Cray xmt.
-
Cray, Inc. Cray xmt. http://www.cray.com/products/xmt/.
-
-
-
-
14
-
-
34548207355
-
Sequoia: Programming the memory hierarchy
-
K. Fatahalian, D. R. Horn, T. J. Knight, L. Leem, M. Houston, J. Y. Park, M. Erez, M. Ren, A. Aiken, W. J. Dally, and P. Hanrahan. Sequoia: Programming the memory hierarchy. In SC, 2006.
-
(2006)
SC
-
-
Fatahalian, K.1
Horn, D.R.2
Knight, T.J.3
Leem, L.4
Houston, M.5
Park, J.Y.6
Erez, M.7
Ren, M.8
Aiken, A.9
Dally, W.J.10
Hanrahan, P.11
-
15
-
-
60649099910
-
Accelerating large graph algorithms on the gpu using cuda
-
P. Harish and P. J. Narayanan. Accelerating large graph algorithms on the gpu using cuda. In HiPC, 2007.
-
(2007)
HiPC
-
-
Harish, P.1
Narayanan, P.J.2
-
16
-
-
70450200605
-
-
Technical Report IIIT/TR/2009/74, International Institute of Information Technology Hyderabad, India
-
P. harish, V. Vineet, and P. Narayanan. Large graph algorithms for massively multithreaded architectures. Technical Report IIIT/TR/2009/74, International Institute of Information Technology Hyderabad, India, 2009.
-
(2009)
Large Graph Algorithms for Massively Multithreaded Architectures
-
-
Harish, P.1
Vineet, V.2
Narayanan, P.3
-
19
-
-
77954995885
-
Debunking the 100x gpu vs. cpu myth: An evaluation of throughput computing on cpu and gpu
-
V. W. Lee, C. Kim, J. Chhugani, M. Deisher, D. Kim, A. D. Nguyen, N. Satish, M. Smelyanskiy, S. Chennupaty, P. Hammarlund, R. Singhal, and P. Dubey. Debunking the 100x gpu vs. cpu myth: An evaluation of throughput computing on cpu and gpu. In ISCA, 2010.
-
(2010)
ISCA
-
-
Lee, V.W.1
Kim, C.2
Chhugani, J.3
Deisher, M.4
Kim, D.5
Nguyen, A.D.6
Satish, N.7
Smelyanskiy, M.8
Chennupaty, S.9
Hammarlund, P.10
Singhal, R.11
Dubey, P.12
-
20
-
-
77954976292
-
Dynamic warp subdivision for integrated branch and memory divergence tolerance
-
J. Meng, D. Tarjan, and K. Skadron. Dynamic warp subdivision for integrated branch and memory divergence tolerance. In ISCA, 2010.
-
(2010)
ISCA
-
-
Meng, J.1
Tarjan, D.2
Skadron, K.3
-
22
-
-
79952776779
-
A parallel externalmemory frontier breadth-first traversal algorithm for clusters of workstations
-
R. Niewiadomski, J. Amaral, and R. Holte. A parallel externalmemory frontier breadth-first traversal algorithm for clusters of workstations. In IEEE ICPP, 2006.
-
(2006)
IEEE ICPP
-
-
Niewiadomski, R.1
Amaral, J.2
Holte, R.3
-
23
-
-
80053971905
-
-
Nvidia. Cuda.
-
Nvidia. Cuda. http://www.nvidia.com/cuda/.
-
-
-
-
25
-
-
33845388971
-
A scalable distributed parallel breadth-first search algorithm on bluegene/l
-
A. Yoo, E. Chow, K. Henderson, W. McLendon, B. Hendrickson, and U. Catalyurek. A scalable distributed parallel breadth-first search algorithm on bluegene/l. In ACM/IEEE SC, 2005.
-
(2005)
ACM/IEEE SC
-
-
Yoo, A.1
Chow, E.2
Henderson, K.3
McLendon, W.4
Hendrickson, B.5
Catalyurek, U.6
|