-
1
-
-
84893594127
-
-
AMD Radeon HD 7000 series counters. Web resource
-
AMD Radeon HD 7000 series counters. Web resource. http: //developer.amd.com/tools/heterogeneous-computing/amd-app-profiler/ user- guide/app- profiler- settings/.
-
-
-
-
2
-
-
84893553919
-
-
The 10th DIMACS Implementation Challenge Graph Partitioning and Graph Clustering. Web resource
-
The 10th DIMACS Implementation Challenge Graph Partitioning and Graph Clustering. Web resource. http://www.cc.gatech.edu/dimacslO/.
-
-
-
-
3
-
-
84893617546
-
-
The 9th DIMACS Implementation Challenge Shortest Paths. Web resource
-
The 9th DIMACS Implementation Challenge Shortest Paths. Web resource. http://www.dis.uniromal.it/challenge9/.
-
-
-
-
4
-
-
84893549371
-
-
AMD Accelerated Parallel Processing: OpenCL Programming Guide. Web resource
-
AMD Accelerated Parallel Processing: OpenCL Programming Guide. Web resource.
-
-
-
-
5
-
-
84893538088
-
-
AMD Graphics Core Next Architecture. Web resource
-
http://developer.amd.com/download/AMD-Accelerated- Parallel-Processing- OpenCL-Programming-Guide.pdf AMD Graphics Core Next Architecture. Web resource. http://www.amd. com/us/products/technologies/gcn/Pages/gcn-architecture.aspx.
-
-
-
-
6
-
-
85166916238
-
Gephi: An open source software for exploring and manipulating networks
-
May
-
M. Bastian, S. Heymann, and M. Jacomy. Gephi: An open source software for exploring and manipulating networks. In ICWSM, May 2009.
-
(2009)
ICWSM
-
-
Bastian, M.1
Heymann, S.2
Jacomy, M.3
-
7
-
-
56449124998
-
Splash-2: A quantitative comparison of two multithreaded benchmark suites on chip-multiprocessors
-
Sep
-
C. Bienia, S. Kumar, and K. Li. PARSEC vs. SPLASH-2: A quantitative comparison of two multithreaded benchmark suites on chip-multiprocessors. In IISWC, Sep 2008.
-
(2008)
IISWC
-
-
Bienia, C.1
Kumar, S.2
Li. Parsec Vs, K.3
-
8
-
-
78751484931
-
Fidelity and scaling of the parsec benchmark inputs
-
Dec
-
C. Bienia and K. Li. Fidelity and scaling of the PARSEC benchmark inputs. In IISWC, Dec 2010
-
(2010)
IISWC
-
-
Bienia, C.1
Li, K.2
-
9
-
-
0035648637
-
A faster algorithm for betweenness centrality
-
Ulrik Brandes. A faster algorithm for betweenness centrality. 1. Math. Social., 25:163-177, 2001.
-
(2001)
1. Math. Social
, vol.25
, pp. 163-177
-
-
Brandes, U.1
-
10
-
-
84873458159
-
A quantitative study ofirregular programs on gpus
-
Nov
-
M. Burtscher, R. Nasre, and K. Pingali. A quantitative study ofirregular programs on GPUs. In IISWC, Nov 2012.
-
(2012)
IISWC
-
-
Burtscher, M.1
Nasre, R.2
Pingali, K.3
-
11
-
-
84858427151
-
An efficient cuda implementation of the tree-based barnes hut n-body algorithm
-
Morgan Kaufmann
-
M. Burtscher and K. Pingali. An efficient CUDA implementation of the tree-based Barnes Hut n-body algorithm. In GPU Computing Gems, pages 75-92. Morgan Kaufmann, 2011.
-
(2011)
GPU Computing Gems
, pp. 75-92
-
-
Burtscher, M.1
Pingali, K.2
-
12
-
-
70649092154
-
Rodinia: A benchmark suite for heterogeneous computing
-
Oct
-
S. Che, M. Boyer, J. Meng, D. Tarjan, J. W. Sheaffer, S-H. Lee, and K. Skadron. Rodinia: A benchmark suite for heterogeneous computing. In IISWC, Oct 2009.
-
(2009)
IISWC
-
-
Che, S.1
Boyer, M.2
Meng, J.3
Tarjan, D.4
Sheaffer, J.W.5
Lee, S.-H.6
Skadron, K.7
-
13
-
-
78751505898
-
A characterization of the rodinia benchmark suite with comparison to contemporary cmp workloads
-
Dec
-
S. Che, J. W. Sheaffer, M. Boyer, L. G. Szafaryn, L. Wang, and K. Skadron. A characterization of the Rodinia benchmark suite with comparison to contemporary CMP workloads. In IISWC, Dec 2010.
-
(2010)
IISWC
-
-
Che, S.1
Sheaffer, J.W.2
Boyer, M.3
Szafaryn, L.G.4
Wang, L.5
Skadron, K.6
-
14
-
-
0004116989
-
-
McGraw-Hill, 2nd edition
-
T. H. Cormen, C. Stein, R. L. Rivest, and C. E. Leiserson. Introduction to Algorithms. McGraw-Hill, 2nd edition, 2001.
-
(2001)
Introduction to Algorithms
-
-
Cormen, T.H.1
Stein, C.2
Rivest, R.L.3
Leiserson, C.E.4
-
15
-
-
77954719557
-
The scalable heterogeneous computing (shoc) benchmark suite
-
Mar
-
A. Danalis, G. Marin, C. McCurdy, J. S. Meredith, P. C. Roth, K. Spafford, V. Tipparaju, and J. S. Vetter. The scalable HeterOgeneous computing (SHOC) benchmark suite. In GPGPU, Mar 2010.
-
(2010)
GPGPU
-
-
Danalis, A.1
Marin, G.2
McCurdy, C.3
Meredith, J.S.4
Roth, P.C.5
Spafford, K.6
Tipparaju, V.7
Vetter, J.S.8
-
16
-
-
84948749348
-
Workload design: Selecting representative program-input pairs
-
Sept
-
L. Eeckhout, H. Vandierendonck, and K. D. Bosschere. Workload design: Selecting representative program-input pairs. In PACT, Sept 2002.
-
(2002)
PACT
-
-
Eeckhout, L.1
Vandierendonck, H.2
Bosschere, K.D.3
-
17
-
-
78751477137
-
Exploring gpgpu workloads: Characterization methodology, analysis and microarchitecture evaluation implication
-
Dec
-
N. Goswami, R. Shankar, M. Joshi, and Tao Li. Exploring gpgpu workloads: Characterization methodology, analysis and microarchitecture evaluation implication. In IISWC, Dec 2010.
-
(2010)
IISWC
-
-
Goswami, N.1
Shankar, R.2
Joshi, M.3
Li, T.4
-
18
-
-
84893608095
-
-
NVIDIA CUDA Programming Guide. Web resource
-
NVIDIA CUDA Programming Guide. Web resource. http://developer. nvidia.com/object/gpucomputing.html.
-
-
-
-
19
-
-
60649099910
-
Accelerating large graph algorithms on the gpu using cuda
-
Dec
-
P. Harish and P. Narayanan. Accelerating large graph algorithms on the GPU using CUDA. In HiPC, Dec 2007.
-
(2007)
HiPC
-
-
Harish, P.1
Narayanan, P.2
-
20
-
-
34548329985
-
Microarchitecture-independent workload characterization
-
K. Hoste and L. Eeckhout. Microarchitecture-independent workload characterization. IEEE Micro, 27(3):63-72, 2007.
-
(2007)
IEEE Micro
, vol.27
, Issue.3
, pp. 63-72
-
-
Hoste, K.1
Eeckhout, L.2
-
22
-
-
33646486530
-
Measuring benchmark similarity using inherent program characteristics
-
A. Joshi, A. Phansalkar, L. Eeckhout, and L. K. John. Measuring benchmark similarity using inherent program characteristics. IEEE Trans. Comp, 55(6):769-782, 2006.
-
(2006)
IEEE Trans. Comp
, vol.55
, Issue.6
, pp. 769-782
-
-
Joshi, A.1
Phansalkar, A.2
Eeckhout, L.3
John, L.K.4
-
23
-
-
79952042632
-
Connected component labeling on a 2d grid using cuda
-
O. Kalentev, A. Rai, S. Kemnitz, and R. Schneider. Connected component labeling on a 2D grid using CUDA. J. Parallel and Dist. Comp., 71(4):615-620, 2011.
-
(2011)
J. Parallel and Dist. Comp
, vol.71
, Issue.4
, pp. 615-620
-
-
Kalentev, O.1
Rai, A.2
Kemnitz, S.3
Schneider, R.4
-
24
-
-
70649104826
-
A characterization and analysis of ptx kernels
-
Oct
-
A. Kerr, G. Diamos, and S. Yalamanchili. A characterization and analysis of PTX kernels. In IISWC, Oct 2009.
-
(2009)
IISWC
-
-
Kerr, A.1
Diamos, G.2
Yalamanchili, S.3
-
25
-
-
85042632297
-
Graphchi: Large-scale graph computation on just a pc
-
Oct
-
A. Kyrola, G. Blelloch, and C. Guestrin. Graphchi: Large-scale graph computation on just a pc. In OSDI, Oct 2012.
-
(2012)
OSDI
-
-
Kyrola, A.1
Blelloch, G.2
Guestrin, C.3
-
26
-
-
80052875653
-
Graphlab: A new parallel framework for machine learning
-
July
-
Y. Low, J. Gonzalez, A. Kyrola, D. Bickson, C. Guestrin, and J. M. Hellerstein. Graphlab: A new parallel framework for machine learning. In UAI, July 2010.
-
(2010)
UAI
-
-
Low, Y.1
Gonzalez, J.2
Kyrola, A.3
Bickson, D.4
Guestrin, C.5
Hellerstein, J.M.6
-
27
-
-
0021946709
-
A simple parallel algorithm for the maximal independent set problem
-
May
-
M. Luby. A simple parallel algorithm for the maximal independent set problem. In STOC, May 1985.
-
(1985)
STOC
-
-
Luby, M.1
-
28
-
-
77954723629
-
Pregel: A system for large-scale graph processing
-
June
-
G. Malewicz, M. H. Austern, A. J.C Bik, J. C. Dehnert, I. Horn, N. Leiser, and G. Czajkowski. Pregel: a system for large-scale graph processing. In SIGMOD, June 2010.
-
(2010)
SIGMOD
-
-
Malewicz, G.1
Austern, M.H.2
Bik, A.J.C.3
Dehnert, J.C.4
Horn, I.5
Leiser, N.6
Czajkowski, G.7
-
29
-
-
84893592799
-
-
Matrix Market Format. Web resource
-
Matrix Market Format. Web resouce. http://math.nist.gov/MatrixMarket/ formats.html.
-
-
-
-
30
-
-
84866862121
-
Robust simd: Dynamically adapted simd width and multi-threading depth
-
May
-
J. Meng, J. W. Sheaffer, and K. Skadron. Robust SIMD: Dynamically adapted simd width and multi-threading depth. In IPDPS, May 2012.
-
(2012)
IPDPS
-
-
Meng, J.1
Sheaffer, J.W.2
Skadron, K.3
-
31
-
-
77954976292
-
Dynamic warp subdivision for integrated branch and memory divergence tolerance
-
June
-
J. Meng, D. Tarjan, and K. Skadron. Dynamic warp subdivision for integrated branch and memory divergence tolerance. In ISCA, June 2010
-
(2010)
ISCA
-
-
Meng, J.1
Tarjan, D.2
Skadron, K.3
-
33
-
-
84893573741
-
-
METIS File Format. Web resource
-
METIS File Format. Web resource. http://people.sc.fsu.edu/~jburkardt/ data/metis-graph/metis-graph.html.
-
-
-
-
34
-
-
84893573144
-
-
GTGraph: A Suite of Synthetic Random Graph Generators. Web resource
-
GTGraph: A Suite of Synthetic Random Graph Generators. Web resource. http://www.cse.psu.edu/~madduri/software/GTgraph/index. html.
-
-
-
-
35
-
-
84879777611
-
A study on connected components labeling algorithms using gpus
-
Aug
-
V. M. A. Oliveira and R. A. Lotufo. A study on connected components labeling algorithms using GPUs. In SIBGRAFI, Aug 2010.
-
(2010)
SIBGRAFI
-
-
Oliveira, V.M.A.1
Lotufo, R.A.2
-
36
-
-
0003780986
-
-
Technical Report SIDL-WP-1999- 01204, Stanford Univerisity
-
L. Page, S. Brin, R. Motwani, and T. Winograd. The pagerank citation ranking: Bringing order to the web. Technical Report SIDL-WP-1999- 01204, Stanford Univerisity, 1999
-
(1999)
The Pagerank Citation Ranking: Bringing Order to the Web
-
-
Page, L.1
Brin, S.2
Motwani, R.3
Winograd, T.4
-
37
-
-
35348913704
-
Analysis of redundancy and application balance in the spec cpu2006 benchmark suite
-
June
-
A. Phansalkar, A. Joshi, and L. K. John. Analysis of redundancy and application balance in the SPEC CPU2006 benchmark suite. In 1SCA, June 2007.
-
(2007)
1SCA
-
-
Phansalkar, A.1
Joshi, A.2
John, L.K.3
-
38
-
-
84893527877
-
-
AFDS 2012 Phil Rogers Keynote: The programmer's guide to a universe of possibility. Web resource
-
AFDS 2012 Phil Rogers Keynote: The programmer's guide to a universe of possibility. Web resource. http://hsafoundation.com/ publications/.
-
-
-
-
40
-
-
79955809263
-
Fast network centrality analysis using gpus
-
Z. Shi and B. Zhang. Fast network centrality analysis using gpus. BMC Bioinformatics, 12(140), 2011
-
(2011)
BMC Bioinformatics
, vol.12
, Issue.140
-
-
Shi, Z.1
Zhang, B.2
-
41
-
-
84893557304
-
-
Parboil Benchmark suite. Web resource
-
Parboil Benchmark suite. Web resource. http://impact.crhc.illinois.edu/ parboil.php.
-
-
-
-
42
-
-
84893579755
-
-
The University of Florida Sparse Matrix Collection. Web resource
-
The University of Florida Sparse Matrix Collection. Web resource. http: IIwww.cise.u8.edu/research/sparse/matrices/.
-
-
-
|