-
2
-
-
10444269335
-
Load balancing and locality in range-queriable data structures
-
ACM
-
J. Aspnes, J. Kirsch, and A. Krishnamurthy, "Load balancing and locality in range-queriable data structures," in Proceedings of the twenty-third annual ACM symposium on Principles of distributed computing. ACM, 2004, pp. 115-124.
-
(2004)
Proceedings of the Twenty-Third Annual ACM Symposium on Principles of Distributed Computing
, pp. 115-124
-
-
Aspnes, J.1
Kirsch, J.2
Krishnamurthy, A.3
-
3
-
-
57649106258
-
Larrabee: A many-core x86 architecture for visual computing
-
New York, NY, USA: ACM
-
L. Seiler, D. Carmean, E. Sprangle, T. Forsyth, M. Abrash, P. Dubey, S. Junkins, A. Lake, J. Sugerman, R. Cavin, R. Espasa, E. Grochowski, T. Juan, and P. Hanrahan, "Larrabee: a many-core x86 architecture for visual computing," in SIGGRAPH '08: ACM SIGGRAPH 2008 papers. New York, NY, USA: ACM, 2008, pp. 1-15.
-
(2008)
SIGGRAPH '08: ACM SIGGRAPH 2008 Papers
, pp. 1-15
-
-
Seiler, L.1
Carmean, D.2
Sprangle, E.3
Forsyth, T.4
Abrash, M.5
Dubey, P.6
Junkins, S.7
Lake, A.8
Sugerman, J.9
Cavin, R.10
Espasa, R.11
Grochowski, E.12
Juan, T.13
Hanrahan, P.14
-
4
-
-
0035648637
-
A faster algorithm for betweenness centrality
-
U. Brandes, "A faster algorithm for betweenness centrality," J. Mathematical Sociology, vol. 25, no. 2, pp. 163-177, 2001.
-
(2001)
J. Mathematical Sociology
, vol.25
, Issue.2
, pp. 163-177
-
-
Brandes, U.1
-
5
-
-
41549097717
-
On variants of shortest-path betweenness centrality and their generic computation
-
-, "On variants of shortest-path betweenness centrality and their generic computation," Social Networks, vol. 30, no. 2, pp. 136-145, 2008.
-
(2008)
Social Networks
, vol.30
, Issue.2
, pp. 136-145
-
-
Brandes, U.1
-
6
-
-
79952579787
-
Exascale Computing Technology Challenges
-
J. Shalf, S. Dosanjh, and J. Morrison, "Exascale Computing Technology Challenges," High Performance Computing for Computational Science-VECPAR 2010, pp. 1-25, 2011.
-
(2011)
High Performance Computing for Computational Science-VECPAR 2010
, pp. 1-25
-
-
Shalf, J.1
Dosanjh, S.2
Morrison, J.3
-
7
-
-
0031696792
-
Cramming more components onto integrated circuits
-
G. Moore et al., "Cramming more components onto integrated circuits," Proceedings of the IEEE, vol. 86, no. 1, pp. 82-85, 1998.
-
(1998)
Proceedings of the IEEE
, vol.86
, Issue.1
, pp. 82-85
-
-
Moore, G.1
-
8
-
-
0003158656
-
Hitting the memory wall: Implications of the obvious
-
March
-
W. A. Wulf and S. A. McKee, "Hitting the memory wall: implications of the obvious," SIGARCH Comput. Archit. News, vol. 23, pp. 20-24, March 1995.
-
(1995)
SIGARCH Comput. Archit. News
, vol.23
, pp. 20-24
-
-
Wulf, W.A.1
McKee, S.A.2
-
9
-
-
84877720178
-
-
Los Alamos National Laboratory (LANL), Tech. Rep.
-
P. McCormick, R. Braithwaite, and W. Feng, "Empirical memory-access cost models in multicore numa architectures," Los Alamos National Laboratory (LANL), Tech. Rep., 2011.
-
(2011)
Empirical Memory-Access Cost Models in Multicore Numa Architectures
-
-
McCormick, P.1
Braithwaite, R.2
Feng, W.3
-
10
-
-
79955737126
-
A 32nm Westmere-EX Xeon® enterprise processor
-
IEEE
-
S. Sawant, U. Desai, G. Shamanna, L. Sharma, M. Ranade, A. Agarwal, S. Dakshinamurthy, and R. Narayanan, "A 32nm Westmere-EX Xeon® enterprise processor," in Solid-State Circuits Conference Digest of Technical Papers (ISSCC), 2011 IEEE International. IEEE, 2011, pp. 74-75.
-
(2011)
Solid-State Circuits Conference Digest of Technical Papers (ISSCC), 2011 IEEE International
, pp. 74-75
-
-
Sawant, S.1
Desai, U.2
Shamanna, G.3
Sharma, L.4
Ranade, M.5
Agarwal, A.6
Dakshinamurthy, S.7
Narayanan, R.8
-
11
-
-
0024936730
-
Simple but effective techniques for numa memory management
-
ACM
-
W. Bolosky, R. Fitzgerald, and M. Scott, "Simple but effective techniques for numa memory management," in ACM SIGOPS Operating Systems Review, vol. 23, no. 5. ACM, 1989, pp. 19-31.
-
(1989)
ACM SIGOPS Operating Systems Review
, vol.23
, Issue.5
, pp. 19-31
-
-
Bolosky, W.1
Fitzgerald, R.2
Scott, M.3
-
13
-
-
70449792770
-
A faster parallel algorithm and efficient multithreaded implementations for evaluating betweenness centrality on massive datasets
-
IEEE
-
K. Madduri, D. Ediger, K. Jiang, D. Bader, and D. Chavarria-Miranda, "A faster parallel algorithm and efficient multithreaded implementations for evaluating betweenness centrality on massive datasets," in Parallel & Distributed Processing, 2009. IPDPS 2009. IEEE International Symposium on. IEEE, 2009, pp. 1-8.
-
(2009)
Parallel & Distributed Processing, 2009. IPDPS 2009. IEEE International Symposium on
, pp. 1-8
-
-
Madduri, K.1
Ediger, D.2
Jiang, K.3
Bader, D.4
Chavarria-Miranda, D.5
-
14
-
-
4644275245
-
Fast approximation of centrality
-
D. Eppstein and J. Wang, "Fast approximation of centrality," Journal of Graph Algorithms and Applications, vol. 8, no. 1, pp. 39-45, 2004.
-
(2004)
Journal of Graph Algorithms and Applications
, vol.8
, Issue.1
, pp. 39-45
-
-
Eppstein, D.1
Wang, J.2
-
15
-
-
38149071742
-
Approximating betweenness centrality
-
Proc. 5th Int'l. Workshop on Algorithms and Models for the Web-Graph (WAW 2007), ser. A. Bonato and F. Chung, Eds., Springer, Dec.
-
D. Bader, S. Kintali, K. Madduri, and M. Mihail, "Approximating betweenness centrality," in Proc. 5th Int'l. Workshop on Algorithms and Models for the Web-Graph (WAW 2007), ser. LNCS, A. Bonato and F. Chung, Eds., vol. 4863. Springer, Dec. 2007, pp. 124-137.
-
(2007)
LNCS
, vol.4863
, pp. 124-137
-
-
Bader, D.1
Kintali, S.2
Madduri, K.3
Mihail, M.4
-
16
-
-
58349114641
-
Better approximation of betweenness centrality
-
SIAM, Jan.
-
R. Geisberger, P. Sanders, and D. Schultes, "Better approximation of betweenness centrality," in Proc. Workshop on Algorithm Engineering and Experimentation (ALENEX 2008). SIAM, Jan. 2008, pp. 90-100.
-
(2008)
Proc. Workshop on Algorithm Engineering and Experimentation (ALENEX 2008)
, pp. 90-100
-
-
Geisberger, R.1
Sanders, P.2
Schultes, D.3
-
17
-
-
0002806690
-
Open MP: An Industry-Standard API for Shared-Memory Programming
-
L. Dagum and R. Menon, "Open MP: An Industry-Standard API for Shared-Memory Programming," Computational Science & Engineering, IEEE, vol. 5, no. 1, pp. 46-55, 1998.
-
(1998)
Computational Science & Engineering, IEEE
, vol.5
, Issue.1
, pp. 46-55
-
-
Dagum, L.1
Menon, R.2
-
19
-
-
0016940739
-
Comparative analysis of the Cuthill-McKee and the reverse Cuthill-McKee ordering algorithms for sparse matrices
-
W. Liu and A. Sherman, "Comparative analysis of the Cuthill-McKee and the reverse Cuthill-McKee ordering algorithms for sparse matrices," SIAM Journal on Numerical Analysis, pp. 198-213, 1976.
-
(1976)
SIAM Journal on Numerical Analysis
, pp. 198-213
-
-
Liu, W.1
Sherman, A.2
-
20
-
-
79958294668
-
Can models of scientific software-hardware interactions be predictive?
-
M. Frasca, A. Chatterjee, and P. Raghavan, "Can models of scientific software-hardware interactions be predictive?" Procedia CS, vol. 4, pp. 322-331, 2011.
-
(2011)
Procedia CS
, vol.4
, pp. 322-331
-
-
Frasca, M.1
Chatterjee, A.2
Raghavan, P.3
-
21
-
-
81355161778
-
The University of Florida sparse matrix collection
-
Dec. [Online]. Available
-
T. A. Davis and Y. Hu, "The University of Florida sparse matrix collection," ACM Trans. Math. Softw., vol. 38, no. 1, pp. 1:1-1:25, Dec. 2011. [Online]. Available: http://doi.acm.org/10.1145/2049662.2049663
-
(2011)
ACM Trans. Math. Softw.
, vol.38
, Issue.1
-
-
Davis, T.A.1
Hu, Y.2
-
22
-
-
0002634823
-
Scheduling multithreaded computations by work stealing
-
IEEE
-
R. Blumofe and C. Leiserson, "Scheduling multithreaded computations by work stealing," in Foundations of Computer Science, 1994 Proceedings., 35th Annual Symposium on. IEEE, 1994, pp. 356-368.
-
(1994)
Foundations of Computer Science, 1994 Proceedings., 35th Annual Symposium on
, pp. 356-368
-
-
Blumofe, R.1
Leiserson, C.2
-
23
-
-
0029191296
-
-
ACM
-
R. Blumofe, C. Joerg, B. Kuszmaul, C. Leiserson, K. Randall, and Y. Zhou, Cilk: An efficient multithreaded runtime system. ACM, 1995, vol. 30, no. 8.
-
(1995)
Cilk: An Efficient Multithreaded Runtime System
, vol.30
, Issue.8
-
-
Blumofe, R.1
Joerg, C.2
Kuszmaul, B.3
Leiserson, C.4
Randall, K.5
Zhou, Y.6
-
25
-
-
84864332206
-
-
D. Bader, H. Meyerhenke, P. Sanders, and D. Wagner, "10th DIMACS implementation challenge-graph partitioning and graph clustering, 2011."
-
10th DIMACS Implementation Challenge-Graph Partitioning and Graph Clustering, 2011
-
-
Bader, D.1
Meyerhenke, H.2
Sanders, P.3
Wagner, D.4
-
27
-
-
0038998034
-
A survey of memory bandwidth and machine balance in current high performance computers
-
J. McCalpin, "A survey of memory bandwidth and machine balance in current high performance computers," IEEE TCCA Newsletter, pp. 19-25, 1995.
-
(1995)
IEEE TCCA Newsletter
, pp. 19-25
-
-
McCalpin, J.1
-
28
-
-
0034268943
-
Portable programming interface for performance evaluation on modern processors
-
DOI 10.1177/109434200001400303
-
S. Browne, J. Dongarra, N. Garner, G. Ho, and P. Mucci, "A portable programming interface for performance evaluation on modern processors," International Journal of High Performance Computing Applications, vol. 14, no. 3, pp. 189-204, 2000. (Pubitemid 32025040)
-
(2000)
International Journal of High Performance Computing Applications
, vol.14
, Issue.3
, pp. 189-204
-
-
Browne, S.1
Dongarra, J.2
Garner, N.3
Ho, G.4
Mucci, P.5
-
29
-
-
0027694019
-
Access normalization: Loop restructuring for numa computers
-
W. Li and K. Pingali, "Access normalization: loop restructuring for numa computers," ACM Transactions on Computer Systems (TOCS), vol. 11, no. 4, pp. 353-375, 1993.
-
(1993)
ACM Transactions on Computer Systems (TOCS)
, vol.11
, Issue.4
, pp. 353-375
-
-
Li, W.1
Pingali, K.2
-
30
-
-
40349095122
-
Managing distributed, shared l2 caches through oslevel page allocation
-
S. Cho and L. Jin, "Managing distributed, shared l2 caches through oslevel page allocation," in Proceedings of the 39th Annual IEEE/ACM International Symposium on Microarchitecture. IEEE Computer Society, 2006, pp. 455-468.
-
Proceedings of the 39th Annual IEEE/ACM International Symposium on Microarchitecture. IEEE Computer Society, 2006
, pp. 455-468
-
-
Cho, S.1
Jin, L.2
-
31
-
-
85076887997
-
Corey: An operating system for many cores
-
USENIX Association
-
S. Boyd-Wickizer, H. Chen, R. Chen, Y. Mao, F. Kaashoek, R. Morris, A. Pesterev, L. Stein, M. Wu, Y. Dai et al., "Corey: An operating system for many cores," in Proceedings of the 8th USENIX conference on Operating systems design and implementation. USENIX Association, 2008, pp. 43-57.
-
(2008)
Proceedings of the 8th USENIX Conference on Operating Systems Design and Implementation
, pp. 43-57
-
-
Boyd-Wickizer, S.1
Chen, H.2
Chen, R.3
Mao, Y.4
Kaashoek, F.5
Morris, R.6
Pesterev, A.7
Stein, L.8
Wu, M.9
Dai, Y.10
-
32
-
-
0033645154
-
The data locality of work stealing
-
ACM
-
U. Acar, G. Blelloch, and R. Blumofe, "The data locality of work stealing," in Proceedings of the twelfth annual ACM symposium on Parallel algorithms and architectures . ACM, 2000, pp. 1-12.
-
(2000)
Proceedings of the Twelfth Annual ACM Symposium on Parallel Algorithms and Architectures
, pp. 1-12
-
-
Acar, U.1
Blelloch, G.2
Blumofe, R.3
|