SCOPUS 정보 검색 플랫폼

International Conference for High Performance Computing, Networking, Storage and Analysis, SC

Volumn , Issue , 2012, Pages

NUMA-aware graph mining techniques for performance and energy efficiency

(3) Frasca, Michael a Madduri, Kamesh a Raghavan, Padma a

a The Pennsylvania State University (United States)

Author keywords

[No Author keywords available]

Indexed keywords

BETWEENNESS CENTRALITY; COMPUTATIONAL DEMANDS; HARDWARE TOPOLOGY; IRREGULAR APPLICATIONS; MULTI-CORE MACHINES; MULTI-CORE SYSTEMS; NON-UNIFORM MEMORY ARCHITECTURE; PERFORMANCE PROFILE;

ENERGY EFFICIENCY; ENERGY UTILIZATION; MICROPROCESSOR CHIPS;

DIGITAL STORAGE;

EID: 84877711984 PISSN: 21674329 EISSN: 21674337 Source Type: Conference Proceeding
DOI: 10.1109/SC.2012.81 Document Type: Conference Paper

Times cited : (26)

References (33)

1
- 77953207828
- Microsoft Research Redmond, WA
- A. Hey, S. Tansley, and K. Tolle, The fourth paradigm: data-intensive scientific discovery. Microsoft Research Redmond, WA, 2009.
- (2009) The Fourth Paradigm: Data-Intensive Scientific Discovery
- Hey, A.¹ Tansley, S.² Tolle, K.³

2
- 10444269335
- Load balancing and locality in range-queriable data structures
- ACM
- J. Aspnes, J. Kirsch, and A. Krishnamurthy, "Load balancing and locality in range-queriable data structures," in Proceedings of the twenty-third annual ACM symposium on Principles of distributed computing. ACM, 2004, pp. 115-124.
- (2004) Proceedings of the Twenty-Third Annual ACM Symposium on Principles of Distributed Computing , pp. 115-124
- Aspnes, J.¹ Kirsch, J.² Krishnamurthy, A.³

3
- 57649106258
- Larrabee: A many-core x86 architecture for visual computing
- New York, NY, USA: ACM
- L. Seiler, D. Carmean, E. Sprangle, T. Forsyth, M. Abrash, P. Dubey, S. Junkins, A. Lake, J. Sugerman, R. Cavin, R. Espasa, E. Grochowski, T. Juan, and P. Hanrahan, "Larrabee: a many-core x86 architecture for visual computing," in SIGGRAPH '08: ACM SIGGRAPH 2008 papers. New York, NY, USA: ACM, 2008, pp. 1-15.
- (2008) SIGGRAPH '08: ACM SIGGRAPH 2008 Papers , pp. 1-15
- Seiler, L.¹ Carmean, D.² Sprangle, E.³ Forsyth, T.⁴ Abrash, M.⁵ Dubey, P.⁶ Junkins, S.⁷ Lake, A.⁸ Sugerman, J.⁹ Cavin, R.¹⁰ Espasa, R.¹¹ Grochowski, E.¹² Juan, T.¹³ Hanrahan, P.¹⁴

4
- 0035648637
- A faster algorithm for betweenness centrality
- U. Brandes, "A faster algorithm for betweenness centrality," J. Mathematical Sociology, vol. 25, no. 2, pp. 163-177, 2001.
- (2001) J. Mathematical Sociology , vol.25 , Issue.2 , pp. 163-177
- Brandes, U.¹

5
- 41549097717
- On variants of shortest-path betweenness centrality and their generic computation
- -, "On variants of shortest-path betweenness centrality and their generic computation," Social Networks, vol. 30, no. 2, pp. 136-145, 2008.
- (2008) Social Networks , vol.30 , Issue.2 , pp. 136-145
- Brandes, U.¹

6
- 79952579787
- Exascale Computing Technology Challenges
- J. Shalf, S. Dosanjh, and J. Morrison, "Exascale Computing Technology Challenges," High Performance Computing for Computational Science-VECPAR 2010, pp. 1-25, 2011.
- (2011) High Performance Computing for Computational Science-VECPAR 2010 , pp. 1-25
- Shalf, J.¹ Dosanjh, S.² Morrison, J.³

7
- 0031696792
- Cramming more components onto integrated circuits
- G. Moore et al., "Cramming more components onto integrated circuits," Proceedings of the IEEE, vol. 86, no. 1, pp. 82-85, 1998.
- (1998) Proceedings of the IEEE , vol.86 , Issue.1 , pp. 82-85
- Moore, G.¹

8
- 0003158656
- Hitting the memory wall: Implications of the obvious
- March
- W. A. Wulf and S. A. McKee, "Hitting the memory wall: implications of the obvious," SIGARCH Comput. Archit. News, vol. 23, pp. 20-24, March 1995.
- (1995) SIGARCH Comput. Archit. News , vol.23 , pp. 20-24
- Wulf, W.A.¹ McKee, S.A.²

9
- 84877720178
- Los Alamos National Laboratory (LANL), Tech. Rep.
- P. McCormick, R. Braithwaite, and W. Feng, "Empirical memory-access cost models in multicore numa architectures," Los Alamos National Laboratory (LANL), Tech. Rep., 2011.
- (2011) Empirical Memory-Access Cost Models in Multicore Numa Architectures
- McCormick, P.¹ Braithwaite, R.² Feng, W.³

10
- 79955737126
- A 32nm Westmere-EX Xeon® enterprise processor
- IEEE
- S. Sawant, U. Desai, G. Shamanna, L. Sharma, M. Ranade, A. Agarwal, S. Dakshinamurthy, and R. Narayanan, "A 32nm Westmere-EX Xeon® enterprise processor," in Solid-State Circuits Conference Digest of Technical Papers (ISSCC), 2011 IEEE International. IEEE, 2011, pp. 74-75.
- (2011) Solid-State Circuits Conference Digest of Technical Papers (ISSCC), 2011 IEEE International , pp. 74-75
- Sawant, S.¹ Desai, U.² Shamanna, G.³ Sharma, L.⁴ Ranade, M.⁵ Agarwal, A.⁶ Dakshinamurthy, S.⁷ Narayanan, R.⁸

11
- 0024936730
- Simple but effective techniques for numa memory management
- ACM
- W. Bolosky, R. Fitzgerald, and M. Scott, "Simple but effective techniques for numa memory management," in ACM SIGOPS Operating Systems Review, vol. 23, no. 5. ACM, 1989, pp. 19-31.
- (1989) ACM SIGOPS Operating Systems Review , vol.23 , Issue.5 , pp. 19-31
- Bolosky, W.¹ Fitzgerald, R.² Scott, M.³

12
- 84857187765
- Ph.D. dissertation, University of Toronto
- D. Tam, "Operating system management of shared caches on multicore processors," Ph.D. dissertation, University of Toronto, 2010.
- (2010) Operating System Management of Shared Caches on Multicore Processors
- Tam, D.¹

13
- 70449792770
- A faster parallel algorithm and efficient multithreaded implementations for evaluating betweenness centrality on massive datasets
- IEEE
- K. Madduri, D. Ediger, K. Jiang, D. Bader, and D. Chavarria-Miranda, "A faster parallel algorithm and efficient multithreaded implementations for evaluating betweenness centrality on massive datasets," in Parallel & Distributed Processing, 2009. IPDPS 2009. IEEE International Symposium on. IEEE, 2009, pp. 1-8.
- (2009) Parallel & Distributed Processing, 2009. IPDPS 2009. IEEE International Symposium on , pp. 1-8
- Madduri, K.¹ Ediger, D.² Jiang, K.³ Bader, D.⁴ Chavarria-Miranda, D.⁵

14
- 4644275245
- Fast approximation of centrality
- D. Eppstein and J. Wang, "Fast approximation of centrality," Journal of Graph Algorithms and Applications, vol. 8, no. 1, pp. 39-45, 2004.
- (2004) Journal of Graph Algorithms and Applications , vol.8 , Issue.1 , pp. 39-45
- Eppstein, D.¹ Wang, J.²

15
- 38149071742
- Approximating betweenness centrality
- Proc. 5th Int'l. Workshop on Algorithms and Models for the Web-Graph (WAW 2007), ser. A. Bonato and F. Chung, Eds., Springer, Dec.
- D. Bader, S. Kintali, K. Madduri, and M. Mihail, "Approximating betweenness centrality," in Proc. 5th Int'l. Workshop on Algorithms and Models for the Web-Graph (WAW 2007), ser. LNCS, A. Bonato and F. Chung, Eds., vol. 4863. Springer, Dec. 2007, pp. 124-137.
- (2007) LNCS , vol.4863 , pp. 124-137
- Bader, D.¹ Kintali, S.² Madduri, K.³ Mihail, M.⁴

16
- 58349114641
- Better approximation of betweenness centrality
- SIAM, Jan.
- R. Geisberger, P. Sanders, and D. Schultes, "Better approximation of betweenness centrality," in Proc. Workshop on Algorithm Engineering and Experimentation (ALENEX 2008). SIAM, Jan. 2008, pp. 90-100.
- (2008) Proc. Workshop on Algorithm Engineering and Experimentation (ALENEX 2008) , pp. 90-100
- Geisberger, R.¹ Sanders, P.² Schultes, D.³

17
- 0002806690
- Open MP: An Industry-Standard API for Shared-Memory Programming
- L. Dagum and R. Menon, "Open MP: An Industry-Standard API for Shared-Memory Programming," Computational Science & Engineering, IEEE, vol. 5, no. 1, pp. 46-55, 1998.
- (1998) Computational Science & Engineering, IEEE , vol.5 , Issue.1 , pp. 46-55
- Dagum, L.¹ Menon, R.²

18
- 85031264203
- Improving performance of sparse matrix-vector multiplication
- ACM
- A. Pinar and M. Heath, "Improving performance of sparse matrix-vector multiplication," in Proceedings of the 1999 ACM/IEEE conference on Supercomputing (CDROM). ACM, 1999, p. 30.
- (1999) Proceedings of the 1999 ACM/IEEE Conference on Supercomputing (CDROM) , pp. 30
- Pinar, A.¹ Heath, M.²

19
- 0016940739
- Comparative analysis of the Cuthill-McKee and the reverse Cuthill-McKee ordering algorithms for sparse matrices
- W. Liu and A. Sherman, "Comparative analysis of the Cuthill-McKee and the reverse Cuthill-McKee ordering algorithms for sparse matrices," SIAM Journal on Numerical Analysis, pp. 198-213, 1976.
- (1976) SIAM Journal on Numerical Analysis , pp. 198-213
- Liu, W.¹ Sherman, A.²

20
- 79958294668
- Can models of scientific software-hardware interactions be predictive?
- M. Frasca, A. Chatterjee, and P. Raghavan, "Can models of scientific software-hardware interactions be predictive?" Procedia CS, vol. 4, pp. 322-331, 2011.
- (2011) Procedia CS , vol.4 , pp. 322-331
- Frasca, M.¹ Chatterjee, A.² Raghavan, P.³

21
- 81355161778
- The University of Florida sparse matrix collection
- Dec. [Online]. Available
- T. A. Davis and Y. Hu, "The University of Florida sparse matrix collection," ACM Trans. Math. Softw., vol. 38, no. 1, pp. 1:1-1:25, Dec. 2011. [Online]. Available: http://doi.acm.org/10.1145/2049662.2049663
- (2011) ACM Trans. Math. Softw. , vol.38 , Issue.1
- Davis, T.A.¹ Hu, Y.²

22
- 0002634823
- Scheduling multithreaded computations by work stealing
- IEEE
- R. Blumofe and C. Leiserson, "Scheduling multithreaded computations by work stealing," in Foundations of Computer Science, 1994 Proceedings., 35th Annual Symposium on. IEEE, 1994, pp. 356-368.
- (1994) Foundations of Computer Science, 1994 Proceedings., 35th Annual Symposium on , pp. 356-368
- Blumofe, R.¹ Leiserson, C.²

23
- 0029191296
- ACM
- R. Blumofe, C. Joerg, B. Kuszmaul, C. Leiserson, K. Randall, and Y. Zhou, Cilk: An efficient multithreaded runtime system. ACM, 1995, vol. 30, no. 8.
- (1995) Cilk: An Efficient Multithreaded Runtime System , vol.30 , Issue.8
- Blumofe, R.¹ Joerg, C.² Kuszmaul, B.³ Leiserson, C.⁴ Randall, K.⁵ Zhou, Y.⁶

24
- 84860553699
- Tech Report, Department of Computer Science, Carnegie Mellon University
- D. Neill and A. Wierman, "On the benefits of work stealing in shared-memory multiprocessors," Tech Report, Department of Computer Science, Carnegie Mellon University, 2006.
- (2006) On the Benefits of Work Stealing in Shared-Memory Multiprocessors
- Neill, D.¹ Wierman, A.²

25
- 84864332206
- D. Bader, H. Meyerhenke, P. Sanders, and D. Wagner, "10th DIMACS implementation challenge-graph partitioning and graph clustering, 2011."
- 10th DIMACS Implementation Challenge-Graph Partitioning and Graph Clustering, 2011
- Bader, D.¹ Meyerhenke, H.² Sanders, P.³ Wagner, D.⁴

26
- 19944376183
- The WebGraph framework I: Compression techniques
- Manhattan, USA: ACM Press
- P. Boldi and S. Vigna, "The WebGraph framework I: Compression techniques," in Proc. of the Thirteenth International World Wide Web Conference (WWW 2004). Manhattan, USA: ACM Press, 2004, pp. 595-601.
- (2004) Proc. of the Thirteenth International World Wide Web Conference (WWW 2004) , pp. 595-601
- Boldi, P.¹ Vigna, S.²

27
- 0038998034
- A survey of memory bandwidth and machine balance in current high performance computers
- J. McCalpin, "A survey of memory bandwidth and machine balance in current high performance computers," IEEE TCCA Newsletter, pp. 19-25, 1995.
- (1995) IEEE TCCA Newsletter , pp. 19-25
- McCalpin, J.¹

28
- 0034268943
- Portable programming interface for performance evaluation on modern processors
- DOI 10.1177/109434200001400303
- S. Browne, J. Dongarra, N. Garner, G. Ho, and P. Mucci, "A portable programming interface for performance evaluation on modern processors," International Journal of High Performance Computing Applications, vol. 14, no. 3, pp. 189-204, 2000. (Pubitemid 32025040)
- (2000) International Journal of High Performance Computing Applications , vol.14 , Issue.3 , pp. 189-204
- Browne, S.¹ Dongarra, J.² Garner, N.³ Ho, G.⁴ Mucci, P.⁵

29
- 0027694019
- Access normalization: Loop restructuring for numa computers
- W. Li and K. Pingali, "Access normalization: loop restructuring for numa computers," ACM Transactions on Computer Systems (TOCS), vol. 11, no. 4, pp. 353-375, 1993.
- (1993) ACM Transactions on Computer Systems (TOCS) , vol.11 , Issue.4 , pp. 353-375
- Li, W.¹ Pingali, K.²

30
- 40349095122
- Managing distributed, shared l2 caches through oslevel page allocation
- S. Cho and L. Jin, "Managing distributed, shared l2 caches through oslevel page allocation," in Proceedings of the 39th Annual IEEE/ACM International Symposium on Microarchitecture. IEEE Computer Society, 2006, pp. 455-468.
- Proceedings of the 39th Annual IEEE/ACM International Symposium on Microarchitecture. IEEE Computer Society, 2006 , pp. 455-468
- Cho, S.¹ Jin, L.²

31
- 85076887997
- Corey: An operating system for many cores
- USENIX Association
- S. Boyd-Wickizer, H. Chen, R. Chen, Y. Mao, F. Kaashoek, R. Morris, A. Pesterev, L. Stein, M. Wu, Y. Dai et al., "Corey: An operating system for many cores," in Proceedings of the 8th USENIX conference on Operating systems design and implementation. USENIX Association, 2008, pp. 43-57.
- (2008) Proceedings of the 8th USENIX Conference on Operating Systems Design and Implementation , pp. 43-57
- Boyd-Wickizer, S.¹ Chen, H.² Chen, R.³ Mao, Y.⁴ Kaashoek, F.⁵ Morris, R.⁶ Pesterev, A.⁷ Stein, L.⁸ Wu, M.⁹ Dai, Y.¹⁰

32
- 0033645154
- The data locality of work stealing
- ACM
- U. Acar, G. Blelloch, and R. Blumofe, "The data locality of work stealing," in Proceedings of the twelfth annual ACM symposium on Parallel algorithms and architectures . ACM, 2000, pp. 1-12.
- (2000) Proceedings of the Twelfth Annual ACM Symposium on Parallel Algorithms and Architectures , pp. 1-12
- Acar, U.¹ Blelloch, G.² Blumofe, R.³

33
- 84906697125
- Hierarchical work stealing on manycore clusters
- S. Min, C. Iancu, and K. Yelick, "Hierarchical work stealing on manycore clusters," in Fifth Conference on Partitioned Global Address Space Programming Models (PGAS11), 2011.
- Fifth Conference on Partitioned Global Address Space Programming Models (PGAS11), 2011
- Min, S.¹ Iancu, C.² Yelick, K.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.