SCOPUS 정보 검색 플랫폼

Computer Science - Research and Development

Volumn 27, Issue 4, 2012, Pages 277-287

Profiling high performance dense linear algebra algorithms on multicore architectures for power and energy efficiency

(3) Ltaief, Hatem a Luszczek, Piotr b Dongarra, Jack b

a KING ABDULLAH UNIVERSITY OF SCIENCE AND TECHNOLOGY (Saudi Arabia)

b University of Tennessee (United States)

Author keywords

Dense linear algebra; Energy efficiency; Multicore architectures; Power profile; Tile algorithms

Indexed keywords

BLOCK ALGORITHM; DENSE LINEAR ALGEBRA; MULTICORE ARCHITECTURES; PARALLEL PERFORMANCE; POWER PROFILE; POWER PROFILING; SUB-MATRICES; TASK PARALLELISM;

ENERGY EFFICIENCY; LINEAR ALGEBRA; SOFTWARE ARCHITECTURE;

ALGORITHMS;

EID: 84868119278 PISSN: 18652034 EISSN: 18652042 Source Type: Journal
DOI: 10.1007/s00450-011-0191-z Document Type: Article

Times cited : (17)

References (21)

1
- 74049090446
- Comparative study of one-sided factorizations with multiple software packages on multi-core hardware
- ACM New York 10.1145/1654059.1654080 http://doi.acm.org/10.1145/1654059. 1654080
- Agullo E, Hadri B, Ltaief H, Dongarrra J (2009) Comparative study of one-sided factorizations with multiple software packages on multi-core hardware. In: SC '09: proceedings of the conference on high performance computing networking, storage and analysis. ACM, New York, pp 1-12. http://doi.acm.org/10. 1145/1654059.1654080
- (2009) SC '09: Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis , pp. 1-12
- Agullo, E.¹ Hadri, B.² Ltaief, H.³ Dongarrra, J.⁴

2
- 0003706460
- 3 Society for Industrial and Applied Mathematics Philadelphia 0934.65030 10.1137/1.9780898719604
- Anderson E, Bai Z, Bischof C, Blackford SL, Demmel JW, Dongarra JJ, Croz JD, Greenbaum A, Hammarling S, McKenney A, Sorensen DC (1999) LAPACK user's guide, 3rd edn. Society for Industrial and Applied Mathematics, Philadelphia
- (1999) LAPACK User's Guide
- Anderson, E.¹ Bai, Z.² Bischof, C.³ Blackford, S.L.⁴ Demmel, J.W.⁵ Dongarra, J.J.⁶ Croz, J.D.⁷ Greenbaum, A.⁸ Hammarling, S.⁹ McKenney, A.¹⁰ Sorensen, D.C.¹¹

3
- 77958512320
- Energy efficiency of mixed precision iterative refinement methods using hybrid hardware platformsan evaluation of different solver and hardware configurations
- 10.1007/s00450-010-0124-2
- Anzt H, Rocker B, Heuveline V (2010) Energy efficiency of mixed precision iterative refinement methods using hybrid hardware platformsan evaluation of different solver and hardware configurations. Comput Sci 25(3-4):141-148. doi: 10.1007/s00450-010-0124-2
- (2010) Comput Sci , vol.25 , Issue.3-4 , pp. 141-148
- Anzt, H.¹ Rocker, B.² Heuveline, V.³

4
- 77958509771
- A new energy aware performance metric
- 10.1007/s00450-010-0119-z
- Bekas C, Curioni A (2010) A new energy aware performance metric. Comput Sci 25(3-4):187-195. doi: 10.1007/s00450-010-0119-z
- (2010) Comput Sci , vol.25 , Issue.3-4 , pp. 187-195
- Bekas, C.¹ Curioni, A.²

5
- 0012881041
- Algorithm 807: The SBR toolboxsoftware for successive band reduction
- 1941086 10.1145/365723.365736 http://doi.acm.org/10.1145/365723.365736
- Bischof CH, Lang B, Sun X (2000) Algorithm 807: the SBR toolboxsoftware for successive band reduction. ACM Trans Math Softw 26(4):602-616. http://doi.acm.org/10.1145/365723.365736
- (2000) ACM Trans Math Softw , vol.26 , Issue.4 , pp. 602-616
- Bischof, C.H.¹ Lang, B.² Sun, X.³

6
- 35548933706
- Mixed precision iterative refinement techniques for the solution of dense linear systems
- 10.1177/1094342007084026 10.1177/1094342007084026
- Buttari A, Dongarra J, Langou J, Langou J, Luszczek P, Kurzak J (2007) Mixed precision iterative refinement techniques for the solution of dense linear systems. Int J Hight Perform Comput Appl 21(4):457-466. doi: 10.1177/1094342007084026
- (2007) Int J Hight Perform Comput Appl , vol.21 , Issue.4 , pp. 457-466
- Buttari, A.¹ Dongarra, J.² Langou, J.³ Langou, J.⁴ Luszczek, P.⁵ Kurzak, J.⁶

7
- 58149269099
- A class of parallel tiled linear algebra algorithms for multicore architectures
- 2492567 10.1016/j.parco.2008.10.002
- Buttari A, Langou J, Kurzak J, Dongarra J (2009) A class of parallel tiled linear algebra algorithms for multicore architectures. Parallel Comput 35(1):38-53
- (2009) Parallel Comput , vol.35 , Issue.1 , pp. 38-53
- Buttari, A.¹ Langou, J.² Kurzak, J.³ Dongarra, J.⁴

8
- 33746318690
- Reducing power with performance constraints for parallel sparse applications
- IEEE Comput Soc Los Alamitos http://doi.ieeecomputersociety.org/10.1109/ IPDPS.2005.378
- Chen G, Malkowski K, Kandemir MT, Raghavan P (2005) Reducing power with performance constraints for parallel sparse applications. In: IPDPS. IEEE Comput Soc, Los Alamitos. http://doi.ieeecomputersociety.org/10.1109/IPDPS.2005.378
- (2005) IPDPS
- Chen, G.¹ Malkowski, K.² Kandemir, M.T.³ Raghavan, P.⁴

9
- 51049107284
- Towards energy efficient scaling of scientific codes
- IEEE Press New York 10.1109/IPDPS.2008.4536217
- Ding Y, Malkowski K, Raghavan P, Kandemir MT (2008) Towards energy efficient scaling of scientific codes. In: IPDPS. IEEE Press, New York, pp 1-8. doi: 10.1109/IPDPS.2008.4536217
- (2008) IPDPS , pp. 1-8
- Ding, Y.¹ Malkowski, K.² Raghavan, P.³ Kandemir, M.T.⁴

10
- 31844450952
- Using multiple energy gears in MPI programs on a power-scalable cluster
- K. Pingali K.A. Yelick A.S. Grimshaw (eds) Chicago, IL, USA ACM SIGPLAN Notices 40 10.1145/1065944.1065967
- Freeh VW, Lowenthal DK (2005) Using multiple energy gears in MPI programs on a power-scalable cluster. In: Pingali K, Yelick KA, Grimshaw AS (eds) Proceedings of the ACM SIGPLAN symposium on principles and practice of parallel programming (10th PPOPP'2005), Chicago, IL, USA. ACM SIGPLAN Notices, vol 40, pp 164-173
- (2005) Proceedings of the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (10th PPOPP'2005) , pp. 164-173
- Freeh, V.W.¹ Lowenthal, D.K.²

11
- 77950629423
- Powerpack: Energy profiling and analysis of high-performance systems and applications
- 10.1109/TPDS.2009.76
- Ge R, Feng X, Song S, Chang HC, Li D, Cameron KW (2010) Powerpack: Energy profiling and analysis of high-performance systems and applications. IEEE Trans Parallel Distrib Syst PDS-21(5):658-671
- (2010) IEEE Trans Parallel Distrib Syst , vol.21 , Issue.5 , pp. 658-671
- Ge, R.¹ Feng, X.² Song, S.³ Chang, H.C.⁴ Li, D.⁵ Cameron, K.W.⁶

12
- 0004236492
- 3 John Hopkins studies in the mathematical sciences Johns Hopkins University Press Baltimore
- Golub GH, Van Loan CF (1996) Matrix computation, 3rd edn. John Hopkins studies in the mathematical sciences. Johns Hopkins University Press, Baltimore
- (1996) Matrix Computation
- Golub, G.H.¹ Van Loan, C.F.²

13
- 80054983967
- Blocked algorithms for the reduction to Hessenberg-triangular form revisited
- 1157.65348 10.1007/s10543-008-0180-1
- Kågström B, Kressner D, Quintana-Ortí E, Quintana-Ortí G (2008) Blocked algorithms for the reduction to Hessenberg-triangular form revisited. BIT Numer Math 48:563-584
- (2008) BIT Numer Math , vol.48 , pp. 563-584
- Kågström, B.¹ Kressner, D.² Quintana-Ortí, E.³ Quintana-Ortí, G.⁴

14
- 33746618548
- Just in time dynamic voltage scaling: Exploiting inter-node slack to save energy in MPI programs
- IEEE Comput Soc Los Alamitos http://doi.acm.org/10.1145/1105760.1105797
- Kappiah N, Freeh VW, Lowenthal DK (2005) Just in time dynamic voltage scaling: exploiting inter-node slack to save energy in MPI programs. In: SC. IEEE Comput Soc, Los Alamitos, p 33. http://doi.acm.org/10.1145/1105760.1105797
- (2005) SC , pp. 33
- Kappiah, N.¹ Freeh, V.W.² Lowenthal, D.K.³

15
- 66749092384
- Tech Rep TR-2008-13, Department of Computer Science and Engineering. University of Notre Dame
- Kogge P, Bergman K, Borkar S, Campbell D, Carlson W, Dally W, Denneau M, Franzon P, Harrod W, Hill K, Hiller J, Karp S, Keckler S, Klein D, Lucas R, Richards M, Scarpelli A, Scott S, Snavely A, Sterling T, Williams RS, Yelick K (2008) Exascale computing study: technology challenges in achieving exascale systems. Tech Rep TR-2008-13, Department of Computer Science and Engineering. University of Notre Dame
- (2008) Exascale Computing Study: Technology Challenges in Achieving Exascale Systems
- Kogge, P.¹ Bergman, K.² Borkar, S.³ Campbell, D.⁴ Carlson, W.⁵ Dally, W.⁶ Denneau, M.⁷ Franzon, P.⁸ Harrod, W.⁹ Hill, K.¹⁰ Hiller, J.¹¹ Karp, S.¹² Keckler, S.¹³ Klein, D.¹⁴ Lucas, R.¹⁵ Richards, M.¹⁶ Scarpelli, A.¹⁷ Scott, S.¹⁸ Snavely, A.¹⁹ Sterling, T.²⁰ Williams, R.S.²¹ Yelick, K.²² more..

16
- 84868144129
- High performance bidiagonal reduction using tile algorithms on homogeneous multicore architectures
- submitted
- Ltaief H, Luszczek P, Dongarra J (2011, submitted) High performance bidiagonal reduction using tile algorithms on homogeneous multicore architectures. ACM Trans Math Softw
- (2011) ACM Trans Math Softw
- Ltaief, H.¹ Luszczek, P.² Dongarra, J.³

17
- 80053252490
- Two-stage tridiagonal reduction for dense symmetric matrices using tile algorithms on multicore architectures
- ACM Anchorage
- Luszczek P, Ltaief H, Dongarra J (2011) Two-stage tridiagonal reduction for dense symmetric matrices using tile algorithms on multicore architectures. In: Proceedings of IPDPS 2011. ACM, Anchorage
- (2011) Proceedings of IPDPS 2011
- Luszczek, P.¹ Ltaief, H.² Dongarra, J.³

18
- 84876506575
- Multicore application modeling infrastructure (MuMI) project. http://www.mumi-tool.org
- Multicore Application Modeling Infrastructure (MuMI) Project

19
- 13444302326
- The free lunch is over: A fundamental turn toward concurrency in software
- Sutter H (2005) The free lunch is over: a fundamental turn toward concurrency in software. Dr Dobb's Journal 30(3). http://www.ddj.com/184405990
- (2005) Dr Dobb's Journal , vol.30 , Issue.3
- Sutter, H.¹

20
- 0003424374
- SIAM Philadelphia 0874.65013 10.1137/1.9780898719574 http://www.siam.org/ books/OT50/Index.htm
- Trefethen LN, Bau D (1997) Numerical linear algebra. SIAM, Philadelphia. http://www.siam.org/books/OT50/Index.htm
- (1997) Numerical Linear Algebra
- Trefethen, L.N.¹ Bau, D.²

21
- 83155186334
- University of Tennessee Knoxville
- University of Tennessee Knoxville (2010) PLASMA users' guide, parallel linear algebra software for multicore architectures, version 2.3. Available electronically at http://icl.cs.utk.edu/projectsfiles/plasma/pdf/users-guide.pdf
- (2010) PLASMA Users' Guide, Parallel Linear Algebra Software for Multicore Architectures, Version 2.3

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.