SCOPUS Information Retrieval Platform

International Conference for High Performance Computing, Networking, Storage and Analysis, SC

Volume , Issue , 2013, Pages

An improved parallel singular value algorithm and its implementation for multicore hardware

(3) Haidar, Azzam (a); Kurzak, Jakub (a); Luszczek, Piotr (a)

(a) University of Tennessee (United States)

Author keywords

Eigenvalues and eigenvectors; Performance; Reduction to bidiagonal; Singular Value Decomposition; Task parallelism

Indexed keywords

EIGENVALUES AND EIGENFUNCTIONS; HARDWARE; PROGRAM PROCESSORS; SINGULAR VALUE DECOMPOSITION;

BIDIAGONAL; EIGENVALUES AND EIGENVECTORS; KERNEL OPTIMIZATIONS; OFF-CHIP COMMUNICATION; OPTIMIZATION TECHNIQUES; OPTIMIZED IMPLEMENTATION; PERFORMANCE; TASK PARALLELISM;

OPTIMIZATION;

EID: 84899676338 PISSN: 21674329 EISSN: 21674337 Source Type: Conference Proceeding
DOI: 10.1145/2503210.2503292 Document Type: Conference Paper

Times cited : (24)

References (63)

1
- 77953997924
- Numerical linear algebra on emerging architectures: The PLASMA and MAGMA projects
- DOI: 10.1088/1742-6596/180/1/012037
- E. Agullo, J. Demmel, J. Dongarra, B. Hadri, J. Kurzak, J. Langou, H. Ltaief, P. Luszczek, and S. Tomov. Numerical linear algebra on emerging architectures: The PLASMA and MAGMA projects. J. Phys.: Conf. Ser., 180(1), 2009. DOI: 10.1088/1742-6596/180/1/012037.
- (2009) J. Phys.: Conf. Ser. , vol.180 , Issue.1
- Agullo, E.¹ Demmel, J.² Dongarra, J.³ Hadri, B.⁴ Kurzak, J.⁵ Langou, J.⁶ Ltaief, H.⁷ Luszczek, P.⁸ Tomov, S.⁹

2
- 74049090446
- Comparative study of one-sided factorizations with multiple software packages on multi-core hardware
- New York, NY, USA, 2009. ACM
- E. Agullo, B. Hadri, H. Ltaief, and J. Dongarra. Comparative study of one-sided factorizations with multiple software packages on multi-core hardware. In SC'09: Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis, pages 1-12, New York, NY, USA, 2009. ACM.
- (2009) SC'09: Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis , pp. 1-12
- Agullo, E.¹ Hadri, B.² Ltaief, H.³ Dongarra, J.⁴

3
- 0003706460
- Philadelphia, PA
- E. Anderson, Z. Bai, C. Bischof, L. S. Blackford, J. W. Demmel, J. J. Dongarra, J. Du Croz, A. Greenbaum, S. Hammarling, A. McKenney, and D. Sorensen. LAPACK Users' Guide. SIAM, Philadelphia, PA, 1992. http://www.netlib.org/lapack/lug/.
- (1992) LAPACK Users' Guide SIAM
- Anderson, E.¹ Bai, Z.² Bischof, C.³ Blackford, L.S.⁴ Demmel, J.W.⁵ Dongarra, J.J.⁶ Du Croz, J.⁷ Greenbaum, A.⁸ Hammarling, S.⁹ McKenney, A.¹⁰ Sorensen, D.¹¹

4
- 0017097208
- Singular value decompositions and digital image processing
- February
- H. C. Andrews and C. L. Patterson. Singular value decompositions and digital image processing. IEEE Trans Acoust., Speech, Signal Processing ASSP-24, 1, February 1976.
- (1976) IEEE Trans Acoust., Speech, Signal Processing ASSP-24 , vol.1
- Andrews, H.C.¹ Patterson, C.L.²

5
- 0001045187
- Numerical techniques in mathematical programming
- New York. Academic Press
- R. H. Bartels, G. H. Golub, and M. Saunders. Numerical techniques in mathematical programming. In Nonlinear Programming, pages 123-176, New York, 1971. Academic Press.
- (1971) Nonlinear Programming , pp. 123-176
- Bartels, R.H.¹ Golub, G.H.² Saunders, M.³

6
- 84949180378
- editors, Methods High-Performance Scientific Computing. Springer, London Dordrecht Heidelberg New York, 2012. ISBN 978-1-4471-2436-8 e-ISBN 978-1-4471-2437-5 DOI 10.1007/978-1-4471-2437-5
- M. Becka, G. Oksa, and M. Vajtersic. Parallel Block-Jacobi SVD. In M. W. Berry, K. A. Gallivan, E. Gallopoulos, A. Grama, B. Philippe, Y. Saad, and F. Saied, editors, Methods High-Performance Scientific Computing, pages 185-197. Springer, London Dordrecht Heidelberg New York, 2012. ISBN 978-1-4471-2436-8 e-ISBN 978-1-4471-2437-5 DOI 10.1007/978-1-4471-2437-5.
- Parallel block-jacobi svd , pp. 185-197
- Becka, M.¹ Oksa, G.² Vajtersic, M.³ Berry, M.W.⁴ Gallivan, K.A.⁵ Gallopoulos, E.⁶ Grama, A.⁷ Philippe, B.⁸ Saad, Y.⁹ Saied, F.¹⁰

7
- 0028572905
- Parallel tridiagonalization through two-step band reduction
- IEEE Computer Society Press
- C. Bischof, B. Lang, and X. Sun. Parallel tridiagonalization through two-step band reduction. In Proceedings of the Scalable High-Performance Computing Conference, pages 23-27. IEEE Computer Society Press, 1994.
- (1994) Proceedings of the Scalable High-Performance Computing Conference , pp. 23-27
- Bischof, C.¹ Lang, B.² Sun, X.³

8
- 0012881041
- Algorithm 807: The SBR Toolbox-software for successive band reduction
- C. H. Bischof, B. Lang, and X. Sun. Algorithm 807: The SBR Toolbox-software for successive band reduction. ACM TOMS, 26(4):602-616, 2000.
- (2000) ACM TOMS , vol.26 , Issue.4 , pp. 602-616
- Bischof, C.H.¹ Lang, B.² Sun, X.³

9
- 0001951009
- The WY representation for products of Householder matrices
- C. H. Bischof and C. V. Loan. The WY representation for products of Householder matrices. SIAM J. Sci. Statist. Comput., 8:s2-s13, 1987.
- (1987) SIAM J. Sci. Statist. Comput. , vol.8 , pp. s2-s13
- Bischof, C.H.¹ Loan, C.V.²

10
- 0005571418
- Ghosts in tomography: The effects of poor angular coverage in 2-D seismic traveltime inversion
- N. Bregman, R. Bailey, and C. Chapman. Ghosts in tomography: The effects of poor angular coverage in 2-D seismic traveltime inversion. Can. J. Explor. Geophys, 25(1):7-27, 1989.
- (1989) Can. J. Explor. Geophys , vol.25 , Issue.1 , pp. 7-27
- Bregman, N.¹ Bailey, R.² Chapman, C.³

11
- 38049058008
- The impact of multicore on math software
- B. Kagström, E. Elmroth, J. Dongarra, and J. Wasniewski, editors, Applied Parallel Computing. State of the Art in Scientific Computing. Springer
- A. Buttari, J. Dongarra, J. Kurzak, J. Langou, P. Luszczek, and S. Tomov. The impact of multicore on math software. In B. Kagström, E. Elmroth, J. Dongarra, and J. Wasniewski, editors, Applied Parallel Computing. State of the Art in Scientific Computing, 8th International Workshop, PARA, volume 4699 of Lecture Notes in Computer Science, pages 1-10. Springer, 2006.
- (2006) 8th International Workshop, PARA, Volume 4699 of Lecture Notes in Computer Science , pp. 1-10
- Buttari, A.¹ Dongarra, J.² Kurzak, J.³ Langou, J.⁴ Luszczek, P.⁵ Tomov, S.⁶

12
- 36048997493
- Multithreading for synchronization tolerance in matrix factorization
- Boston, MA, June 24-28 2007. Journal of Physics: Conference Series, IOP Publishing. DOI: 10.1088/1742-6596/78/1/012028
- A. Buttari, J. J. Dongarra, P. Husbands, J. Kurzak, and K. Yelick. Multithreading for synchronization tolerance in matrix factorization. In Scientific Discovery through Advanced Computing, SciDAC 2007, Boston, MA, June 24-28 2007. Journal of Physics: Conference Series 78:012028, IOP Publishing. DOI: 10.1088/1742-6596/78/1/012028.
- Scientific Discovery Through Advanced Computing, SciDAC 2007 , vol.78 , pp. 012028
- Buttari, A.¹ Dongarra, J.J.² Husbands, P.³ Kurzak, J.⁴ Yelick, K.⁵

13
- 50249105132
- Parallel tiled QR factorization for multicore architectures
- DOI: 10.1002/cpe.1301
- A. Buttari, J. Langou, J. Kurzak, and J. J. Dongarra. Parallel tiled QR factorization for multicore architectures. Concurrency Computat.: Pract. Exper., 20(13):1573-1590, 2008. DOI: 10.1002/cpe.1301.
- (2008) Concurrency Computat.: Pract. Exper. , vol.20 , Issue.13 , pp. 1573-1590
- Buttari, A.¹ Langou, J.² Kurzak, J.³ Dongarra, J.J.⁴

14
- 58149269099
- Class of parallel tiled linear algebra algorithms for multicore architectures
- DOI: 10.1016/j.parco.2008.10.002
- A. Buttari, J. Langou, J. Kurzak, and J. J. Dongarra. A class of parallel tiled linear algebra algorithms for multicore architectures. Parallel Comput. Syst. Appl., 35:38-53, 2009. DOI: 10.1016/j.parco.2008.10.002.
- (2009) Parallel Comput. Syst. Appl. , vol.35 , pp. 38-53
- Buttari, A.¹ Langou, J.² Kurzak, J.³ Dongarra, J.J.⁴

15
- 35248843628
- Supermatrix out-of-order scheduling of matrix operations for SMP and multi-core architectures
- New York, NY, USA, 2007. ACM
- E. Chan, E. S. Quintana-Orti, G. Quintana-Orti, and R. van de Geijn. Supermatrix out-of-order scheduling of matrix operations for SMP and multi-core architectures. In SPAA'07: Proceedings of the nineteenth annual ACM symposium on Parallel algorithms and architectures, pages 116-125, New York, NY, USA, 2007. ACM.
- (2007) SPAA'07: Proceedings of the Nineteenth Annual ACM Symposium on Parallel Algorithms and Architectures , pp. 116-125
- Chan, E.¹ Quintana-Orti, E.S.² Quintana-Orti, G.³ Van de Geijn, R.⁴

16
- 84899693521
- Algorithm 8xx: PIRO BAND, Pipelined Plane Rotations for Blocked Band Reduction
- T. A. Davis and S. Rajamanickam. Algorithm 8xx: PIRO BAND, Pipelined Plane Rotations for Blocked Band Reduction. Submitted to ACM TOMS, Available at www.cise.ufl.edu/srajaman/genband.pdf, 2010.
- (2010) Submitted to ACM TOMS, Available at www.cise.ufl.edu/srajaman/genband.pdf
- Davis, T.A.¹ Rajamanickam, S.²

17
- 0026238244
- The bidiagonal singular value decomposition and Hamiltonian mechanics
- October 1991. (LAPACK Working Note #11)
- P. Deift, J. W. Demmel, L.-C. Li, and C. Tomei. The bidiagonal singular value decomposition and Hamiltonian mechanics. SIAM J. Numer. Anal., 28(5):1463-1516, October 1991. (LAPACK Working Note #11).
- SIAM J. Numer. Anal. , vol.28 , Issue.5 , pp. 1463-1516
- Deift, P.¹ Demmel, J.W.² Li, L.-C.³ Tomei, C.⁴

18
- 0001192187
- Accurate singular values of bidiagonal matrices
- September 1990. (Also LAPACK LAWN #3)
- J. W. Demmel and W. Kahan. Accurate singular values of bidiagonal matrices. SIAM J. Sci. Stat. Comput., 11(5):873-912, September 1990. (Also LAPACK LAWN #3).
- SIAM J. Sci. Stat. Comput. , vol.11 , Issue.5 , pp. 873-912
- Demmel, J.W.¹ Kahan, W.²

19
- 84899693051
- Exploiting fine-grain parallelism in recursive LU factorization
- Ghent, Belgium, August 30-September 2
- J. Dongarra, M. Faverge, H. Ltaief, and P. Luszczek. Exploiting fine-grain parallelism in recursive LU factorization. In ParCo 2011-International Conference on Parallel Computing, Ghent, Belgium, August 30-September 2 2011.
- (2011) ParCo 2011-International Conference on Parallel Computing
- Dongarra, J.¹ Faverge, M.² Ltaief, H.³ Luszczek, P.⁴

20
- 84857663656
- High performance matrix inversion based on LU factorization for multicore architectures
- New York, NY, USA, 2011. ACM
- J. Dongarra, M. Faverge, H. Ltaief, and P. Luszczek. High performance matrix inversion based on LU factorization for multicore architectures. In Proceedings of the 2011 ACM international workshop on Many task computing on grids and supercomputers, MTAGS'11, pages 33-42, New York, NY, USA, 2011. ACM.
- Proceedings of the 2011 ACM International Workshop on Many Task Computing on Grids and Supercomputers, MTAGS'11 , pp. 33-42
- Dongarra, J.¹ Faverge, M.² Ltaief, H.³ Luszczek, P.⁴

21
- 84899693051
- Exploiting fine-grain parallelism in recursive LU factorization
- ISBN 978-1-61499-040-6 (print); ISBN 978-1-61499-041-3
- J. Dongarra, M. Faverge, H. Ltaief, and P. Luszczek. Exploiting fine-grain parallelism in recursive LU factorization. Advances in Parallel Computing, Special Issue, 22:429-436, 2012. ISBN 978-1-61499-040-6 (print); ISBN 978-1-61499-041-3 (online).
- (2012) Advances in Parallel Computing, Special Issue , vol.22 , pp. 429-436
- Dongarra, J.¹ Faverge, M.² Ltaief, H.³ Luszczek, P.⁴

22
- 0000257176
- Block reduction of matrices to condensed forms for eigenvalue computations
- J. J. Dongarra, D. C. Sorensen, and S. J. Hammarling. Block reduction of matrices to condensed forms for eigenvalue computations. Journal of Computational and Applied Mathematics, 27(1-2):215-227, 1989.
- (1989) Journal of Computational and Applied Mathematics , vol.27 , Issue.1-2 , pp. 215-227
- Dongarra, J.J.¹ Sorensen, D.C.² Hammarling, S.J.³

23
- 0023488724
- Seismic waveform modeling in heterogeneous media by ray perturbation theory
- V. Farra and R. Madariaga. Seismic waveform modeling in heterogeneous media by ray perturbation theory. Journal of Geophysical Research: Solid Earth, 92(B3):2697-2712, 1987.
- (1987) Journal of Geophysical Research: Solid Earth , vol.92 , Issue.B3 , pp. 2697-2712
- Farra, V.¹ Madariaga, R.²

24
- 21344496407
- Accurate singular values and differential qd algorithms
- V. Fernando and B. Parlett. Accurate singular values and differential qd algorithms. Numerische Math., 67:191-229, 1994.
- (1994) Numerische Math. , vol.67 , pp. 191-229
- Fernando, V.¹ Parlett, B.²

25
- 84879834718
- Multi-sweep algorithms for the symmetric eigenproblem
- Springer
- W. Gansterer, D. Kvasnicka, and C. Ueberhuber. Multi-sweep algorithms for the symmetric eigenproblem. In Vector and Parallel Processing-VECPAR'98, volume 1573 of Lecture Notes in Computer Science, pages 20-28. Springer, 1999.
- (1999) Vector and Parallel Processing-VECPAR'98, Volume 1573 of Lecture Notes in Computer Science , pp. 20-28
- Gansterer, W.¹ Kvasnicka, D.² Ueberhuber, C.³

26
- 0000288016
- Calculating the singular values and pseudoinverse of a matrix
- G. H. Golub and W. Kahan. Calculating the singular values and pseudoinverse of a matrix. SIAM J. Numer. Anal., 2(3):205-224, 1965.
- (1965) SIAM J. Numer. Anal. , vol.2 , Issue.3 , pp. 205-224
- Golub, G.H.¹ Kahan, W.²

27
- 0004236492
- The Johns Hopkins University Press, 4th edition, December 27 2012. ISBN-10: 1421407949, ISBN-13: 978-1421407944
- G. H. Golub and C. F. V. Loan. Matrix Computations. The Johns Hopkins University Press, 4th edition, December 27 2012. ISBN-10: 1421407949, ISBN-13: 978-1421407944.
- Matrix Computations
- Golub, G.H.¹ Loan, C.F.V.²

28
- 0007051545
- J. Wilkinson and C. Reinsch, editors, Handbook for Automatic Computation, II, Linear Algebra. Springer-Verlag, New York
- G. H. Golub and C. Reinsch. Singular value decomposition and least squares solutions. In J. Wilkinson and C. Reinsch, editors, Handbook for Automatic Computation, II, Linear Algebra. Springer-Verlag, New York, 1971.
- (1971) Singular Value Decomposition and Least Squares Solutions
- Golub, G.H.¹ Reinsch, C.²

29
- 0017011163
- Ill-conditioned eigensystems and the computation of the Jordan canonical form
- October
- G. H. Golub and J. H. Wilkinson. Ill-conditioned eigensystems and the computation of the Jordan canonical form. SIAM Rev., 18(4), October 1976.
- (1976) SIAM Rev. , vol.18 , Issue.4
- Golub, G.H.¹ Wilkinson, J.H.²

30
- 0542421948
- The solution of large dense generalized eigenvalue problems on the cray X-MP/24 with SSD
- April
- R. Grimes, H. Krakauer, J. Lewis, H. Simon, and S.-H. Wei. The solution of large dense generalized eigenvalue problems on the cray X-MP/24 with SSD. J. Comput. Phys., 69:471-481, April 1987.
- (1987) J. Comput. Phys. , vol.69 , pp. 471-481
- Grimes, R.¹ Krakauer, H.² Lewis, J.³ Simon, H.⁴ Wei, S.-H.⁵

31
- 0024082507
- Solution of large, dense symmetric generalized eigenvalue problems using secondary storage
- September
- R. G. Grimes and H. D. Simon. Solution of large, dense symmetric generalized eigenvalue problems using secondary storage. ACM Transactions on Mathematical Software, 14:241-256, September 1988.
- (1988) ACM Transactions on Mathematical Software , vol.14 , pp. 241-256
- Grimes, R.G.¹ Simon, H.D.²

32
- 1542533583
- A divide-and-conquer algorithm for the bidiagonal svd
- M. Gu and S. Eisenstat. A divide-and-conquer algorithm for the bidiagonal SVD. SIAM J. Mat. Anal. Appl., 16:79-92, 1995.
- (1995) SIAM J. Mat. Anal. Appl. , vol.16 , pp. 79-92
- Gu, M.¹ Eisenstat, S.²

33
- 84862107202
- Parallel and cache-efficient in-place matrix storage format conversion
- article 17. DOI: 10.1145/2168773.2168775
- F. G. Gustavson, L. Karlsson, and B. Kagström. Parallel and cache-efficient in-place matrix storage format conversion. ACM Trans. Math. Soft., 38(3):article 17, 2012. DOI: 10.1145/2168773.2168775.
- (2012) ACM Trans. Math. Soft. , vol.38 , Issue.3
- Gustavson, F.G.¹ Karlsson, L.² Kagström, B.³

34
- 83155188961
- Parallel reduction to condensed forms for symmetric eigenvalue problems using aggregated fine-grained and memory-aware kernels
- New York, NY, USA. ACM
- A. Haidar, H. Ltaief, and J. Dongarra. Parallel reduction to condensed forms for symmetric eigenvalue problems using aggregated fine-grained and memory-aware kernels. In Proceedings of SC'11, pages 8:1-8:11, New York, NY, USA, 2011. ACM.
- (2011) Proceedings of SC'11 , pp. 8:1-8:11
- Haidar, A.¹ Ltaief, H.² Dongarra, J.³

35
- 84866876895
- A comprehensive study of task coalescing for selecting parallelism granularity in a two-stage bidiagonal reduction
- Shanghai, China, May 21-25, ISBN 978-1-4673-0975-2
- A. Haidar, H. Ltaief, P. Luszczek, and J. Dongarra. A comprehensive study of task coalescing for selecting parallelism granularity in a two-stage bidiagonal reduction. In Proceedings of the IEEE International Parallel and Distributed Processing Symposium, Shanghai, China, May 21-25 2012. ISBN 978-1-4673-0975-2.
- (2012) Proceedings of the IEEE International Parallel and Distributed Processing Symposium
- Haidar, A.¹ Ltaief, H.² Luszczek, P.³ Dongarra, J.⁴

36
- 84860412769
- Analysis of dynamically scheduled tile algorithms for dense linear algebra on multicore architectures
- DOI: 10.1002/cpe.1829
- A. Haidar, H. Ltaief, A. YarKhan, and J. J. Dongarra. Analysis of dynamically scheduled tile algorithms for dense linear algebra on multicore architectures. Concurrency Computat.: Pract. Exper., 2011. DOI: 10.1002/cpe.1829.
- (2011) Concurrency Computat.: Pract. Exper.
- Haidar, A.¹ Ltaief, H.² Yarkhan, A.³ Dongarra, J.J.⁴

37
- 84899682038
- New multi-stage algorithm for symmetric eigenvalues and eigenvectors achieves two-fold speedup
- Aachen, Germany, August 26-30 (submitted)
- A. Haidar, P. Luszczek, and J. Dongarra. New multi-stage algorithm for symmetric eigenvalues and eigenvectors achieves two-fold speedup. In Euro-Par 2013, Aachen, Germany, August 26-30 2013. (submitted).
- (2013) Euro-Par 2013
- Haidar, A.¹ Luszczek, P.² Dongarra, J.³

38
- 84879834697
- A novel hybrid CPU-GPU generalized eigensolver for electronic structure calculations based on fine grained memory aware tasks
- September (accepted)
- A. Haidar, S. Tomov, J. Dongarra, R. Solca, and T. Schulthess. A novel hybrid CPU-GPU generalized eigensolver for electronic structure calculations based on fine grained memory aware tasks. International Journal of High Performance Computing Applications, September 2012. (accepted).
- (2012) International Journal of High Performance Computing Applications
- Haidar, A.¹ Tomov, S.² Dongarra, J.³ Solca, R.⁴ Schulthess, T.⁵

39
- 0015127540
- A numerical method for solving Fredholm integral equations of the first kind using singular values
- R. J. Hanson. A numerical method for solving Fredholm integral equations of the first kind using singular values. SIAM J. Numer. Anal., 8(3):616-626, 1971.
- (1971) SIAM J. Numer. Anal. , vol.8 , Issue.3 , pp. 616-626
- Hanson, R.J.¹

40
- 58149421595
- Analysis of a complex of statistical variables into principal components
- 498-520
- H. Hotelling. Analysis of a complex of statistical variables into principal components. J. Educ. Psych., 24:417-441, 498-520, 1933.
- (1933) J. Educ. Psych. , vol.24 , pp. 417-441, 498-520
- Hotelling, H.¹

41
- 0002467254
- Simplified calculation of principal components
- H. Hotelling. Simplified calculation of principal components. Psychometrika, 1:27-35, 1935.
- (1935) Psychometrika , vol.1 , pp. 27-35
- Hotelling, H.¹

42
- 0000652188
- Unitary triangularization of a nonsymmetric matrix
- October. DOI 10.1145/320941.320947
- A. S. Householder. Unitary triangularization of a nonsymmetric matrix. Journal of the ACM (JACM), 5(4), October 1958. DOI 10.1145/320941.320947.
- (1958) Journal of the ACM (JACM) , vol.5 , Issue.4
- Householder, A.S.¹

43
- 84872201157
- Intel
- Intel. Math Kernel Library.
- Math Kernel Library

44
- 21344498628
- A Parallel Algorithm for Computing the Singular Value Decomposition of a Matrix
- E. R. Jessup and D. Sorensen. A Parallel Algorithm for Computing the Singular Value Decomposition of a Matrix. SIAM J. Matrix Anal. Appl., 15:530-548, 1994.
- (1994) SIAM J. Matrix Anal. Appl. , vol.15 , pp. 530-548
- Jessup, E.R.¹ Sorensen, D.²

45
- 0346688721
- Information filtering using the Riemannian SVD (R-SVD)
- A. Ferreira, J. D. P. Rolim, H. D. Simon, and S.-H. Teng, editors, Berkeley, California, USA, August 9-11, Proceedings, volume 1457 of Lecture Notes in Computer Science. Springer, 1998
- E. P. Jiang and M. W. Berry. Information filtering using the Riemannian SVD (R-SVD). In A. Ferreira, J. D. P. Rolim, H. D. Simon, and S.-H. Teng, editors, Solving Irregularly Structured Problems in Parallel, 5th International Symposium, IRREGULAR 98, Berkeley, California, USA, August 9-11, 1998, Proceedings, volume 1457 of Lecture Notes in Computer Science, pages 386-395. Springer, 1998.
- (1998) Solving Irregularly Structured Problems in Parallel, 5th International Symposium, IRREGULAR 98 , pp. 386-395
- Jiang, E.P.¹ Berry, M.W.²

46
- 49349111725
- Solving systems of linear equation on the CELL processor using Cholesky factorization
- DOI: TPDS.2007.70813
- J. Kurzak, A. Buttari, and J. J. Dongarra. Solving systems of linear equation on the CELL processor using Cholesky factorization. Trans. Parallel Distrib. Syst., 19(9):1175-1186, 2008. DOI: TPDS.2007.70813.
- (2008) Trans. Parallel Distrib. Syst. , vol.19 , Issue.9 , pp. 1175-1186
- Kurzak, J.¹ Buttari, A.² Dongarra, J.J.³

47
- 80053238375
- QR factorization for the CELL processor
- DOI: 10.3233/SPR-2008-0268
- J. Kurzak and J. J. Dongarra. QR factorization for the CELL processor. Scientific Programming, 00:1-12, 2008. DOI: 10.3233/SPR-2008-0268.
- (2008) Scientific Programming , vol.0 , pp. 1-12
- Kurzak, J.¹ Dongarra, J.J.²

48
- 73149105729
- Scheduling dense linear algebra operations on multicore processors
- DOI: 10.1002/cpe.1467
- J. Kurzak, H. Ltaief, J. J. Dongarra, and R. M. Badia. Scheduling dense linear algebra operations on multicore processors. Concurrency Computat.: Pract. Exper., 21(1):15-44, 2009. DOI: 10.1002/cpe.1467.
- (2009) Concurrency Computat.: Pract. Exper. , vol.21 , Issue.1 , pp. 15-44
- Kurzak, J.¹ Ltaief, H.² Dongarra, J.J.³ Badia, R.M.⁴

49
- 0040250198
- A parallel algorithm for reducing symmetric banded matrices to tridiagonal form
- November
- B. Lang. A parallel algorithm for reducing symmetric banded matrices to tridiagonal form. SIAM J. Sci. Comput., 14:1320-1338, November 1993.
- (1993) SIAM J. Sci. Comput. , vol.14 , pp. 1320-1338
- Lang, B.¹

50
- 0032678430
- Efficient eigenvalue and singular value computations on shared memory machines
- B. Lang. Efficient eigenvalue and singular value computations on shared memory machines. Parallel Computing, 25(7):845-860, 1999.
- (1999) Parallel Computing , vol.25 , Issue.7 , pp. 845-860
- Lang, B.¹

51
- 0003476369
- Prentice-Hall, Englewood Cliffs, N. J.
- C. Lawson and R. Hanson. Solving Least Squares Problems. Prentice-Hall, Englewood Cliffs, N. J., 1974.
- (1974) Solving Least Squares Problems
- Lawson, C.¹ Hanson, R.²

52
- 77649275879
- Parallel band two-sided matrix bidiagonalization for multicore architectures
- April
- H. Ltaief, J. Kurzak, and J. Dongarra. Parallel band two-sided matrix bidiagonalization for multicore architectures. IEEE Transactions on Parallel and Distributed Systems, 21(4), April 2010.
- (2010) IEEE Transactions on Parallel and Distributed Systems , vol.21 , Issue.4
- Ltaief, H.¹ Kurzak, J.² Dongarra, J.³

53
- 84877905452
- High performance bidiagonal reduction using tile algorithms on homogeneous multicore architectures
- In publication
- H. Ltaief, P. Luszczek, and J. Dongarra. High Performance Bidiagonal Reduction using Tile Algorithms on Homogeneous Multicore Architectures. ACM TOMS, 39(3), 2013. In publication.
- (2013) ACM TOMS , vol.39 , Issue.3
- Ltaief, H.¹ Luszczek, P.² Dongarra, J.³

54
- 84865266292
- Enhancing parallelism of tile bidiagonal transformation on multicore architectures using tree reduction
- R. Wyrzykowski, J. Dongarra, K. Karczewski, and J. Wasniewski, editors, Torun, Poland
- H. Ltaief, P. Luszczek, A. Haidar, and J. Dongarra. Enhancing parallelism of tile bidiagonal transformation on multicore architectures using tree reduction. In R. Wyrzykowski, J. Dongarra, K. Karczewski, and J. Wasniewski, editors, Proceedings of 9th International Conference, PPAM 2011, volume 7203, pages 661-670, Torun, Poland, 2012.
- (2012) Proceedings of 9th International Conference, PPAM 2011 , vol.7203 , pp. 661-670
- Ltaief, H.¹ Luszczek, P.² Haidar, A.³ Dongarra, J.⁴

55
- 80053252490
- Two-stage tridiagonal reduction for dense symmetric matrices using tile algorithms on multicore architectures
- Anchorage, Alaska, USA, May 16-20
- P. Luszczek, H. Ltaief, and J. Dongarra. Two-stage tridiagonal reduction for dense symmetric matrices using tile algorithms on multicore architectures. In IPDPS 2011: IEEE International Parallel and Distributed Processing Symposium, Anchorage, Alaska, USA, May 16-20 2011.
- (2011) IPDPS 2011: IEEE International Parallel and Distributed Processing Symposium
- Luszczek, P.¹ Ltaief, H.² Dongarra, J.³

56
- 85057401994
- Parallel methods for the singular value decomposition
- E. Kontoghiorghes, editor. Chapman & Hall/CRC
- B. P. M. Berry, D. Mezher and A. Sameh. Parallel methods for the singular value decomposition. In E. Kontoghiorghes, editor, Parallel Computing and Statistics, pages 117-164. Chapman & Hall/CRC, 2006.
- (2006) Parallel Computing and Statistics , pp. 117-164
- Berry, B.P.M.¹ Mezher, D.² Sameh, A.³

57
- 0019533482
- Principal component analysis in linear systems: Controllability, observability, and model reduction
- February
- B. C. Moore. Principal component analysis in linear systems: Controllability, observability, and model reduction. IEEE Transactions on Automatic Control, AC-26(1), February 1981.
- (1981) IEEE Transactions on Automatic Control , vol.AC-26 , Issue.1
- Moore, B.C.¹

58
- 0003868214
- Academic Press, New York
- G. W. Stewart. Introduction to Matrix Computations. Academic Press, New York, 1973.
- (1973) Introduction to Matrix Computations
- Stewart, G.W.¹

59
- 0347737736
- The decompositional approach to matrix computation
- Jan/Feb. ISSN: 1521-9615; DOI 10.1109/5992.814658
- G. W. Stewart. The decompositional approach to matrix computation. Computing in Science & Engineering, 2(1):50-59, Jan/Feb 2000. ISSN: 1521-9615; DOI 10.1109/5992.814658.
- (2000) Computing in Science & Engineering , vol.2 , Issue.1 , pp. 50-59
- Stewart, G.W.¹

60
- 83155186334
- University of Tennessee Knoxville., November
- University of Tennessee Knoxville. PLASMA Users' Guide, Parallel Linear Algebra Software for Multicore Architectures, Version 2.3, November 2010.
- (2010) PLASMA Users' Guide, Parallel Linear Algebra Software for Multicore Architectures, Version 2.3

61
- 0025467711
- A bridging model for parallel computation
- Aug. DOI 10.1145/79173.79181
- L. G. Valiant. A bridging model for parallel computation. Communications of the ACM, 33(8), Aug. 1990. DOI 10.1145/79173.79181.
- (1990) Communications of the ACM , vol.33 , Issue.8
- Valiant, L.G.¹

62
- 84899698517
- Technical Report CSD-05-1376, University of California Berkeley, Computer Science Division. Also available as the LAPACK Working Note 166
- P. R. Willems, B. Lang, and C. Vömel. Computing the bidiagonal SVD using multiple relatively robust representations. Technical Report CSD-05-1376, University of California Berkeley, Computer Science Division, 2005. Also available as the LAPACK Working Note 166.
- (2005) Computing the Bidiagonal SVD Using Multiple Relatively Robust Representations
- Willems, P.R.¹ Lang, B.² Vömel, C.³

63
- 84857683415
- Technical Report ICL-UT-11-02, Innovative Computing Laboratory, University of Tennessee, April
- A. YarKhan, J. Kurzak, and J. Dongarra. QUARK users' guide: QUeueing And Runtime for Kernels. Technical Report ICL-UT-11-02, Innovative Computing Laboratory, University of Tennessee, April 2011.
- (2011) QUARK Users' Guide: QUeueing and Runtime for Kernels
- Yarkhan, A.¹ Kurzak, J.² Dongarra, J.³

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.