메뉴 건너뛰기




Volumn 4967 LNCS, Issue , 2008, Pages 639-648

Parallel tiled QR factorization for multicore architectures

Author keywords

[No Author keywords available]

Indexed keywords

ALGEBRA; ALGORITHMS; BOOLEAN FUNCTIONS; COMPUTATIONAL METHODS; EVOLUTIONARY ALGORITHMS; FACTORIZATION; LEARNING ALGORITHMS; LINEAR ALGEBRA; MULTITASKING; PAPER; SCHEDULING ALGORITHMS; STANDARDS; STATISTICS; SUPERCOMPUTERS; TREES (MATHEMATICS);

EID: 45449096678     PISSN: 03029743     EISSN: 16113349     Source Type: Book Series    
DOI: 10.1007/978-3-540-68111-3_67     Document Type: Conference Paper
Times cited : (6)

References (26)
  • 2
    • 45449102400 scopus 로고    scopus 로고
    • Teraflops research chip
    • Teraflops research chip, http://www.intel.com/research/platform/ terascale/teraflops.htm
  • 5
    • 35248868578 scopus 로고    scopus 로고
    • Implementing linear algebra routines on multi-core processors with pipelining and a look ahead
    • Also available as UT-CS-06-581, September
    • Kurzak, J., Dongarra, J.: Implementing linear algebra routines on multi-core processors with pipelining and a look ahead. LAPACK Working Note 178 (September 2006), Also available as UT-CS-06-581
    • (2006) LAPACK Working Note , vol.178
    • Kurzak, J.1    Dongarra, J.2
  • 6
    • 38049058008 scopus 로고    scopus 로고
    • Buttari, A., Dongarra, J., Kurzak, J., Langou, J., Luszczek, P., Tomov, S.: The impact of multicore on math software. In: Kågström, B., Elmroth, E., Dongarra, J., Waśniewski, J. (eds.) PARA 2006. LNCS, 4699, pp. 1-10. Springer, Heidelberg (2007)
    • Buttari, A., Dongarra, J., Kurzak, J., Langou, J., Luszczek, P., Tomov, S.: The impact of multicore on math software. In: Kågström, B., Elmroth, E., Dongarra, J., Waśniewski, J. (eds.) PARA 2006. LNCS, vol. 4699, pp. 1-10. Springer, Heidelberg (2007)
  • 8
    • 1842832833 scopus 로고    scopus 로고
    • Recursive blocked algorithms and hybrid data structures for dense matrix library software
    • Elmroth, E., Gustavson, F., Jonsson, I., Kågström, B.: Recursive blocked algorithms and hybrid data structures for dense matrix library software. SIAM Review 46(1), 3-45 (2004)
    • (2004) SIAM Review , vol.46 , Issue.1 , pp. 3-45
    • Elmroth, E.1    Gustavson, F.2    Jonsson, I.3    Kågström, B.4
  • 9
    • 38049087210 scopus 로고    scopus 로고
    • Gustavson, F., Karlsson, L., Kågström, B.: Three algorithms for cholesky factorization on distributed memory using packed storage. In: Kågström, B., Elmroth, E., Dongarra, J., Waśniewski, J. (eds.) PARA 2006. LNCS, 4699, pp. 550-559. Springer, Heidelberg (2007)
    • Gustavson, F., Karlsson, L., Kågström, B.: Three algorithms for cholesky factorization on distributed memory using packed storage. In: Kågström, B., Elmroth, E., Dongarra, J., Waśniewski, J. (eds.) PARA 2006. LNCS, vol. 4699, pp. 550-559. Springer, Heidelberg (2007)
  • 10
    • 45449118422 scopus 로고    scopus 로고
    • Kurzak, J., Buttari, A., Dongarra, J.: Solving systems of linear equations on the CELL processor using Cholesky factorization. Technical Report UT-CS-07-596, Innovative Computing Laboratory, University of Tennessee Knoxville (April 2007)
    • Kurzak, J., Buttari, A., Dongarra, J.: Solving systems of linear equations on the CELL processor using Cholesky factorization. Technical Report UT-CS-07-596, Innovative Computing Laboratory, University of Tennessee Knoxville (April 2007)
  • 11
    • 0020593101 scopus 로고
    • Solving linear algebraic equations on an mimd computer
    • Lord, R.E., Kowalik, J.S., Kumar, S.P.: Solving linear algebraic equations on an mimd computer. J. ACM 30(1), 103-117 (1983)
    • (1983) J. ACM , vol.30 , Issue.1 , pp. 103-117
    • Lord, R.E.1    Kowalik, J.S.2    Kumar, S.P.3
  • 14
    • 45449117612 scopus 로고    scopus 로고
    • Agarwal, R.C., Gustavson, F.G.: A parallel implementation of matrix multiplication and LU factorization on the IBM 3090. In: Proceedings of the IFIP WG 2.5 Working Group on Aspects of Computation on Asychronous Parallel Processors, Stanford CA, Augest 22-26,1988, North Holland, Amsterdam (1988)
    • Agarwal, R.C., Gustavson, F.G.: A parallel implementation of matrix multiplication and LU factorization on the IBM 3090. In: Proceedings of the IFIP WG 2.5 Working Group on Aspects of Computation on Asychronous Parallel Processors, Stanford CA, Augest 22-26,1988, North Holland, Amsterdam (1988)
  • 15
    • 0034224207 scopus 로고    scopus 로고
    • Applying recursion to serial and parallel QR factorization leads to better performance
    • Elmroth, E., Gustavson, F.G.: Applying recursion to serial and parallel QR factorization leads to better performance. IBM Journal of Research and Development 44(4), 605 (2000)
    • (2000) IBM Journal of Research and Development , vol.44 , Issue.4 , pp. 605
    • Elmroth, E.1    Gustavson, F.G.2
  • 16
    • 0004236492 scopus 로고    scopus 로고
    • 3rd edn. Johns Hopkins University Press, Baltimore
    • Golub, G., Van Loan, C.: Matrix Computations, 3rd edn. Johns Hopkins University Press, Baltimore (1996)
    • (1996) Matrix Computations
    • Golub, G.1    Van Loan, C.2
  • 17
    • 0004094905 scopus 로고    scopus 로고
    • 1st edn, SIAM, Philadelphia
    • Stewart, G.W.: Matrix Algorithms, 1st edn., vol. 1. SIAM, Philadelphia (1998)
    • (1998) Matrix Algorithms , vol.1
    • Stewart, G.W.1
  • 18
    • 45449092245 scopus 로고
    • FORTRAN Subroutines for Out-of-Core Solutions of Large Complex Linear Systems
    • Technical Report CR-159142, NASA November
    • Yip, E.L.: FORTRAN Subroutines for Out-of-Core Solutions of Large Complex Linear Systems. Technical Report CR-159142, NASA (November 1979)
    • (1979)
    • Yip, E.L.1
  • 19
    • 45449110534 scopus 로고    scopus 로고
    • Updating an LU factorization with pivoting
    • Technical Report TR-2006-42, The University of Texas at Austin, Department of Computer Sciences , FLAME Working Note 21
    • Quintana-Orti, E., van de Geijn, R.: Updating an LU factorization with pivoting, Technical Report TR-2006-42, The University of Texas at Austin, Department of Computer Sciences (2006), FLAME Working Note 21
    • (2006)
    • Quintana-Orti, E.1    van de Geijn, R.2
  • 20
    • 17644368925 scopus 로고    scopus 로고
    • Parallel out-of-core computation and updating of the QR factorization
    • Gunter, B.C., van de Geijn, R.A.: Parallel out-of-core computation and updating of the QR factorization. ACM Trans. Math. Softw. 31(1), 60-78 (2005)
    • (2005) ACM Trans. Math. Softw , vol.31 , Issue.1 , pp. 60-78
    • Gunter, B.C.1    van de Geijn, R.A.2
  • 21
    • 0029358998 scopus 로고
    • A parallel algorithm for the reduction of a nonsymmetric matrix to block upper-hessenberg form
    • Berry, M.W., Dongarra, J.J., Kim, Y.: A parallel algorithm for the reduction of a nonsymmetric matrix to block upper-hessenberg form. Parallel Comput. 21(8), 1189-1211 (1995)
    • (1995) Parallel Comput , vol.21 , Issue.8 , pp. 1189-1211
    • Berry, M.W.1    Dongarra, J.J.2    Kim, Y.3
  • 22
    • 84947583789 scopus 로고    scopus 로고
    • Gustavson, F.G.: New generalized data structures for matrices lead to a variety of high performance algorithms. In: Wyrzykowski, R., Dongarra, J., Paprzycki, M., Waśniewski, J. (eds.) PPAM 2001. LNCS, 2328, pp. 418-436. Springer, Heidelberg (2002)
    • Gustavson, F.G.: New generalized data structures for matrices lead to a variety of high performance algorithms. In: Wyrzykowski, R., Dongarra, J., Paprzycki, M., Waśniewski, J. (eds.) PPAM 2001. LNCS, vol. 2328, pp. 418-436. Springer, Heidelberg (2002)
  • 23
    • 0001951009 scopus 로고
    • The WY representation for products of householder matrices
    • Bischof, C., van Loan, C.: The WY representation for products of householder matrices. SIAM J. Sci. Stat. Comput. 8(1), 2-13 (1987)
    • (1987) SIAM J. Sci. Stat. Comput , vol.8 , Issue.1 , pp. 2-13
    • Bischof, C.1    van Loan, C.2
  • 24
    • 0003078924 scopus 로고
    • A storage-efficient WY representation for products of Householder transformations
    • Schreiber, R., van Loan, C.: A storage-efficient WY representation for products of Householder transformations. SIAM J. Sci. Stat. Comput. 10(1), 53-57 (1989)
    • (1989) SIAM J. Sci. Stat. Comput , vol.10 , Issue.1 , pp. 53-57
    • Schreiber, R.1    van Loan, C.2
  • 25
    • 0001951009 scopus 로고
    • The WY representation for products of householder matrices
    • Bischof, C., van Loan, C.: The WY representation for products of householder matrices. SIAM J. Sci. Stat. Comput. 8(1), 2-13 (1987)
    • (1987) SIAM J. Sci. Stat. Comput , vol.8 , Issue.1 , pp. 2-13
    • Bischof, C.1    van Loan, C.2
  • 26
    • 45449098829 scopus 로고    scopus 로고
    • Buttari, A., Langou, J., Kurzak, J., Dongarra, J.: Parallel Tiled QR Factorization for Multicore Architectures. Technical Report UT-CS-07-598, University of Tennessee (2007), LAPACK Working Note 190
    • Buttari, A., Langou, J., Kurzak, J., Dongarra, J.: Parallel Tiled QR Factorization for Multicore Architectures. Technical Report UT-CS-07-598, University of Tennessee (2007), LAPACK Working Note 190


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.