Volume 18, Issue 1, 2010, Pages 35-50

Scheduling two-sided transformations using tile algorithms on multicore architectures

Author keywords

Linear algebra; Matrix factorization; Multicore; Scheduling; Two sided transformations

Indexed keywords

ALGEBRA; COMPUTER ARCHITECTURE; DATA FLOW ANALYSIS; EIGENVALUES AND EIGENFUNCTIONS; FACTORIZATION; LINEAR ALGEBRA; LINEAR TRANSFORMATIONS; MATHEMATICAL TRANSFORMATIONS; MATRIX ALGEBRA; PARALLEL PROCESSING SYSTEMS; PROGRAM PROCESSORS; SCHEDULING; SINGULAR VALUE DECOMPOSITION; SOFTWARE ARCHITECTURE;

EID: 77951935506     PISSN: 10589244     EISSN: None     Source Type: Journal    
DOI: 10.3233/SPR-2010-0297     Document Type: Article
Times cited: 6

References (35)
  • 4
    • 12444316073
    • A new stable bidiagonal reduction algorithm
    • DOI 10.1016/j.laa.2004.09.019, PII S0024379504004276
    • J. L. Barlow, N. Bosner and Z. Drmač, A new stable bidiagonal reduction algorithm, Linear Algebra Appl. 397 (1) (2005), 35-84. (Pubitemid 40146312)
    • (2005) Linear Algebra and Its Applications , vol.397 , Issue.1-3 , pp. 35-84
    • Barlow, J.L.1    Bosner, N.2    Drmac, Z.3
  • 6
    • 48249107440
    • Block and parallel versions of one-sided bidiagonalization
    • N. Bosner and J. L. Barlow, Block and parallel versions of one-sided bidiagonalization, SIAM J. Matrix Anal. Appl. 29 (3) (2007), 927-953.
    • (2007) SIAM J. Matrix Anal. Appl , vol.29 , Issue.3 , pp. 927-953
    • Bosner, N.1    Barlow, J.L.2
  • 7
    • 77951890128
    • Multithreading for synchronization tolerance in matrix factorization
    • Boston, MA, IOP Publishing, June 24-28, 2007. (J. Phys.: Conference Series 78 012-028.)
    • A. Buttari, J. J. Dongarra, P. Husbands, J. Kurzak and K. Yelick, Multithreading for synchronization tolerance in matrix factorization, in: Scientific Discovery Through Advanced Computing, SciDAC 2007, Boston, MA, IOP Publishing, June 24-28, 2007. (J. Phys.: Conference Series 78 012-028.)
    • (2007) Scientific Discovery Through Advanced Computing, SciDAC
    • Buttari, A.1    Dongarra, J.J.2    Husbands, P.3    Kurzak, J.4    Yelick, K.5
  • 10
    • 58149269099
    • A class of parallel tiled linear algebra algorithms for multicore architectures
    • A. Buttari, J. Langou, J. Kurzak and J. J. Dongarra, A class of parallel tiled linear algebra algorithms for multicore architectures, Parallel Comput. Syst. Appl. 35 (2009), 38-53.
    • (2009) Parallel Comput. Syst. Appl , vol.35 , pp. 38-53
    • Buttari, A.1    Langou, J.2    Kurzak, J.3    Dongarra, J.J.4
  • 12
    • 33847379878
    • Estimating and correcting global weather model error
    • DOI 10.1175/MWR3289.1
    • C. M. Danforth, E. Kalnay and T. Miyoshi, Estimating and correcting global weather model error, Mon. Weather Rev. 135 (2) (2007), 281-299. (Pubitemid 46344360)
    • (2007) Monthly Weather Review , vol.135 , Issue.2 , pp. 281-299
    • Danforth, C.M.1    Kalnay, E.2    Miyoshi, T.3
  • 14
    • 0034224207
    • Applying recursion to serial and parallel QR factorization leads to better performance
    • E. Elmroth and F. G. Gustavson, Applying recursion to serial and parallel QR factorization leads to better performance, IBM J. Res. Dev. 44 (4) (2000), 605-624.
    • (2000) IBM J. Res. Dev. , vol.44 , Issue.4 , pp. 605-624
    • Elmroth, E.1    Gustavson, F.G.2
  • 16
    • 1842832833
    • Recursive blocked algorithms and hybrid data structures for dense matrix library software
    • E. Elmroth, F. G. Gustavson, I. Jonsson and B. Kågström, Recursive blocked algorithms and hybrid data structures for dense matrix library software, SIAM Rev. 46 (1) (2004), 3-45.
    • (2004) SIAM Rev , vol.46 , Issue.1 , pp. 3-45
    • Elmroth, E.1    Gustavson, F.G.2    Jonsson, I.3    Kågström, B.4
  • 17
    • 0004236492
    • 3rd edn, Johns Hopkins University Press, Baltimore, MD
    • G. H. Golub and C. F. van Loan, Matrix Computations, 3rd edn, Johns Hopkins University Press, Baltimore, MD, 1996.
    • (1996) Matrix Computations
    • Golub, G.H.1    Van Loan, C.F.2
  • 18
    • 17644368925
    • Parallel out-of-core computation and updating of the QR factorization
    • B. C. Gunter and R. A. van de Geijn, Parallel out-of-core computation and updating of the QR factorization, ACM Trans. Math. Software 31 (1) (2005), 60-78.
    • (2005) ACM Trans. Math. Software , vol.31 , Issue.1 , pp. 60-78
    • Gunter, B.C.1    Van De Geijn, R.A.2
  • 21
    • 0033297112
    • A parallel algorithm for the reduction to tridiagonal form for eigendecomposition
    • M. Hegland, M. Kahn and M. Osborne, A parallel algorithm for the reduction to tridiagonal form for eigendecomposition, SIAM J. Sci. Comput. 21 (3) (1999), 987-1005.
    • (1999) SIAM J. Sci. Comput , vol.21 , Issue.3 , pp. 987-1005
    • Hegland, M.1    Kahn, M.2    Osborne, M.3
  • 22
    • 49349111725
    • Solving systems of linear equations on the CELL processor using Cholesky factorization
    • J. Kurzak, A. Buttari and J. J. Dongarra, Solving systems of linear equations on the CELL processor using Cholesky factorization, IEEE Trans. Parallel Distrib. Syst. 19 (9) (2008), 1175-1186.
    • (2008) IEEE Trans. Parallel Distrib. Syst , vol.19 , Issue.9 , pp. 1175-1186
    • Kurzak, J.1    Buttari, A.2    Dongarra, J.J.3
  • 24
    • 74549205359
    • QR factorization for the CELL processor
    • May
    • J. Kurzak and J. Dongarra, QR Factorization for the CELL processor, LAPACK Working Note 201, May 2008.
    • (2008) LAPACK Working Note , vol.201
    • Kurzak, J.1    Dongarra, J.2
  • 26
    • 0020593101
    • Solving linear algebraic equations on an MIMD computer
    • DOI 10.1145/322358.322366
    • R. E. Lord, J. S. Kowalik and S. P. Kumar, Solving linear algebraic equations on an MIMD computer, J. ACM 30 (1) (1983), 103-117. (Pubitemid 13504813)
    • (1983) Journal of the ACM , vol.30 , Issue.1 , pp. 103-117
    • Lord, R.E.1    Kowalik, J.S.2    Kumar, S.P.3
  • 28
    • 0042235298
    • Tiling, block data layout, and memory hierarchy performance
    • N. Park, B. Hong and V. K. Prasanna, Tiling, block data layout, and memory hierarchy performance, IEEE Trans. Parallel Distrib. Syst. 14 (7) (2003), 640-654.
    • (2003) IEEE Trans. Parallel Distrib. Syst , vol.14 , Issue.7 , pp. 640-654
    • Park, N.1    Hong, B.2    Prasanna, V.K.3
  • 29
    • 57949083229
    • A dependency-aware task-based programming environment for multi-core architectures
    • Piscataway, NJ
    • J. M. Pérez, R. M. Badia and J. Labarta, A dependency-aware task-based programming environment for multi-core architectures, in: CLUSTER, IEEE, Piscataway, NJ, 2008, pp. 142-151.
    • (2008) CLUSTER, IEEE , pp. 142-151
    • Pérez, J.M.1    Badia, R.M.2    Labarta, J.3
  • 30
    • 35649006026
    • CellSs: Making it easier to program the cell broadband engine processor
    • DOI 10.1147/rd.515.0593
    • J. M. Perez, P. Bellens, R. M. Badia and J. Labarta, CellSs: making it easier to program the Cell Broadband Engine processor, IBM J. Res. Dev. 51 (5) (2007), 593-604. (Pubitemid 350031358)
    • (2007) IBM Journal of Research and Development , vol.51 , Issue.5 , pp. 593-604
    • Perez, J.M.1    Bellens, P.2    Badia, R.M.3    Labarta, J.4
  • 31
    • 85021253844
    • PIRO-BAND: PIpelined ROtations for BAnd Reduction, available at
    • PIRO-BAND: PIpelined ROtations for BAnd Reduction, available at: http://www.cise.ufl.edu/~srajaman/.
  • 33
    • 0003078924
    • A storage efficient WY representation for products of Householder transformations
    • R. Schreiber and C. van Loan, A storage efficient WY representation for products of Householder transformations, SIAM J. Sci. Statist. Comput. 10 (1989), 53-57.
    • (1989) SIAM J. Sci. Statist. Comput , vol.10 , pp. 53-57
    • Schreiber, R.1    Van Loan, C.2
  • 34
    • 85021229732
    • SMP Superscalar (SMPSs) User's Manual, Version 2.0, Barcelona Supercomputing Center
    • SMP Superscalar (SMPSs) User's Manual, Version 2.0, Barcelona Supercomputing Center, 2008.
    • (2008)
  • 35
    • 0004554167
    • Numerical linear algebra
    • Philadelphia, PA
    • L. N. Trefethen and D. Bau, Numerical Linear Algebra, SIAM, Philadelphia, PA, 1997.
    • (1997) SIAM
    • Trefethen, L.N.1    Bau, D.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.