2011, Pages 33-42

High performance matrix inversion based on LU factorization for multicore architectures

Author keywords

LU factorization; multicore parallel performance; runtime DAG scheduling

Indexed keywords

ALGORITHMIC APPROACH; COMPUTATIONAL TASK; DATA DEPENDENCIES; DATA FLOW; DATA LAYOUTS; DIRECTED ACYCLIC GRAPHS; EFFICIENT IMPLEMENTATION; ENERGY EFFICIENT; INVERSION PROCEDURE; LOOSE SYNCHRONIZATIONS; LU FACTORIZATION; MATRIX; MATRIX INVERSIONS; MULTI CORE; MULTICORE ARCHITECTURES; NUMERICAL LIBRARY; ON THE FLIES; PERFORMANCE IMPLEMENTATION; PERFORMANCE MATRICES; POWER CONSUMPTION ANALYSIS; PROCESSING UNITS; RUNTIME DAG SCHEDULING; RUNTIME ENVIRONMENTS; SCALAPACK; SQUARE MATRICES; SYNCHRONIZATION POINTS; SYSTEM MEMORY;

EID: 84857663656     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1145/2132876.2132885     Document Type: Conference Paper
Times cited: 23

References (38)
  • 10
    • 0031633853
    • Multistage linear DS-CDMA receivers
    • C. Boulanger and L. Ouvry. Multistage linear DS-CDMA receivers. In Proc. IEEE ISSSTA 98, volume 2, pages 663-667, Sept. 1998.
    • (1998) Proc. IEEE ISSSTA 98 , vol.2 , pp. 663-667
    • Boulanger, C.1    Ouvry, L.2
  • 11
    • 84857684600
    • A critical path approach to analyzing parallelism of algorithmic variants. Application to Cholesky inversion
    • H. Bouwmeester and J. Langou. A critical path approach to analyzing parallelism of algorithmic variants. Application to Cholesky inversion. Technical Report arXiv:1010.2000v1 [cs.DC], arXiv online archive, Oct 11 2010. Submitted to Parallel Computing. Available at arxiv.org/pdf/1010.2000.
    • (2010) Parallel Computing
    • Bouwmeester, H.1    Langou, J.2
  • 12
    • 38249038136
    • Solving the algebraic Riccati equation with the matrix sign function
    • R. Byers. Solving the algebraic Riccati equation with the matrix sign function. Linear Algebra and Appl., 85:267-279, 1987.
    • (1987) Linear Algebra and Appl. , vol.85 , pp. 267-279
    • Byers, R.1
  • 15
    • 0012087153
    • Stability of methods for matrix inversion
    • J. J. Du Croz and N. J. Higham. Stability of methods for matrix inversion. IMA J. Numer. Anal., 12, January 1992.
    • (1992) IMA J. Numer. Anal. , vol.12
    • Du Croz, J.J.1    Higham, N.J.2
  • 16
    • 0025401417
    • Set of Level 3 Basic Linear Algebra Subprograms. Model implementation and test programs
    • DOI 10.1145/77626.77627
    • J. Dongarra, J. Du Croz, I. Duff, and S. Hammarling. Algorithm 679: A set of Level 3 Basic Linear Algebra Subprograms. ACM Trans. Math. Soft., 16(1):18-28, March 1990. (Pubitemid 20684795)
    • (1990) ACM Transactions on Mathematical Software , vol.16 , Issue.1 , pp. 18-28
    • Dongarra, J.J.1    Du Croz, J.2    Hammarling, S.3    Duff, I.4
  • 19
    • 70350509502
    • Measurement matrix partitioning theorem
    • S. L. Fagin. Measurement matrix partitioning theorem. IEEE Trans. Autom. Control, AC-14(6):773-774, Dec. 1969.
    • (1969) IEEE Trans. Autom. Control , vol.AC-14 , Issue.6 , pp. 773-774
    • Fagin, S.L.1
  • 23
    • 84857677215
    • Hardware Locality: Peering under the hood of your server
    • B. Goglin, J. Squyres, and S. Thibault. Hardware Locality: Peering under the hood of your server. Linux Pro Magazine, 128:28-33, July 2011.
    • (2011) Linux Pro Magazine , vol.128 , pp. 28-33
    • Goglin, B.1    Squyres, J.2    Thibault, S.3
  • 24
    • 0004236492
    • G. H. Golub and C. F. Van Loan. Matrix Computations. Johns Hopkins Studies in the Mathematical Sciences. Johns Hopkins University Press, Baltimore, Maryland, third edition, 1996.
    • (1996) Matrix Computations
    • Golub, G.H.1    Van Loan, C.F.2
  • 25
    • 84857684603
    • Analysis of Dynamically Scheduled Tile Algorithms for Dense Linear Algebra on Multicore Architectures
    • A. Haidar, H. Ltaief, A. YarKhan, and J. J. Dongarra. Analysis of Dynamically Scheduled Tile Algorithms for Dense Linear Algebra on Multicore Architectures. ICL Technical Report UT-CS-11-666, LAPACK working note #243, Submitted to Concurrency and Computations, 2010.
    • (2010) Concurrency and Computations
    • Haidar, A.1    Ltaief, H.2    YarKhan, A.3    Dongarra, J.J.4
  • 26
    • 0001692403
    • Computing the polar decomposition - With applications
    • N. J. Higham. Computing the polar decomposition - with applications. SIAM J. Sci. Stat. Comput., 7:1160-1174, 1986.
    • (1986) SIAM J. Sci. Stat. Comput. , vol.7 , pp. 1160-1174
    • Higham, N.J.1
  • 27
    • 84947688547
    • Applications of Jordan's procedure for matrix inversion in multiple regression and multivariate distance analysis
    • G. H. Jowett. Applications of Jordan's procedure for matrix inversion in multiple regression and multivariate distance analysis. Journal of the Royal Statistical Society. Series B (Methodological), 25(2):352-357, 1963.
    • (1963) Journal of the Royal Statistical Society. Series B (Methodological) , vol.25 , Issue.2 , pp. 352-357
    • Jowett, G.H.1
  • 28
    • 0032490773
    • Simplified polynomial-expansion linear detectors for DS-CDMA systems
    • Z. Lei and T. Lim. Simplified polynomial-expansion linear detectors for DS-CDMA systems. IEEE Elec. Lett., 34(16):1561-1563, 1998. (Pubitemid 128610604)
    • (1998) Electronics Letters , vol.34 , Issue.16 , pp. 1561-1563
    • Lei, Z.D.1    Lim, T.J.2
  • 31
    • 84857662501
    • Math Kernel Library (MKL).
    • Intel, Math Kernel Library (MKL). http://www.intel.com/software/products/mkl/.
  • 33
    • 39549085762
    • Suboptimum search algorithm in conjunction with polynomial expanded multiuser detection for uplink
    • R. T. M. Mozaffaripour. Suboptimum search algorithm in conjunction with polynomial expanded multiuser detection for uplink. Wireless Personal Comm., 24(1):1-9, 2003.
    • (2003) Wireless Personal Comm. , vol.24 , Issue.1 , pp. 1-9
    • Mozaffaripour, R.T.M.1
  • 35
    • 0022026625
    • Analysis of pairwise pivoting in Gaussian elimination
    • DOI: 10.1109/TC.1985.1676570
    • D. C. Sorensen. Analysis of pairwise pivoting in Gaussian elimination. IEEE Transactions on Computers, C-34(3):274-278, 1985. DOI: 10.1109/TC.1985.1676570.
    • (1985) IEEE Transactions on Computers , vol.C-34 , Issue.3 , pp. 274-278
    • Sorensen, D.C.1
  • 36
    • 0342444575
    • Matrix-inversion method: Applications to Möbius inversion deconvolution
    • Q. Xie and N.-X. Chen. Matrix-inversion method: Applications to Möbius inversion deconvolution. Physical Review E, 52(6):6055-6065, December 1995.
    • (1995) Physical Review E , vol.52 , Issue.6 , pp. 6055-6065
    • Xie, Q.1    Chen, N.-X.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.