메뉴 건너뛰기




Volumn 21, Issue 18, 2009, Pages 2438-2456

Parallelizing dense and banded linear algebra libraries using SMPSs

Author keywords

Dynamic scheduling; High performance; Linear algebra libraries; Multi core processors; Programmability

Indexed keywords

APPLICATION PROGRAMMING INTERFACES (API); DIGITAL STORAGE; LINEAR ALGEBRA;

EID: 73349095700     PISSN: 15320626     EISSN: 15320634     Source Type: Journal    
DOI: 10.1002/cpe.1463     Document Type: Article
Times cited : (64)

References (28)
  • 3
    • 58149269099 scopus 로고    scopus 로고
    • A class of parallel tiled linear algebra algorithms for multicore architectures
    • Buttari A, Langou J, Kurzak J, Dongarra J. A class of parallel tiled linear algebra algorithms for multicore architectures. Parallel Computing 2009; 35(1): 38-53.
    • (2009) Parallel Computing , vol.35 , Issue.1 , pp. 38-53
    • Buttari, A.1    Langou, J.2    Kurzak, J.3    Dongarra, J.4
  • 9
    • 73349130534 scopus 로고    scopus 로고
    • Quintana-Ortí G, Quintana-Ortí ES,Remón A, van de Geijn R. Supermatrix for the factorization of band matrices. FLAME Working Note #27 TR-07-51, The University of Texas at Austin, Department of Computer Sciences, September 2007.
    • Quintana-Ortí G, Quintana-Ortí ES,Remón A, van de Geijn R. Supermatrix for the factorization of band matrices. FLAME Working Note #27 TR-07-51, The University of Texas at Austin, Department of Computer Sciences, September 2007.
  • 13
    • 70350666900 scopus 로고    scopus 로고
    • A flexible and portable programming model for SMP and multi-cores
    • Technical Report 03/, Barcelona Supercomputing Center, Centro Nacional de Supercomputacion, Barcelona, Spain
    • Pérez JM, Badia RM, Labarta J. A flexible and portable programming model for SMP and multi-cores. Technical Report 03/2007, Barcelona Supercomputing Center - Centro Nacional de Supercomputacion, Barcelona, Spain, 2007.
    • (2007) , pp. 2007
    • Pérez, J.M.1    Badia, R.M.2    Labarta, J.3
  • 14
    • 57949083229 scopus 로고    scopus 로고
    • Pérez JM, Badia RM, Labarta J. A dependency-aware task-based programming environment for multi-core architectures. Proceedings of the 2008 IEEE International Conference on Cluster Computing, Causal Productions (ed.). September 2008; 142-151. IEEE Catalog Number CFP08235-CDR.
    • Pérez JM, Badia RM, Labarta J. A dependency-aware task-based programming environment for multi-core architectures. Proceedings of the 2008 IEEE International Conference on Cluster Computing, Causal Productions (ed.). September 2008; 142-151. IEEE Catalog Number CFP08235-CDR.
  • 15
    • 0004236492 scopus 로고    scopus 로고
    • 3rd edn, The Johns Hopkins University Press: Baltimore, MD
    • Golub GH, Van Loan CF. Matrix Computations (3rd edn). The Johns Hopkins University Press: Baltimore, MD, 1996.
    • (1996) Matrix Computations
    • Golub, G.H.1    Van Loan, C.F.2
  • 19
    • 65849272637 scopus 로고    scopus 로고
    • A comparison of lookahead and algorithmic blocking techniques for parallel matrix factorization
    • Technical Report TR-CS-98-07, Department of Computer Science, The Australian National University, Canberra 0200 ACT, Australia
    • Strazdins P. A comparison of lookahead and algorithmic blocking techniques for parallel matrix factorization. Technical Report TR-CS-98-07, Department of Computer Science, The Australian National University, Canberra 0200 ACT, Australia, 1998.
    • (1998)
    • Strazdins, P.1
  • 20
    • 0032155271 scopus 로고    scopus 로고
    • GEMM-based level 3 BLAS: High-performance model implementations and performance evaluation benchmark
    • Kågström B, Ling P, Loan CV. GEMM-based level 3 BLAS: High-performance model implementations and performance evaluation benchmark. ACM Transactions on Mathematical Software 1998; 24(3): 268-302.
    • (1998) ACM Transactions on Mathematical Software , vol.24 , Issue.3 , pp. 268-302
    • Kågström, B.1    Ling, P.2    Loan, C.V.3
  • 21
    • 0032155342 scopus 로고    scopus 로고
    • Algorithm 784: GEMM-based level 3 BLAS: portability and optimization issues
    • Kågström B, Ling P, Loan CV. Algorithm 784: GEMM-based level 3 BLAS: portability and optimization issues. ACM Transactions on Mathematical Software 1998; 24(3): 303-316.
    • (1998) ACM Transactions on Mathematical Software , vol.24 , Issue.3 , pp. 303-316
    • Kågström, B.1    Ling, P.2    Loan, C.V.3
  • 23
    • 33745328323 scopus 로고    scopus 로고
    • Rapid development of high-performance out-of-core solvers
    • Proceedings of PARA 2004, Springer: Berlin, Heidelberg
    • Joffrain T, Quintana-Ortí ES, van de Geijn RA. Rapid development of high-performance out-of-core solvers. Proceedings of PARA 2004, Lecture Notes in Computer Science, vol. 3732. Springer: Berlin, Heidelberg, 2005 ; 413-422.
    • (2005) Lecture Notes in Computer Science , vol.3732 , pp. 413-422
    • Joffrain, T.1    Quintana-Ortí, E.S.2    van de Geijn, R.A.3
  • 24
    • 85121159302 scopus 로고    scopus 로고
    • Quintana-Ortí ES, van de Geijn R. Updating an LU factorization with pivoting. ACM Transactions on Mathematical Software 2008; 35(2): 11: 1-11: 16.
    • Quintana-Ortí ES, van de Geijn R. Updating an LU factorization with pivoting. ACM Transactions on Mathematical Software 2008; 35(2): 11: 1-11: 16.
  • 25
    • 73349124198 scopus 로고    scopus 로고
    • Gustavson FG. New generalized matrix data structures lead to a variety of high-performance algorithms. The Architecture of Scientific Software, Boisvert RF, Tang PTP (eds.), 188 of IFIP Conference Proceedings. Kluwer: Dordrecht, 2000; 211-234.
    • Gustavson FG. New generalized matrix data structures lead to a variety of high-performance algorithms. The Architecture of Scientific Software, Boisvert RF, Tang PTP (eds.), vol. 188 of IFIP Conference Proceedings. Kluwer: Dordrecht, 2000; 211-234.
  • 28
    • 47349106165 scopus 로고    scopus 로고
    • An API for manipulating matrices stored by blocks
    • Technical Report TR-2004-15, Department of Computer Sciences, The University of Texas at Austin, May
    • Low TM, van de Gejin R. An API for manipulating matrices stored by blocks. Technical Report TR-2004-15, Department of Computer Sciences, The University of Texas at Austin, May 2004.
    • (2004)
    • Low, T.M.1    van de Gejin, R.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.