메뉴 건너뛰기




Volumn 20, Issue 13, 2008, Pages 1573-1590

Parallel tiled QR factorization for multicore architectures

Author keywords

Linear algebra; Multicore; QR factorization

Indexed keywords

LINEAR ALGEBRA; PARALLEL ARCHITECTURES; SOFTWARE ARCHITECTURE;

EID: 50249105132     PISSN: 15320626     EISSN: 15320634     Source Type: Journal    
DOI: 10.1002/cpe.1301     Document Type: Article
Times cited : (109)

References (28)
  • 1
    • 50249141640 scopus 로고    scopus 로고
    • 24 July 2007
    • http://top500.org [24 July 2007].
  • 3
    • 50249133005 scopus 로고    scopus 로고
    • Teraflops research chip, 24 July 2007
    • Teraflops research chip. http://www.intel.com/research/platform/ terascale/teraflops.htm [24 July 2007].
  • 6
    • 0343462141 scopus 로고    scopus 로고
    • Automated empirical optimization of software and the ATLAS project
    • l-2:3-25
    • Whaley RC, Petitet A, Dongarra J. Automated empirical optimization of software and the ATLAS project. Parallel Computing 2001; 27(l-2):3-25.
    • (2001) Parallel Computing , vol.27
    • Whaley, R.C.1    Petitet, A.2    Dongarra, J.3
  • 7
    • 34548762396 scopus 로고    scopus 로고
    • High-performance implementation of the level-3 bias
    • Technical Report TR-2006-23, Department of Computer Sciences, The University of Texas at Austin, FLAME Working Note 20
    • Goto K, van de Geijn R. High-performance implementation of the level-3 bias. Technical Report TR-2006-23, Department of Computer Sciences, The University of Texas at Austin, 2006. FLAME Working Note 20.
    • (2006)
    • Goto, K.1    van de Geijn, R.2
  • 8
    • 50249110532 scopus 로고    scopus 로고
    • 24 July 2007
    • http://www.intel.com/cd/software/products/asmo-na/eng/307757.htm [24 July 2007].
  • 9
    • 50249118960 scopus 로고    scopus 로고
    • 24 July 2007
    • http://developer.amd.com/acml.jsp [24 July 2007].
  • 10
    • 50249118105 scopus 로고    scopus 로고
    • International Organization for Standardization. Informational Technology - Portable Operating System Interface (POSIX) - Part 1: System Application Program Interface (API) [C Language], ISO: Adr, 19%; 743. http://www.iso. ch/cate/d24426.html [24 July 2007].
    • International Organization for Standardization. Informational Technology - Portable Operating System Interface (POSIX) - Part 1: System Application Program Interface (API) [C Language], ISO: Adr, 19%; 743. http://www.iso. ch/cate/d24426.html [24 July 2007].
  • 11
    • 0002806690 scopus 로고    scopus 로고
    • OpenMP: An industry-standard API for shared-memory programming
    • Dagum L. Menon R. OpenMP: An industry-standard API for shared-memory programming. IEEE Computational Science and Engineering 1998; 5(1):46-55.
    • (1998) IEEE Computational Science and Engineering , vol.5 , Issue.1 , pp. 46-55
    • Dagum, L.1    Menon, R.2
  • 12
    • 84947808952 scopus 로고    scopus 로고
    • Choi J, Dongarra J, Ostrouchov S, Petitet A, Walker DW, Clinton Whaley R. A proposal for a set of parallel basic linear algebra subprograms. PARA '95: Proceedings of the Second International Workshop on Applied Parallel Computing, Computations in Physics, Chemistry and Engineering Science, London, U.K., 1996. Springer: Berlin, 19%; 107-114.
    • Choi J, Dongarra J, Ostrouchov S, Petitet A, Walker DW, Clinton Whaley R. A proposal for a set of parallel basic linear algebra subprograms. PARA '95: Proceedings of the Second International Workshop on Applied Parallel Computing, Computations in Physics, Chemistry and Engineering Science, London, U.K., 1996. Springer: Berlin, 19%; 107-114.
  • 13
    • 50249129153 scopus 로고    scopus 로고
    • Message passing interface Forum. MPI: A message-passing interface standard. The International Journal of Supercomputer Applications and High Performance Computing 1994; 8:165-414.
    • Message passing interface Forum. MPI: A message-passing interface standard. The International Journal of Supercomputer Applications and High Performance Computing 1994; 8:165-414.
  • 14
    • 38049058008 scopus 로고    scopus 로고
    • The impact of multicore on math software
    • Proceedings of Workshop on State-of-the-art in Scientific and Parallel Computing Para06, Umeå, Sweden
    • Buttari A, Dongarra J, Kurzak J, Langou J, Luszczek P, Tomov S. The impact of multicore on math software. Proceedings of Workshop on State-of-the-art in Scientific and Parallel Computing (Para06). Springer's Lecture Notes in Computer Science 4699, Umeå, Sweden, 2007; 1-10.
    • (2007) Springer's Lecture Notes in Computer Science , vol.4699 , pp. 1-10
    • Buttari, A.1    Dongarra, J.2    Kurzak, J.3    Langou, J.4    Luszczek, P.5    Tomov, S.6
  • 16
    • 38049005629 scopus 로고    scopus 로고
    • Implementing linear algebra routines on multicore processors with pipelining and a look ahead
    • Proceedings of Workshop on State-of-the-art in Scientific and Parallel Computing Para06, Umeå, Sweden
    • Kurzak J, Dongarra J. Implementing linear algebra routines on multicore processors with pipelining and a look ahead. Proceedings of Workshop on State-of-the-art in Scientific and Parallel Computing (Para06). Springer's Lecture Notes in Computer Science 4699, Umeå, Sweden, 2007; 147-156.
    • (2007) Springer's Lecture Notes in Computer Science , vol.4699 , pp. 147-156
    • Kurzak, J.1    Dongarra, J.2
  • 17
    • 50249166476 scopus 로고    scopus 로고
    • Solving systems of linear equations on the CELL processor using Cholesky factorization
    • Technical Report VT-CS-07-596, Innovative Computing Laboratory, University of Tennessee, Knoxville, April
    • Kurzak J, Buttari A, Dongarra J. Solving systems of linear equations on the CELL processor using Cholesky factorization. Technical Report VT-CS-07-596, Innovative Computing Laboratory, University of Tennessee, Knoxville, April 2007.
    • (2007)
    • Kurzak, J.1    Buttari, A.2    Dongarra, J.3
  • 18
    • 0034224207 scopus 로고    scopus 로고
    • Applying recursion to serial and parallel QR factorization leads to better performance
    • Elmroth E, Gustavson FG. Applying recursion to serial and parallel QR factorization leads to better performance. IBM Journal of Research and Development 2000; 44(4):605-624.
    • (2000) IBM Journal of Research and Development , vol.44 , Issue.4 , pp. 605-624
    • Elmroth, E.1    Gustavson, F.G.2
  • 19
    • 0004236492 scopus 로고    scopus 로고
    • 3rd edn, Johns Hopkins University Press: Baltimore, MD, 19
    • Golub G, Van Loan C. Matrix Computations (3rd edn). Johns Hopkins University Press: Baltimore, MD, 19%.
    • Matrix Computations
    • Golub, G.1    Van Loan, C.2
  • 20
    • 0004094905 scopus 로고    scopus 로고
    • 1st edn, SIAM: Philadelphia, PA
    • Stewart GW. Matrix Algorithms (1st edn), vol. 1. SIAM: Philadelphia, PA, 1998.
    • (1998) Matrix Algorithms , vol.1
    • Stewart, G.W.1
  • 22
  • 23
    • 45449092245 scopus 로고
    • FORTRAN subroutines for out-of-core solutions of large complex linear systems
    • Technical Report CR-159I42, NASA, November
    • Yip EL. FORTRAN subroutines for out-of-core solutions of large complex linear systems. Technical Report CR-159I42, NASA, November 1979.
    • (1979)
    • Yip, E.L.1
  • 25
    • 45449110534 scopus 로고    scopus 로고
    • Updating an LU factorization with pivoting
    • Technical Report TR-2006-42, Department of Computer Sciences, The University of Texas at Austin, FLAME Working Note 21
    • Quintana-Orti E, van de Geijn R. Updating an LU factorization with pivoting. Technical Report TR-2006-42, Department of Computer Sciences, The University of Texas at Austin, 2006. FLAME Working Note 21.
    • (2006)
    • Quintana-Orti, E.1    van de Geijn, R.2
  • 26
    • 0029358998 scopus 로고
    • A parallel algorithm for the reduction of a nonsymmetric matrix to block upper-Hessenberg form
    • Berry MW, Dongarra JJ, Kim Y. A parallel algorithm for the reduction of a nonsymmetric matrix to block upper-Hessenberg form. Parallel Computation 1995; 21(8): 1189-1211.
    • (1995) Parallel Computation , vol.21 , Issue.8 , pp. 1189-1211
    • Berry, M.W.1    Dongarra, J.J.2    Kim, Y.3
  • 28
    • 50249182748 scopus 로고    scopus 로고
    • SMP Superscalar (SMPSs) User's Manual, July 2007. www.bsc.es/media/1002. pdf [24 July 2007].
    • SMP Superscalar (SMPSs) User's Manual, July 2007. www.bsc.es/media/1002. pdf [24 July 2007].


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.