



2011, Pages 944-955

Two-stage tridiagonal reduction for dense symmetric matrices using tile algorithms on multicore architectures

Author keywords

Bulge Chasing; Scheduling; Tile Algorithms; Translation Layer; Tridiagonal Reduction

Indexed keywords

BIDIAGONAL; BULGE CHASING; CHOLESKY FACTORIZATIONS; DATA LOCALITY; EFFICIENT IMPLEMENTATION; MATRIX; MATRIX SIZE; MULTICORE ARCHITECTURES; MULTITHREADED; NUMERICAL LIBRARY; NUMERICAL SOFTWARE; PERFORMANCE DATA; PROCESSOR-MEMORY; RESEARCH PROBLEMS; RUNTIME SYSTEMS; SPECTRAL DECOMPOSITION; SYMMETRIC MATRICES; TRIDIAGONAL; TWO STAGE;

EID: 80053252490     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/IPDPS.2011.91     Document Type: Conference Paper
Times cited: 34

References (43)
  • 4
    • 80053278288
    • "The FLAME project," April 2010, http://z.cs.utexas.edu/wiki/flame.wiki/FrontPage.
    • (2010) The FLAME Project
  • 5
    • 0004236492
    • 3rd ed. Baltimore, MD: Johns Hopkins University Press
    • G. Golub and C. van Loan, Matrix Computations, 3rd ed. Baltimore, MD: Johns Hopkins University Press, 1996.
    • (1996) Matrix Computations
    • Golub, G.1    Van Loan, C.2
  • 7
    • 80053289864
    • ParaGauss: The Density Functional Program ParaGauss for Complex Systems in Chemistry
    • Springer, DOI: 10.1007/3-540-28555-5-25.
    • N. Rösch, S. Krüger, V. Nasluzov, and A. Matveev, "ParaGauss: The Density Functional Program ParaGauss for Complex Systems in Chemistry," in High Performance Computing in Science and Engineering, Garching 2004, part III. Springer, 2005, pp. 285-296, DOI: 10.1007/3-540-28555-5-25.
    • (2005) High Performance Computing in Science and Engineering, Garching 2004 , Issue.PART III , pp. 285-296
    • Rösch, N.1    Krüger, S.2    Nasluzov, V.3    Matveev, A.4
  • 8
    • 0542421948
    • The solution of large dense generalized eigenvalue problems on the Cray X-MP/24 with SSD
    • R. Grimes, H. Krakauer, J. Lewis, H. Simon, and S.-H. Wei, "The solution of large dense generalized eigenvalue problems on the Cray X-MP/24 with SSD," J. Comput. Phys., vol. 69, pp. 471-481, April 1987. [Online]. Available: http://portal.acm.org/citation.cfm?id=32855.32865
    • (1987) J. Comput. Phys. , vol.69 , pp. 471-481
    • Grimes, R.1    Krakauer, H.2    Lewis, J.3    Simon, H.4    Wei, S.-H.5
  • 14
    • 0012881041
    • Algorithm 807: The SBR Toolbox - Software for successive band reduction
    • C. H. Bischof, B. Lang, and X. Sun, "Algorithm 807: The SBR Toolbox - software for successive band reduction," ACM Trans. Math. Softw., vol. 26, no. 4, pp. 602-616, 2000.
    • (2000) ACM Trans. Math. Softw. , vol.26 , Issue.4 , pp. 602-616
    • Bischof, C.H.1    Lang, B.2    Sun, X.3
  • 18
    • 77957870814
    • Accelerating the reduction to upper Hessenberg, tridiagonal, and bidiagonal forms through hybrid GPU-based computing
    • DOI: 10.1016/j.parco.2010.06.001
    • S. Tomov, R. Nath, and J. Dongarra, "Accelerating the reduction to upper Hessenberg, tridiagonal, and bidiagonal forms through hybrid GPU-based computing," Parallel Computing, 2010, DOI: 10.1016/j.parco.2010.06.001.
    • (2010) Parallel Computing
    • Tomov, S.1    Nath, R.2    Dongarra, J.3
  • 20
    • 50249105132
    • Parallel Tiled QR Factorization for Multicore Architectures
    • DOI: 10.1002/cpe.1301.
    • A. Buttari, J. Langou, J. Kurzak, and J. J. Dongarra, "Parallel Tiled QR Factorization for Multicore Architectures," Concurrency Computat.: Pract. Exper., vol. 20, no. 13, pp. 1573-1590, 2008, http://dx.doi.org/10.1002/cpe.1301 DOI: 10.1002/cpe.1301.
    • (2008) Concurrency Computat.: Pract. Exper. , vol.20 , Issue.13 , pp. 1573-1590
    • Buttari, A.1    Langou, J.2    Kurzak, J.3    Dongarra, J.J.4
  • 21
    • 58149269099
    • A Class of Parallel Tiled Linear Algebra Algorithms for Multicore Architectures
    • DOI: 10.1016/j.parco.2008.10.002
    • A. Buttari, J. Langou, J. Kurzak, and J. J. Dongarra, "A Class of Parallel Tiled Linear Algebra Algorithms for Multicore Architectures," Parallel Comput. Syst. Appl., vol. 35, pp. 38-53, 2009, http://dx.doi.org/10.1016/j.parco.2008.10.002 DOI: 10.1016/j.parco.2008.10.002.
    • (2009) Parallel Comput. Syst. Appl. , vol.35 , pp. 38-53
    • Buttari, A.1    Langou, J.2    Kurzak, J.3    Dongarra, J.J.4
  • 22
    • 48849086742
    • Updating an LU Factorization with Pivoting
    • DOI: 10.1145/1377612.1377615
    • E. S. Quintana-Ortí and R. A. van de Geijn, "Updating an LU Factorization with Pivoting," ACM Trans. Math. Softw., vol. 35, no. 2, p. 11, 2008, http://doi.acm.org/10.1145/1377612.1377615 DOI: 10.1145/1377612.1377615.
    • (2008) ACM Trans. Math. Softw. , vol.35 , Issue.2 , p. 11
    • Quintana-Ortí, E.S.1    Van De Geijn, R.A.2
  • 23
    • 49349111725
    • Solving systems of linear equation on the CELL processor using Cholesky factorization
    • DOI: 10.1109/TPDS.2007.70813
    • J. Kurzak, A. Buttari, and J. J. Dongarra, "Solving systems of linear equation on the CELL processor using Cholesky factorization," Trans. Parallel Distrib. Syst., vol. 19, no. 9, pp. 1175-1186, 2008, http://dx.doi.org/10.1109/TPDS.2007.70813 DOI: 10.1109/TPDS.2007.70813.
    • (2008) Trans. Parallel Distrib. Syst. , vol.19 , Issue.9 , pp. 1175-1186
    • Kurzak, J.1    Buttari, A.2    Dongarra, J.J.3
  • 24
    • 80053238375
    • QR factorization for the CELL processor
    • DOI: 10.3233/SPR-2008-0268
    • J. Kurzak and J. J. Dongarra, "QR factorization for the CELL processor," Scientific Programming, vol. 17, pp. 1-12, 2008, http://dx.doi.org/10.3233/SPR-2008-0268 DOI: 10.3233/SPR-2008-0268.
    • (2008) Scientific Programming , vol.17 , pp. 1-12
    • Kurzak, J.1    Dongarra, J.J.2
  • 26
    • 74049122130
    • Parallel block Hessenberg reduction using algorithms-by-tiles for multicore architectures revisited
    • UT-CS-08-624 (also LAPACK Working Note 208)
    • H. Ltaief, J. Kurzak, and J. Dongarra, "Parallel block Hessenberg reduction using algorithms-by-tiles for multicore architectures revisited," UT-CS-08-624 (also LAPACK Working Note 208), 2008.
    • (2008) LAPACK Working Note 208
    • Ltaief, H.1    Kurzak, J.2    Dongarra, J.3
  • 28
    • 33244454775
    • A Taxonomy of Workflow Management Systems for Grid Computing
    • J. Yu and R. Buyya, "A Taxonomy of Workflow Management Systems for Grid Computing," Journal of Grid Computing, 2005.
    • (2005) Journal of Grid Computing
    • Yu, J.1    Buyya, R.2
  • 30
    • 38049058008
    • The Impact of Multicore on Math Software
    • Applied Parallel Computing. State of the Art in Scientific Computing, 8th International Workshop, PARA, B. Kågström, E. Elmroth, J. Dongarra, and J. Wasniewski, Eds., Springer
    • A. Buttari, J. Dongarra, J. Kurzak, J. Langou, P. Luszczek, and S. Tomov, "The Impact of Multicore on Math Software," in Applied Parallel Computing. State of the Art in Scientific Computing, 8th International Workshop, PARA, ser. Lecture Notes in Computer Science, B. Kågström, E. Elmroth, J. Dongarra, and J. Wasniewski, Eds., vol. 4699. Springer, 2006, pp. 1-10.
    • (2006) Lecture Notes in Computer Science , vol.4699 , pp. 1-10
    • Buttari, A.1    Dongarra, J.2    Kurzak, J.3    Langou, J.4    Luszczek, P.5    Tomov, S.6
  • 34
    • 70350641505
    • StarPU: A Unified Platform for Task Scheduling on Heterogeneous Multicore Architectures
    • Euro-Par 2009 Euro-par'09 Proceedings, Delft Pays-Bas
    • C. Augonnet, S. Thibault, R. Namyst, and P.-A. Wacrenier, "StarPU: A Unified Platform for Task Scheduling on Heterogeneous Multicore Architectures," in Euro-Par 2009 Euro-par'09 Proceedings, ser. LNCS, Delft Pays-Bas, 2009. [Online]. Available: http://hal.inria.fr/inria-00384363/en/
    • (2009) LNCS
    • Augonnet, C.1    Thibault, S.2    Namyst, R.3    Wacrenier, P.-A.4
  • 35
    • 74049102092
    • Dynamic task scheduling for linear algebra algorithms on distributed-memory multicore systems
    • New York, NY, USA: ACM, DOI: 10.1145/1654059.1654079
    • F. Song, A. YarKhan, and J. Dongarra, "Dynamic task scheduling for linear algebra algorithms on distributed-memory multicore systems," in SC '09: Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis. New York, NY, USA: ACM, 2009, pp. 1-11, http://doi.acm.org/10.1145/1654059.1654079 DOI: 10.1145/1654059.1654079.
    • (2009) SC '09: Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis , pp. 1-11
    • Song, F.1    YarKhan, A.2    Dongarra, J.3
  • 36
    • 0035266229
    • Automatic Parallelization Techniques Based on Compact DAG Extraction and Symbolic Scheduling
    • [Online]. Available: http://hal.inria.fr/inria-00000278/en
    • M. Cosnard and E. Jeannot, "Automatic Parallelization Techniques Based on Compact DAG Extraction and Symbolic Scheduling," Parallel Processing Letters, vol. 11, pp. 151-168, 2001. [Online]. Available: http://dx.doi.org/10.1142/S012962640100049X http://hal.inria.fr/inria-00000278/en/
    • (2001) Parallel Processing Letters , vol.11 , pp. 151-168
    • Cosnard, M.1    Jeannot, E.2
  • 43
    • 77951935506
    • Scheduling two-sided transformations using tile algorithms on multicore architectures
    • H. Ltaief, J. Kurzak, J. Dongarra, and R. M. Badia, "Scheduling two-sided transformations using tile algorithms on multicore architectures," Sci. Program., vol. 18, no. 1, pp. 35-50, 2010.
    • (2010) Sci. Program. , vol.18 , Issue.1 , pp. 35-50
    • Ltaief, H.1    Kurzak, J.2    Dongarra, J.3    Badia, R.M.4


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.