메뉴 건너뛰기




Volumn , Issue , 2011, Pages 87-93

High performance matrix inversion on a multi-core platform with several GPUs

Author keywords

GPUs; linear algebra; matrix inversion

Indexed keywords

COMPUTATIONAL EFFORT; COMPUTATIONAL KERNELS; COMPUTATIONAL UNITS; CONCURRENT EXECUTION; CORE PROCESSORS; FLOATING POINT OPERATIONS PER SECONDS; GAUSS-JORDAN ELIMINATION; GENERAL PURPOSE PROCESSORS; GPUS; GRAPHICS PROCESSOR; HIGH PERFORMANCE CODES; HIGH PERFORMANCE COMPUTING; MATRIX INVERSIONS; MODEL REDUCTION; MULTI-CORE PLATFORMS; MULTI-CORE PROCESSOR; NUMERICAL EXPERIMENTS; OFF-LOAD; OPTIMAL CONTROLS; PARALLEL IMPLEMENTATIONS; PERFORMANCE MATRICES; SCIENTIFIC APPLICATIONS; TARGET ARCHITECTURES;

EID: 79955025807     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/PDP.2011.66     Document Type: Conference Paper
Times cited : (17)

References (14)
  • 2
    • 73349092728 scopus 로고    scopus 로고
    • Exploiting the capabilities of modern GPUs for dense matrix computations
    • "Exploiting the capabilities of modern GPUs for dense matrix computations," Concurrency and Computation: Practice and Experience, vol. 21, pp. 2457-2477, 2009.
    • (2009) Concurrency and Computation: Practice and Experience , vol.21 , pp. 2457-2477
  • 5
    • 0034770552 scopus 로고    scopus 로고
    • A note on parallel matrix inversion
    • "A note on parallel matrix inversion," SIAM J. Sci. Comput., vol. 22, pp. 1762-1771, 2001.
    • (2001) SIAM J. Sci. Comput. , vol.22 , pp. 1762-1771
  • 6
    • 0039435412 scopus 로고    scopus 로고
    • FLAME: Formal linear algebra methods environment
    • DOI 10.1145/504210.504213
    • J. A. Gunnels, F. G. Gustavson, G. M. Henry, and R. A. van de Geijn, "FLAME: Formal linear algebra methods environment," ACM Trans. Math. Soft., vol. 27, no. 4, pp. 422-455, December 2001. [Online]. Available: http://doi.acm.org/10.1145/504210.504213 (Pubitemid 33602331)
    • (2001) ACM Transactions on Mathematical Software , vol.27 , Issue.4 , pp. 422-455
    • Gunnels, J.A.1    Gustavson, F.G.2    Henry, G.M.3    Van De Geijn, R.A.4
  • 8
    • 65849272637 scopus 로고    scopus 로고
    • A comparison of lookahead and algorithmic blocking techniques for parallel matrix factorization
    • The Australian National University, Canberra 0200 ACT, Australia
    • A. Strazdins, "A comparison of lookahead and algorithmic blocking techniques for parallel matrix factorization," Tech. Rep. TR-CS-98-07, Department of Computer Science, The Australian National University, Canberra 0200 ACT, Australia 1998.
    • (1998) Tech. Rep. TR-CS-98-07, Department of Computer Science
    • Strazdins, A.1
  • 9
    • 67650021816 scopus 로고    scopus 로고
    • Solving dense linear algebra problems on platforms with multiple hardware accelerators
    • "Solving dense linear algebra problems on platforms with multiple hardware accelerators," ACM SIGPLAN Symposium on Principles and Practice of Parallel Computing, pp. 121-129, 2009.
    • (2009) ACM SIGPLAN Symposium on Principles and Practice of Parallel Computing , pp. 121-129
  • 10
    • 48849086742 scopus 로고    scopus 로고
    • Updating an LU factorization with pivoting
    • [Online]. Available: http://doi.acm.org/10.1145/1377612.1377615
    • "Updating an LU factorization with pivoting," ACM Trans. Math. Softw., vol. 35, no. 2, pp. 1-16, 2008. [Online]. Available: http://doi.acm.org/10.1145/1377612.1377615
    • (2008) ACM Trans. Math. Softw. , vol.35 , Issue.2 , pp. 1-16
  • 11
    • 70350635626 scopus 로고    scopus 로고
    • An extension of the StarSs programming model for platforms with multiple GPUs
    • "An extension of the StarSs programming model for platforms with multiple GPUs," Lecture Notes in Computer Science 5704, Euro-Par 2009, pp. 851-862, 2009.
    • (2009) Lecture Notes in Computer Science 5704, Euro-par 2009 , pp. 851-862
  • 12
    • 77956773183 scopus 로고    scopus 로고
    • Extending OpenMP to survive the heterogeneous multi-core era
    • [Online]. Available: http://doi.acm.org/10.1145/1377612.1377615
    • "Extending OpenMP to survive the heterogeneous multi-core era," Int. Journal of Parallel Programming, vol. 38, no. 5, pp. 440-459, 2010. [Online]. Available: http://doi.acm.org/10.1145/1377612.1377615
    • (2010) Int. Journal of Parallel Programming , vol.38 , Issue.5 , pp. 440-459
  • 13
    • 77954080759 scopus 로고    scopus 로고
    • Dense linear algebra solvers for multicore with GPU accelerators
    • Published at Atlanta, GA, January
    • S. Tomov, R. Nath, H. Ltaief, J. Dongarra, "Dense linear algebra solvers for multicore with GPU accelerators," Published at HIPS 2010, Atlanta, GA, January 2010.
    • (2010) HIPS 2010
    • Tomov, S.1    Nath, R.2    Ltaief, H.3    Dongarra, J.4
  • 14
    • 70350641505 scopus 로고    scopus 로고
    • StarPU: A unified platform for task scheduling on heterogeneous multicore architectures
    • "StarPU: A unified platform for task scheduling on heterogeneous multicore architectures," Lecture Notes in Computer Science 5704, Euro-Par 2009, pp. 863-874, 2009.
    • (2009) Lecture Notes in Computer Science 5704, Euro-par 2009 , pp. 863-874


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.