



Volume 24, Issue 4, 2010, Pages 511-515

An improved MAGMA GEMM for Fermi graphics processing units

Author keywords

CUDA matrix multiply; dense linear algebra; Fermi; GPU BLAS; hybrid computing

Indexed keywords

DENSE LINEAR ALGEBRA; FERMI; GPU BLAS; HYBRID COMPUTING; MATRIX;

EID: 78649504961     PISSN: 10943420     EISSN: 17412846     Source Type: Journal    
DOI: 10.1177/1094342010385729     Document Type: Article
Times cited: 158

References (12)
  • 3
    • 78651269052
    • Understanding the Efficiency of GPU Algorithms for Matrix-Matrix Multiplication
    • ACM Press, New York
    • Fatahalian, K., Sugerman, J., and Hanrahan, P. (2004). Understanding the efficiency of GPU algorithms for matrix-matrix multiplication. In Proceedings of HWWS'04. ACM Press, New York, pp. 133-137.
    • (2004) Proceedings of HWWS'04 , pp. 133-137
    • Fatahalian, K.1    Sugerman, J.2    Hanrahan, P.3
  • 4
    • 0012097416
    • Stability of a method for multiplying complex matrices with three real matrix multiplications
    • Higham, N.J. (1992). Stability of a method for multiplying complex matrices with three real matrix multiplications. SIAM J. Matrix Anal. Appl. 13: 681-687.
    • (1992) SIAM J. Matrix Anal. Appl , vol.13 , pp. 681-687
    • Higham, N.J.1
  • 5
    • 68849128792
    • A Note on Auto-tuning GEMM for GPUs
    • Springer-Verlag, Berlin
    • Li, Y., Dongarra, J., and Tomov, S. (2009). A note on auto-tuning GEMM for GPUs. In Proceedings of ICCS'09. Springer-Verlag, Berlin, pp. 884-892.
    • (2009) Proceedings of ICCS'09 , pp. 884-892
    • Li, Y.1    Dongarra, J.2    Tomov, S.3
  • 6
    • 78649502174
    • Accelerating GPU Kernels for Dense Linear Algebra
    • Berkeley, CA, 22-25 June
    • Nath, R., Tomov, S., and Dongarra, J. (2010). Accelerating GPU kernels for dense linear algebra. In Proceedings of VECPAR'10, Berkeley, CA, 22-25 June 2010.
    • (2010) Proceedings of VECPAR'10
    • Nath, R.1    Tomov, S.2    Dongarra, J.3
  • 10
    • 67349149521
    • Benchmarking GPUs to Tune Dense Linear Algebra
    • IEEE Press, Piscataway, NJ
    • Volkov, V. and Demmel, J. (2008). Benchmarking GPUs to tune dense linear algebra. In Proceedings of SC'08. IEEE Press, Piscataway, NJ, pp. 1-11.
    • (2008) Proceedings of SC'08 , pp. 1-11
    • Volkov, V.1    Demmel, J.2
  • 11
    • 0343462141
    • Automated empirical optimizations of software and the ATLAS project
    • Whaley, R.C., Petitet, A., and Dongarra, J. (2001). Automated empirical optimizations of software and the ATLAS project. Parallel Comput. 27(1-2): 3-35.
    • (2001) Parallel Comput , vol.27 , Issue.1-2 , pp. 3-35
    • Whaley, R.C.1    Petitet, A.2    Dongarra, J.3


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.