메뉴 건너뛰기




Volumn , Issue , 2008, Pages

Benchmarking GPUs to tune dense linear algebra

Author keywords

[No Author keywords available]

Indexed keywords

ALGORITHMIC OPTIMIZATION; CHOLESKY FACTORIZATIONS; DENSE LINEAR ALGEBRA; MATRIX; MEMORY SYSTEMS; MULTI CORE; MULTITHREADED;

EID: 70350771131     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/SC.2008.5214359     Document Type: Conference Paper
Times cited : (629)

References (23)
  • 1
    • 70350758060 scopus 로고    scopus 로고
    • ABTS, D., BATAINEH, A., SCOTT, S., FAANES, G., SCHWARZMEIER, J., LUNDBERG, E., JOHNSON, T., BYE, M., AND SCHWOERER, G. 2007. The Cray BlackWidow: A Highly Scalable Vector Multiprocessor, SC'07. AGARWAL R. C., AND GUSTAVSON, F.G. 1989. Vector and parallel algorithms for Cholesky factorization on IBM 3090, Supercomputing' 89, 225-233.
    • ABTS, D., BATAINEH, A., SCOTT, S., FAANES, G., SCHWARZMEIER, J., LUNDBERG, E., JOHNSON, T., BYE, M., AND SCHWOERER, G. 2007. The Cray BlackWidow: A Highly Scalable Vector Multiprocessor, SC'07. AGARWAL R. C., AND GUSTAVSON, F.G. 1989. Vector and parallel algorithms for Cholesky factorization on IBM 3090, Supercomputing' 89, 225-233.
  • 2
    • 70350762187 scopus 로고    scopus 로고
    • ALVERSON, R., CALLAHAN, D., CUMMINGS, D., KOBLENZ, B., PORTERFIELD, A., AND SMITH, B. 1990. The Tera Computer System, ICS'90, 1-6. AMD. 2006. ATI CTM Guide, version 1.01.
    • ALVERSON, R., CALLAHAN, D., CUMMINGS, D., KOBLENZ, B., PORTERFIELD, A., AND SMITH, B. 1990. The Tera Computer System, ICS'90, 1-6. AMD. 2006. ATI CTM Guide, version 1.01.
  • 5
    • 70350780783 scopus 로고    scopus 로고
    • BABOULIN, M., DONGARRA J., AND TOMOV, S. 2008. Some Issues in Dense Linear Algebra for Multicore and Special Purpose Architectures, Technical Report UT-CS-08-200, University of Tennessee, May 6, 2008 (also LAPACK Working Note 200).
    • BABOULIN, M., DONGARRA J., AND TOMOV, S. 2008. Some Issues in Dense Linear Algebra for Multicore and Special Purpose Architectures, Technical Report UT-CS-08-200, University of Tennessee, May 6, 2008 (also LAPACK Working Note 200).
  • 9
    • 70350767593 scopus 로고    scopus 로고
    • CASTILLO, M., CHAN, E., IGUAL, F. D., MAYO, R., QUINTANAORTI, E. S., QUINTANA-ORTI, G., VAN DE GEIJN, R., AND VAN ZEE, F. G. 2008. Making Programming Synonymous with Programming for Linear Algebra Libraries, FLAME Working Note #31. The University of Texas at Austin, Department of Computer Sciences. Technical Report TR-08-20, April 17, 2008.
    • CASTILLO, M., CHAN, E., IGUAL, F. D., MAYO, R., QUINTANAORTI, E. S., QUINTANA-ORTI, G., VAN DE GEIJN, R., AND VAN ZEE, F. G. 2008. Making Programming Synonymous with Programming for Linear Algebra Libraries, FLAME Working Note #31. The University of Texas at Austin, Department of Computer Sciences. Technical Report TR-08-20, April 17, 2008.
  • 10
    • 0030244536 scopus 로고    scopus 로고
    • CHOI, J., DONGARRA, J. J., OSTROUCHOV, L. S., PETITET, A. P., WALKER, D. W., AND WHALEY, R. C. 1996. The Design and Implementation of the ScaLAPACK LU, QR, and Cholesky Factorization Routines, Scientific Programming 5, 3, 173-184 (also LAPACK Working Note 80).
    • CHOI, J., DONGARRA, J. J., OSTROUCHOV, L. S., PETITET, A. P., WALKER, D. W., AND WHALEY, R. C. 1996. The Design and Implementation of the ScaLAPACK LU, QR, and Cholesky Factorization Routines, Scientific Programming 5, 3, 173-184 (also LAPACK Working Note 80).
  • 11
    • 70350771484 scopus 로고    scopus 로고
    • DONGARRA, J., DUFF, I. S., SORENSEN, D. C., AND VAN DER VORST, H. A. 1998. Numerical Linear Algebra for High-Performance Computers, SIAM.
    • DONGARRA, J., DUFF, I. S., SORENSEN, D. C., AND VAN DER VORST, H. A. 1998. Numerical Linear Algebra for High-Performance Computers, SIAM.
  • 13
    • 70350769644 scopus 로고    scopus 로고
    • DONGARRA, J., AND OSTROUCHOV, S. 1990. LAPACK Block Factorization Algorithms on the Intel iPSC/860, Technical Report CS-90-115, University of Tennessee (also LAPACK Working Note 24).
    • DONGARRA, J., AND OSTROUCHOV, S. 1990. LAPACK Block Factorization Algorithms on the Intel iPSC/860, Technical Report CS-90-115, University of Tennessee (also LAPACK Working Note 24).
  • 14
    • 33845468997 scopus 로고    scopus 로고
    • LU-GPU: Efficient Algorithms for Solving Dense Linear Systems on Graphics Hardware
    • GALOPPO, N., GOVINDARAJU, N. K., HENSON, M., AND MANOCHA, D. 2005. LU-GPU: Efficient Algorithms for Solving Dense Linear Systems on Graphics Hardware, SC'05.
    • (2005) SC , vol.5
    • GALOPPO, N.1    GOVINDARAJU, N.K.2    HENSON, M.3    MANOCHA, D.4
  • 15
    • 34548292052 scopus 로고    scopus 로고
    • A Memory Model for Scientific Algorithms on Graphcs Processors
    • GOVINDARAJU, N. K., LARSEN, S., GRAY, J., AND MANOCHA, D. 2006. A Memory Model for Scientific Algorithms on Graphcs Processors, SC'06.
    • (2006) SC , vol.6
    • GOVINDARAJU, N.K.1    LARSEN, S.2    GRAY, J.3    MANOCHA, D.4
  • 16
    • 78651269052 scopus 로고    scopus 로고
    • Understanding the efficiency of GPU algorithms for matrixmatrix multiplication
    • FATAHALIAN, K., SUGERMAN, J., AND HANRAHAN, P. 2004. Understanding the efficiency of GPU algorithms for matrixmatrix multiplication, In Graphics Hardware 2004, 133-137.
    • (2004) Graphics Hardware 2004 , pp. 133-137
    • FATAHALIAN, K.1    SUGERMAN, J.2    HANRAHAN, P.3
  • 17
    • 56849107345 scopus 로고    scopus 로고
    • Efficient Gather and Scatter Operations on Graphics Processors
    • HE, B., GOVINDARAJU, N. K., LUO, Q., AND SMITH, B. 2007. Efficient Gather and Scatter Operations on Graphics Processors, SC'07.
    • (2007) SC , vol.7
    • HE, B.1    GOVINDARAJU, N.K.2    LUO, Q.3    SMITH, B.4
  • 18
    • 70350769643 scopus 로고    scopus 로고
    • HWU, W. W., AND KIRK, D. 2007. ECE 498 AL1: Programming Massively Parallel Processors, Lecture Slides, University of Illinois, Urbana-Champaign. NVIDIA. 2006.
    • HWU, W. W., AND KIRK, D. 2007. ECE 498 AL1: Programming Massively Parallel Processors, Lecture Slides, University of Illinois, Urbana-Champaign. NVIDIA. 2006.
  • 19
    • 70350777041 scopus 로고    scopus 로고
    • NVIDIA GeForce 8800 GPU Architecture Overview, Technical Brief, November 2006.
    • NVIDIA GeForce 8800 GPU Architecture Overview, Technical Brief, November 2006.
  • 22
    • 70350769642 scopus 로고    scopus 로고
    • QUINTANA-ORTI, G., IGUAL, F. D., QUINTANA-ORTI, E. S., AND VAN DE GEIJN, R. 2008. Solving Dense Linear Systems on Platforms with Multiple Hardware Accelerators, FLAME Working Note #32. The University of Texas at Austin, Department of Computer Sciences. Technical Report TR-08-22. May 9, 2008.
    • QUINTANA-ORTI, G., IGUAL, F. D., QUINTANA-ORTI, E. S., AND VAN DE GEIJN, R. 2008. Solving Dense Linear Systems on Platforms with Multiple Hardware Accelerators, FLAME Working Note #32. The University of Texas at Austin, Department of Computer Sciences. Technical Report TR-08-22. May 9, 2008.


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.