메뉴 건너뛰기




Volumn , Issue , 2012, Pages 2-13

A predictive model for solving small linear algebra problems in GPU registers

Author keywords

Dense Linear Algebra; GPGPU; Modeling

Indexed keywords

ADAPTIVE RADAR; COMPLEX MATRICES; DENSE LINEAR ALGEBRA; GPGPU; GRAPHICS PROCESSING UNITS; HARDWARE-ACCELERATED; KERNEL LIBRARIES; LINEAR ALGEBRA PROBLEMS; MATRIX SIZE; MEMORY HIERARCHY; MEMORY SUBSYSTEMS; MULTI-LEVEL PARALLELISM; NUMERICAL LINEAR ALGEBRA; PREDICTIVE MODELS; PROBLEM SIZE; QR FACTORIZATIONS; RADAR PROCESSING; REAL-TIME APPLICATION; SQUARE ROOT FUNCTIONS; SUPERCOMPUTING APPLICATIONS;

EID: 84866859911     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/IPDPS.2012.11     Document Type: Conference Paper
Times cited : (42)

References (23)
  • 1
    • 84866858955 scopus 로고    scopus 로고
    • Accessed: 09/26/2011
    • CUDA forums: Lots of small matrices. http://forums.nvidia.com/index.php? showtopic=188430. Accessed: 09/26/2011.
    • CUDA Forums: Lots of Small Matrices
  • 2
    • 84866860138 scopus 로고    scopus 로고
    • CULA - A hybrid GPU linear algebra package
    • CULA - a hybrid GPU linear algebra package. http://nvidia.fullviewmedia. com/gtc2010/0923-a3-2153.html. NVIDIA GPU Technology Conference 2010.
    • NVIDIA GPU Technology Conference 2010
  • 3
    • 84866852864 scopus 로고    scopus 로고
    • Accessed: 09/26/2011
    • CULA forums: Batch level parallelism. http://www.culatools.com/forums/ viewtopic.php?f=14&t=774. Accessed: 09/26/2011.
    • CULA Forums: Batch Level Parallelism
  • 7
    • 0029540641 scopus 로고
    • Issues in using heterogeneous hpc systems for embedded real time signal processing applications
    • Published by the IEEE Computer Society
    • P.B. Bhat, Y.W. Lim, and V.K. Prasanna. Issues in using heterogeneous hpc systems for embedded real time signal processing applications. In rtcsa, page 134. Published by the IEEE Computer Society, 1995.
    • (1995) rtcsa , pp. 134
    • Bhat, P.B.1    Lim, Y.W.2    Prasanna, V.K.3
  • 12
    • 67349241918 scopus 로고    scopus 로고
    • Harnessing graphics processors for the fast computation of acoustic likelihoods in speech recognition
    • P.R. Dixon, T. Oonishi, and S. Furui. Harnessing graphics processors for the fast computation of acoustic likelihoods in speech recognition. Computer Speech & Language, 23(4):510-526, 2009.
    • (2009) Computer Speech & Language , vol.23 , Issue.4 , pp. 510-526
    • Dixon, P.R.1    Oonishi, T.2    Furui, S.3
  • 13
    • 0025402476 scopus 로고
    • A set of level 3 basic linear algebra subprograms
    • J.J. Dongarra, J. Du Croz, S. Hammarling, and I.S. Duff. A set of level 3 basic linear algebra subprograms. ACM TOMS, 16(1):1-17, 1990.
    • (1990) ACM TOMS , vol.16 , Issue.1 , pp. 1-17
    • Dongarra, J.J.1    Du Croz, J.2    Hammarling, S.3    Duff, I.S.4
  • 16
    • 79954854537 scopus 로고    scopus 로고
    • Clinically feasible reconstruction time for l1-spirit parallel imaging and compressed sensing mri
    • M. Murphy, K. Keutzer, S. Vasanawala, and M. Lustig. Clinically feasible reconstruction time for l1-spirit parallel imaging and compressed sensing mri. ISMRM'10, 2010.
    • (2010) ISMRM'10
    • Murphy, M.1    Keutzer, K.2    Vasanawala, S.3    Lustig, M.4
  • 18
    • 77951154340 scopus 로고    scopus 로고
    • The GPU computing era
    • J. Nickolls and W.J. Dally. The GPU computing era. Micro, IEEE, 30(2):56-69, 2010.
    • (2010) Micro, IEEE , vol.30 , Issue.2 , pp. 56-69
    • Nickolls, J.1    Dally, W.J.2
  • 20
    • 85044587281 scopus 로고    scopus 로고
    • Accessed: 09/27/2011
    • Michael Parker. Radar basics. http://www.eetimes.com/ design/programmable-logic/4216104/Radar-basics-Part-1. Accessed: 09/27/2011.
    • Radar Basics
    • Parker, M.1
  • 22
    • 65949107549 scopus 로고    scopus 로고
    • Roofline: An insightful visual performance model for multicore architectures
    • S. Williams, A. Waterman, and D. Patterson. Roofline: an insightful visual performance model for multicore architectures. Communications of the ACM, 52(4):65-76, 2009.
    • (2009) Communications of the ACM , vol.52 , Issue.4 , pp. 65-76
    • Williams, S.1    Waterman, A.2    Patterson, D.3
  • 23
    • 77952579552 scopus 로고    scopus 로고
    • Demystifying GPU microarchitecture through microbenchmarking
    • March
    • H. Wong, M.-M. Papadopoulou, M. Sadooghi-Alvandi, and A. Moshovos. Demystifying GPU microarchitecture through microbenchmarking. In ISPASS, pages 235-246, March 2010.
    • (2010) ISPASS , pp. 235-246
    • Wong, H.1    Papadopoulou, M.-M.2    Sadooghi-Alvandi, M.3    Moshovos, A.4


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.