[1] CUDA forums: Lots of small matrices. http://forums.nvidia.com/index.php?showtopic=188430. Accessed: 09/26/2011.
[2] CULA - a hybrid GPU linear algebra package. http://nvidia.fullviewmedia.com/gtc2010/0923-a3-2153.html. NVIDIA GPU Technology Conference 2010.
[3] CULA forums: Batch level parallelism. http://www.culatools.com/forums/viewtopic.php?f=14&t=774. Accessed: 09/26/2011.
[5] E. Agullo, J. Demmel, J. Dongarra, B. Hadri, J. Kurzak, J. Langou, H. Ltaief, P. Luszczek, and S. Tomov. Numerical linear algebra on emerging architectures: The PLASMA and MAGMA projects. In Journal of Physics: Conference Series, volume 180, page 012037. IOP Publishing, 2009.
[7] P.B. Bhat, Y.W. Lim, and V.K. Prasanna. Issues in using heterogeneous HPC systems for embedded real time signal processing applications. In RTCSA, page 134. IEEE Computer Society, 1995.
[10] D. Culler, R. Karp, D. Patterson, A. Sahay, K.E. Schauser, E. Santos, R. Subramonian, and T. Von Eicken. LogP: Towards a realistic model of parallel computation, volume 28. ACM, 1993.
[11] J.W. Demmel et al. Applied numerical linear algebra, volume 150. SIAM, Philadelphia, PA, USA, 1997.
[12] P.R. Dixon, T. Oonishi, and S. Furui. Harnessing graphics processors for the fast computation of acoustic likelihoods in speech recognition. Computer Speech & Language, 23(4):510-526, 2009.
[13] J.J. Dongarra, J. Du Croz, S. Hammarling, and I.S. Duff. A set of level 3 basic linear algebra subprograms. ACM TOMS, 16(1):1-17, 1990.
[14] J.R. Humphrey, D.K. Price, K.E. Spagnoli, A.L. Paolini, and E.J. Kelmelis. CULA: hybrid GPU accelerated linear algebra routines. In SPIE Conference Series, volume 7705, page 1, 2010.
[16] M. Murphy, K. Keutzer, S. Vasanawala, and M. Lustig. Clinically feasible reconstruction time for L1-SPIRiT parallel imaging and compressed sensing MRI. ISMRM'10, 2010.
[17] R. Nath, S. Tomov, and J. Dongarra. An improved MAGMA GEMM for Fermi GPUs. ICL, University of Tennessee, Tech. Rep., 2010.
[18] J. Nickolls and W.J. Dally. The GPU computing era. Micro, IEEE, 30(2):56-69, 2010.
[20] Michael Parker. Radar basics. http://www.eetimes.com/design/programmable-logic/4216104/Radar-basics-Part-1. Accessed: 09/27/2011.
[21] V. Volkov and J.W. Demmel. Benchmarking GPUs to tune dense linear algebra. In High Performance Computing, Networking, Storage and Analysis, 2008. SC 2008. International Conference for, pages 1-11, Nov. 2008.
[22] S. Williams, A. Waterman, and D. Patterson. Roofline: an insightful visual performance model for multicore architectures. Communications of the ACM, 52(4):65-76, 2009.
[23] H. Wong, M.-M. Papadopoulou, M. Sadooghi-Alvandi, and A. Moshovos. Demystifying GPU microarchitecture through microbenchmarking. In ISPASS, pages 235-246, March 2010.