-
1
-
-
35648995516
-
-
Technical Report UCB/EECS-2006-183, Electrical Engineering, and Computer Sciences Department, University of California at Berkeley
-
K. Asanovic, R. Bodik, B. C. Catanzaro, J. J. Gebis, P. Husbands, K. Keutzer, D. A. Patterson, W. L. Plishker, J. Shalf, S. W. Williams, and K. A. Yelick. The Landscape of Parallel Computing Research: A View from Berkeley. Technical Report UCB/EECS-2006-183, Electrical Engineering, and Computer Sciences Department, University of California at Berkeley, 2006.
-
(2006)
The Landscape of Parallel Computing Research: A View from Berkeley
-
-
Asanovic, K.1
Bodik, R.2
Catanzaro, B.C.3
Gebis, J.J.4
Husbands, P.5
Keutzer, K.6
Patterson, D.A.7
Plishker, W.L.8
Shalf, J.9
Williams, S.W.10
Yelick, K.A.11
-
2
-
-
0003706460
-
-
SIAM, Philadelphia, PA
-
E. Anderson, Z. Bai, C. Bischof, L. S. Blackford, J. W. Demmel, J. J. Dongarra, J. Du Croz, A. Greenbaum, S. Hammarling, A. McKenney, and D. Sorensen. LAPACK Users’ Guide. SIAM, Philadelphia, PA, 1992. http://www.netlib.org/lapack/lug/.
-
(1992)
LAPACK Users’ Guide
-
-
Anderson, E.1
Bai, Z.2
Bischof, C.3
Blackford, L.S.4
Demmel, J.W.5
Dongarra, J.J.6
Du Croz, J.7
Greenbaum, A.8
Hammarling, S.9
McKenney, A.10
Sorensen, D.11
-
6
-
-
70350771131
-
Benchmarking GPUs to tune dense linear algebra
-
Piscataway, NJ, IEEE Press
-
V. Volkov, and J. Demmel. Benchmarking GPUs to tune dense linear algebra. In SC '08: Proceedings of the 2008 ACM/IEEE Conference on Supercomputing, pages 1-11, Piscataway, NJ, 2008. IEEE Press.
-
(2008)
SC '08: Proceedings of the 2008 ACM/IEEE Conference on Supercomputing
, pp. 1-11
-
-
Volkov, V.1
Demmel, J.2
-
7
-
-
68849128792
-
A Note on Auto-tuning GEMM for GPUs
-
Berlin, Heidelberg, Springer-Verlag
-
Y. Li, J. Dongarra, and S. Tomov. A Note on Auto-tuning GEMM for GPUs. In ICCS '09: Proceedings of the 9th International Conference on Computational Science, pages 884-892, Berlin, Heidelberg, 2009. Springer-Verlag.
-
(2009)
ICCS '09: Proceedings of the 9th International Conference on Computational Science
, pp. 884-892
-
-
Li, Y.1
Dongarra, J.2
Tomov, S.3
-
11
-
-
0342583534
-
-
University of California, Berkeley, UCB/CSD-92-702, September
-
James W. Demmel. Trading Off Parallelism, and Numerical Stability, EECS Department, University of California, Berkeley, UCB/CSD-92-702, September 1992.
-
(1992)
Trading Off Parallelism, and Numerical Stability, EECS Department
-
-
Demmel, J.W.1
-
12
-
-
0001707332
-
Stability of parallel triangular system solvers
-
Nicholas J. Higham. Stability of parallel triangular system solvers, SIAM J. Sci. Comput., 16(2): 400-413, 1995.
-
(1995)
SIAM J. Sci. Comput
, vol.16
, Issue.2
, pp. 400-413
-
-
Higham, N.J.1
-
13
-
-
0343462141
-
Automated Empirical Optimizations of Software, and the ATLAS Project
-
R. Whaley, A. Petitet, and J. Dongarra. Automated Empirical Optimizations of Software, and the ATLAS Project. Parallel Computing, 27(1-2): 3-35, 2001.
-
(2001)
Parallel Computing
, vol.27
, Issue.1-2
, pp. 3-35
-
-
Whaley, R.1
Petitet, A.2
Dongarra, J.3
-
14
-
-
20744452904
-
Self adapting linear algebra algorithms, and software
-
special issue on “Program Generation, Optimization, and Adaptation.” Proceddings
-
Jim Demmel, Jack Dongarra, Victor Eijkhout, Erika Fuentes, Antoine Petitet, Rich Vuduc, Clint Whaley, and Katherine Yelick. Self adapting linear algebra algorithms, and software. Proceedings of the IEEE 93 (2005), no. 2, special issue on “Program Generation, Optimization, and Adaptation.” Proceddings, vol. 93, 2, pp. 293-312.
-
(2005)
Proceedings of the IEEE
, vol.93
, Issue.2
, pp. 293-312
-
-
Demmel, J.1
Dongarra, J.2
Eijkhout, V.3
Fuentes, E.4
Petitet, A.5
Vuduc, R.6
Whaley, C.7
Yelick, K.8
-
15
-
-
0030661485
-
Optimizing Matrix Multiply Using PHiPAC: A Portable, High-Performance, ANSI C Coding Methodology
-
Jeff Bilmes, Krste Asanovic, Chee-Whye Chin, and James Demmel. Optimizing Matrix Multiply Using PHiPAC: A Portable, High-Performance, ANSI C Coding Methodology. International Conference on Supercomputing, 1997, pp. 340-347.
-
(1997)
International Conference on Supercomputing
, pp. 340-347
-
-
Bilmes, J.1
Asanovic, K.2
Chin, C.-W.3
Demmel, J.4
-
16
-
-
0031636309
-
FFTW: An adaptive software architecture for the FFT
-
IEEE
-
Matteo Frigo, and Steven G. Johnson. FFTW: An adaptive software architecture for the FFT. Proc. 1998 IEEE Intl. Conf. Acoustics Speech, and Signal Processing, vol. 3, IEEE, 1998, pp. 1381-1384.
-
(1998)
Proc. 1998 IEEE Intl. Conf. Acoustics Speech, and Signal Processing
, vol.3
, pp. 1381-1384
-
-
Frigo, M.1
Johnson, S.G.2
-
17
-
-
68849128792
-
A note on auto-tuning GEMMfor GPUs
-
Berlin, Heidelberg, Springer-Verlag
-
Y. Li, J. Dongarra, and S. Tomov. A note on auto-tuning GEMMfor GPUs. In ICCS '09, pages 884-892, Berlin, Heidelberg, 2009. Springer-Verlag.
-
(2009)
ICCS '09
, pp. 884-892
-
-
Li, Y.1
Dongarra, J.2
Tomov, S.3
-
18
-
-
85054454423
-
Compilers, and more: Optimizing GPU kernels
-
October
-
Michael Wolfe. Compilers, and more: Optimizing GPU kernels. HPC Wire, http://www.hpcwire.com/features/33607434.html, October 2008.
-
(2008)
HPC Wire
-
-
Wolfe, M.1
|