-
1
-
-
63249083649
-
-
Asanovic, Krste, Ras Bodik, Bryan Christopher Catanzaro, Joseph James Gebis, Parry Husbands, Kurt Keutzer, David A. Patterson, William Lester Plishker, John Shalf, Samuel Webb Williams, and Katherine A. Yelick, The landscape of parallel computing research: A view from Berkeley. Tech. Report UCB/EECS-2006-183, EECS Department, University of California, Berkeley, Dec 2006.
-
Asanovic, Krste, Ras Bodik, Bryan Christopher Catanzaro, Joseph James Gebis, Parry Husbands, Kurt Keutzer, David A. Patterson, William Lester Plishker, John Shalf, Samuel Webb Williams, and Katherine A. Yelick, "The landscape of parallel computing research: A view from Berkeley." Tech. Report UCB/EECS-2006-183, EECS Department, University of California, Berkeley, Dec 2006.
-
-
-
-
2
-
-
63249085856
-
Some issues in dense linear algebra for multicore and special purpose architectures
-
University of Tennessee, LA-PACKWorking Note 200
-
Baboulin, Marc, Jack Dongarra, and Stanimire Tomov, "Some issues in dense linear algebra for multicore and special purpose architectures." Technical Report UT-CS-08-615, University of Tennessee, 2008, LA-PACKWorking Note 200.
-
(2008)
Technical Report UT-CS-08-615
-
-
Baboulin, M.1
Dongarra, J.2
Tomov, S.3
-
4
-
-
63249083138
-
-
Buttari, A., J. Langou, J. Kurzak, and J. Dongarra, A class of parallel tiled linear algebra algorithms for multicore architectures. Technical Report UT-CS-07-600, University of Tennessee, 2007, LAPACK Working Note 191.
-
Buttari, A., J. Langou, J. Kurzak, and J. Dongarra, "A class of parallel tiled linear algebra algorithms for multicore architectures." Technical Report UT-CS-07-600, University of Tennessee, 2007, LAPACK Working Note 191.
-
-
-
-
5
-
-
38049058008
-
The impact of multicore on math software
-
The Proceedings of workshop on state-of-the-art in scientific and parallel computing Para06, Umeå, Sweden, pp
-
Buttari, Alfredo, Jack Dongarra, Jakub Kurzak, Julien Langou, Piotr Luszczek, and Stanimire Tomov, "The impact of multicore on math software." The Proceedings of workshop on state-of-the-art in scientific and parallel computing (Para06), Springer's Lecture Notes in Computer Science 4699 (Umeå, Sweden), pp. 1-10, 2007.
-
(2007)
Springer's Lecture Notes in Computer Science
, vol.4699
, pp. 1-10
-
-
Buttari, A.1
Dongarra, J.2
Kurzak, J.3
Langou, J.4
Luszczek, P.5
Tomov, S.6
-
6
-
-
35548992612
-
Using mixed precision for sparse matrix computations to enhance the performance while achieving 64-bit accuracy
-
Buttari, Alfredo, Jack Dongarra, Jakub Kurzak, Piotr Luszczek, and StanimireTomov, "Using mixed precision for sparse matrix computations to enhance the performance while achieving 64-bit accuracy." ACM Transactions on Mathematical Software, 34, no. 4, 2008.
-
(2008)
ACM Transactions on Mathematical Software
, vol.34
, Issue.4
-
-
Buttari, A.1
Dongarra, J.2
Kurzak, J.3
Luszczek, P.4
StanimireTomov5
-
7
-
-
35548933706
-
Mixed precision iterative refinement techniques for the solution of dense linear systems
-
Buttari, Alfredo, Jack Dongarra, Julie Langou, Julien Langou, Piotr Luszczek, and Jakub Kurzak, "Mixed precision iterative refinement techniques for the solution of dense linear systems." Int. J. High Perform. Comput. Appl., 21, no. 4, pp. 457-466, 2007.
-
(2007)
Int. J. High Perform. Comput. Appl
, vol.21
, Issue.4
, pp. 457-466
-
-
Buttari, A.1
Dongarra, J.2
Langou, J.3
Langou, J.4
Luszczek, P.5
Kurzak, J.6
-
8
-
-
0034174025
-
The density advantage of configurable computing
-
DeHon, André, "The density advantage of configurable computing." IEEE Computer, 33, no. 4, pp. 41-49, 2000.
-
(2000)
IEEE Computer
, vol.33
, Issue.4
, pp. 41-49
-
-
DeHon, A.1
-
9
-
-
0003310398
-
Numerical linear algebra for high-performance computers
-
Dongarra, J., I. Duff, D. Sorensen, and H. van der Vorst, "Numerical linear algebra for high-performance computers." SIAM, 1998.
-
(1998)
SIAM
-
-
Dongarra, J.1
Duff, I.2
Sorensen, D.3
van der Vorst, H.4
-
10
-
-
17644368925
-
Parallel out-of-core computation and updating of the QR factorization
-
Gunter, B. and R. van de Geijn, "Parallel out-of-core computation and updating of the QR factorization." ACM Trans. Math. Softw., 31, no. 1, 60-78, 2005.
-
(2005)
ACM Trans. Math. Softw
, vol.31
, Issue.1
, pp. 60-78
-
-
Gunter, B.1
van de Geijn, R.2
-
11
-
-
63249128293
-
-
Gustavson, F.G., New generalized data structures for matrices lead to a variety of high performance dense linear algebra algorithms. In Proceedings of PARA 2004, Workshop on state-of-the art in scientific computing, pp. 11-20, June 20-23, 2004.
-
Gustavson, F.G., "New generalized data structures for matrices lead to a variety of high performance dense linear algebra algorithms." In Proceedings of PARA 2004, Workshop on state-of-the art in scientific computing, pp. 11-20, June 20-23, 2004.
-
-
-
-
12
-
-
34547360464
-
Implementation of mixed precision in solving systems of linear equations on the cell processor: Research articles
-
Kurzak, Jakub and Jack Dongarra, "Implementation of mixed precision in solving systems of linear equations on the cell processor: Research articles." Concurr. Comput.: Pract. Exper., 19, no. 10, pp. 1371-1385, 2007.
-
(2007)
Concurr. Comput.: Pract. Exper
, vol.19
, Issue.10
, pp. 1371-1385
-
-
Kurzak, J.1
Dongarra, J.2
-
14
-
-
63249123336
-
Programming algorithms-byblocks for matrix computations on multithreaded architectures
-
Technical Report TR-0804, University of Texas at Austin, FLAME Working Note 29
-
Quintana-Orti, G., E.S. Quintana-Orti, E. Chan, F. G. van Zee, and R.A. van de Geijn, "Programming algorithms-byblocks for matrix computations on multithreaded architectures." Technical Report TR-0804, University of Texas at Austin, FLAME Working Note 29, 2008.
-
(2008)
-
-
Quintana-Orti, G.1
Quintana-Orti, E.S.2
Chan, E.3
van Zee, F.G.4
van de Geijn, R.A.5
-
16
-
-
47349126591
-
Sparse matrix-vector multiplication design on FPGAs
-
Washington, DC, USA, IEEE Computer Society, pp
-
th Annual IEEE Symposium on Field-Programmable Custom Computing Machines, Washington, DC, USA, IEEE Computer Society, pp. 349-352, 2007.
-
(2007)
th Annual IEEE Symposium on Field-Programmable Custom Computing Machines
, pp. 349-352
-
-
Sun, J.1
Peterson, G.2
Storaasli, O.3
-
18
-
-
13444302326
-
The free lunch is over: A fundamental turn toward concurrency in software
-
Sutter, Herb, "The free lunch is over: A fundamental turn toward concurrency in software." Dr. Dobb's Journal, 30, no. 3, 2005.
-
(2005)
Dr. Dobb's Journal
, vol.30
, Issue.3
-
-
Sutter, H.1
-
19
-
-
67650056991
-
LU, QR and Cholesky using vector capabilities of GPUs
-
Technical Report, LAPACK Working Note 202
-
Volkov, V. and J.W. Demmel, "LU, QR and Cholesky using vector capabilities of GPUs." Technical Report, 2008, LAPACK Working Note 202.
-
(2008)
-
-
Volkov, V.1
Demmel, J.W.2
-
20
-
-
24344485098
-
-
Vuduc, Richard, James Demmel, and Katherine Yelick, Oski: A library of automatically tuned sparse matrix kernels. Journal of Physics: Conference Series, 16, no. 1, 521+, 2005.
-
Vuduc, Richard, James Demmel, and Katherine Yelick, "Oski: A library of automatically tuned sparse matrix kernels." Journal of Physics: Conference Series, 16, no. 1, 521+, 2005.
-
-
-
-
21
-
-
56749158843
-
Optimization of sparse matrix-vector multiplication on emerging multicore platforms
-
The Proceedings of workshop on state-of-the-art in scientific and parallel computing Para06, Supercomputing
-
Williams, S., L. Oliker, R. Vuduc, J. Shalf, K. Yelick, and J. Demmel, "Optimization of sparse matrix-vector multiplication on emerging multicore platforms." The Proceedings of workshop on state-of-the-art in scientific and parallel computing (Para06), Springer's Lecture Notes in Computer Science 4699 (Supercomputing), 2007.
-
(2007)
Springer's Lecture Notes in Computer Science
, vol.4699
-
-
Williams, S.1
Oliker, L.2
Vuduc, R.3
Shalf, J.4
Yelick, K.5
Demmel, J.6
|