-
1
-
-
74049126244
-
-
K. Asanovic, R. Bodik, B. C. Catanzaro, J. J. Gebis, P. Husbands, K. Keutzer, D. A. Patterson, W. L. Plishker, J. Shalf, S. W. Williams, and K. A. Yelick. The landscape of parallel computing research: A view from Berkeley. Technical Report UCB/EECS-2006-183, EECS Department, University of California, Berkeley, Dec 2006.
-
K. Asanovic, R. Bodik, B. C. Catanzaro, J. J. Gebis, P. Husbands, K. Keutzer, D. A. Patterson, W. L. Plishker, J. Shalf, S. W. Williams, and K. A. Yelick. The landscape of parallel computing research: A view from Berkeley. Technical Report UCB/EECS-2006-183, EECS Department, University of California, Berkeley, Dec 2006.
-
-
-
-
2
-
-
0003615167
-
-
Society for Industrial and Applied Mathematics, Philadelphia, PA, USA
-
L. S. Blackford, J. Choi, A. Cleary, E. D'Azeuedo, J. Demmel, I. Dhillon, S. Hammarling, G. Henry, A. Petitet, K. Stanley, D. Walker, and R. C. Whaley. ScaLAPACK user's guide. Society for Industrial and Applied Mathematics, Philadelphia, PA, USA, 1997.
-
(1997)
ScaLAPACK user's guide
-
-
Blackford, L.S.1
Choi, J.2
Cleary, A.3
D'Azeuedo, E.4
Demmel, J.5
Dhillon, I.6
Hammarling, S.7
Henry, G.8
Petitet, A.9
Stanley, K.10
Walker, D.11
Whaley, R.C.12
-
3
-
-
33847230814
-
ScaLAPACK: A portable linear algebra library for distributed memory computers - design issues and performance
-
Washington, DC, USA, IEEE Computer Society
-
L. S. Blackford, J. Choi, A. Cleary, A. Petitet, R. C. Whaley, J. Demmel, I. Dhillon, K. Stanley, J. Dongarra, S. Hammarling, G. Henry, and D. Walker. ScaLAPACK: A portable linear algebra library for distributed memory computers - design issues and performance. In Supercomputing '96: Proceedings of the 1996 ACM/IEEE conference on Supercomputing (CDROM), page 5, Washington, DC, USA, 1996. IEEE Computer Society.
-
(1996)
Supercomputing '96: Proceedings of the 1996 ACM/IEEE conference on Supercomputing (CDROM)
, pp. 5
-
-
Blackford, L.S.1
Choi, J.2
Cleary, A.3
Petitet, A.4
Whaley, R.C.5
Demmel, J.6
Dhillon, I.7
Stanley, K.8
Dongarra, J.9
Hammarling, S.10
Henry, G.11
Walker, D.12
-
4
-
-
0000269759
-
Scheduling multithreaded computations by work stealing
-
R. D. Blumofe and C. E. Leiserson. Scheduling multithreaded computations by work stealing. J. ACM, 46(5):720-748, 1999.
-
(1999)
J. ACM
, vol.46
, Issue.5
, pp. 720-748
-
-
Blumofe, R.D.1
Leiserson, C.E.2
-
5
-
-
58149269099
-
A class of parallel tiled linear algebra algorithms for multicore architectures
-
A. Buttari, J. Langou, J. Kurzak, and J. Dongarra. A class of parallel tiled linear algebra algorithms for multicore architectures. Parallel Comput., 35(1):38-53, 2009.
-
(2009)
Parallel Comput
, vol.35
, Issue.1
, pp. 38-53
-
-
Buttari, A.1
Langou, J.2
Kurzak, J.3
Dongarra, J.4
-
6
-
-
0030244536
-
-
J. Choi, J. J. Dongarra, L. S. Ostrouchov, A. P. Petitet, D. W. Walker, and R. C. Whaley. Design and implementation of the ScaLAPACK LU, QR, and Cholesky factorization routines. Sci. Program., 5(3):173-184, 1996.
-
J. Choi, J. J. Dongarra, L. S. Ostrouchov, A. P. Petitet, D. W. Walker, and R. C. Whaley. Design and implementation of the ScaLAPACK LU, QR, and Cholesky factorization routines. Sci. Program., 5(3):173-184, 1996.
-
-
-
-
8
-
-
0031622953
-
The implementation of the Cilk-5 multithreaded language
-
New York, NY, USA, ACM
-
M. Frigo, C. E. Leiserson, and K. H. Randall. The implementation of the Cilk-5 multithreaded language. In PLDI '98: Proceedings of the ACM SIGPLAN 1998 conference on Programming language design and implementation, pages 212-223, New York, NY, USA, 1998. ACM.
-
(1998)
PLDI '98: Proceedings of the ACM SIGPLAN 1998 conference on Programming language design and implementation
, pp. 212-223
-
-
Frigo, M.1
Leiserson, C.E.2
Randall, K.H.3
-
9
-
-
74049158447
-
The Roadrunner supercomputer: A petaflop's no problem
-
J. Gray. The Roadrunner supercomputer: A petaflop's no problem. Linux J., 2008(175):1, 2008.
-
(2008)
Linux J
, vol.2008
, Issue.175
, pp. 1
-
-
Gray, J.1
-
10
-
-
0000667923
-
The torus-wrap mapping for dense matrix calculations on massively parallel computers
-
B. A. Hendrickson and D. E. Womble. The torus-wrap mapping for dense matrix calculations on massively parallel computers. SIAM J. Sci. Comput., 15(5):1201-1226, 1994.
-
(1994)
SIAM J. Sci. Comput
, vol.15
, Issue.5
, pp. 1201-1226
-
-
Hendrickson, B.A.1
Womble, D.E.2
-
11
-
-
49349111725
-
Solving systems of linear equations on the CELL processor using Cholesky factorization
-
J. Kurzak, A. Buttari, and J. Dongarra. Solving systems of linear equations on the CELL processor using Cholesky factorization. IEEE Trans. Parallel Distrib. Syst., 19(9):1175-1186, 2008.
-
(2008)
IEEE Trans. Parallel Distrib. Syst
, vol.19
, Issue.9
, pp. 1175-1186
-
-
Kurzak, J.1
Buttari, A.2
Dongarra, J.3
-
12
-
-
37549032725
-
IBM Power6 microarchitecture
-
H. Q. Le, W. J. Starke, J. S. Fields, F. P. O'Connell, D. Q. Nguyen, B. J. Ronchetti, W. M. Sauer, E. M. Schwarz, and M. T. Vaden. IBM Power6 microarchitecture. IBM J. Res. Dev., 51(6):639-662, 2007.
-
(2007)
IBM J. Res. Dev
, vol.51
, Issue.6
, pp. 639-662
-
-
Le, H.Q.1
Starke, W.J.2
Fields, J.S.3
O'Connell, F.P.4
Nguyen, D.Q.5
Ronchetti, B.J.6
Sauer, W.M.7
Schwarz, E.M.8
Vaden, M.T.9
-
13
-
-
0042235298
-
Tiling, block data layout, and memory hierarchy performance
-
N. Park, B. Hong, and V. K. Prasanna. Tiling, block data layout, and memory hierarchy performance. IEEE Transactions on Parallel and Distributed Systems, 14(7):640-654, 2003.
-
(2003)
IEEE Transactions on Parallel and Distributed Systems
, vol.14
, Issue.7
, pp. 640-654
-
-
Park, N.1
Hong, B.2
Prasanna, V.K.3
-
14
-
-
57949083229
-
-
J. Perez, R. Badia, and J. Labarta. A dependency-aware task-based programming environment for multi-core architectures. Cluster Computing, 2008 IEEE International Conference on, pages 142-151, 29 2008-Oct. 1 2008.
-
J. Perez, R. Badia, and J. Labarta. A dependency-aware task-based programming environment for multi-core architectures. Cluster Computing, 2008 IEEE International Conference on, pages 142-151, 29 2008-Oct. 1 2008.
-
-
-
-
15
-
-
49249086142
-
Larrabee: A many-core x86 architecture for visual computing
-
L. Seiler, D. Carmean, E. Sprangle, T. Forsyth, and M. Abrash. Larrabee: A many-core x86 architecture for visual computing. ACM Trans. Graph., 27(3):1-15, 2008.
-
(2008)
ACM Trans. Graph
, vol.27
, Issue.3
, pp. 1-15
-
-
Seiler, L.1
Carmean, D.2
Sprangle, E.3
Forsyth, T.4
Abrash, M.5
-
16
-
-
74049123725
-
-
University of Tennessee. PLASMA. http://icl.cs.utk.edu/plasma, 2009.
-
University of Tennessee. PLASMA. http://icl.cs.utk.edu/plasma, 2009.
-
-
-
|