-
2
-
-
0003706460
-
-
SIAM: Philadelphia, PA, (Available from:)
-
Anderson E, Bai Z, Bischof C, Blackford LS, Demmel JW, Dongarra JJ, Du Croz J, Greenbaum A, Hammarling S, McKenney A, Sorensen D,. LAPACK Users' Guide, SIAM: Philadelphia, PA, 1992. (Available from:).
-
(1992)
LAPACK Users' Guide
-
-
Anderson, E.1
Bai, Z.2
Bischof, C.3
Blackford, L.S.4
Demmel, J.W.5
Dongarra, J.J.6
Du Croz, J.7
Greenbaum, A.8
Hammarling, S.9
McKenney, A.10
Sorensen, D.11
-
3
-
-
74049090446
-
Comparative study of one-sided factorizations with multiple software packages on multi-core hardware
-
ACM: New York, NY, USA
-
Agullo E, Hadri B, Ltaief H, Dongarrra J,. Comparative study of one-sided factorizations with multiple software packages on multi-core hardware. In Sc '09: Proceedings of the conference on high performance computing networking, storage and analysis. ACM: New York, NY, USA, 2009; 1-12.
-
(2009)
Sc '09: Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis
, pp. 1-12
-
-
Agullo, E.1
Hadri, B.2
Ltaief, H.3
Dongarrra, J.4
-
4
-
-
0025402476
-
Set of Level 3 Basic Linear Algebra Subprograms
-
DOI 10.1145/77626.79170
-
Dongarra JJ, Croz JD, Duff IS, Hammarling S,. A set of level 3 basic linear algebra subprograms. ACM Transactions on Mathematical Software (TOMS) 1990; 16: 1-17. (Pubitemid 20684794)
-
(1990)
ACM Transactions on Mathematical Software
, vol.16
, Issue.1
, pp. 1-17
-
-
Dongarra, J.J.1
Croz, J.D.2
Hammarling, S.3
Duff, I.4
-
5
-
-
0018515759
-
Basic linear algebra subprograms for fortran usage
-
DOI 10.1145/355841.355847
-
Lawson CL, Hanson RJ, Kincaid D, Krogh FT,. Basic linear algebra subprograms for FORTRAN usage. ACM Transactions on Mathematical Software (TOMS) 1979; 5: 308-323. (Pubitemid 10415072)
-
(1979)
ACM Transactions on Mathematical Software
, vol.5
, Issue.3
, pp. 308-323
-
-
Lawson, C.L.1
Hanson, R.J.2
Kincaid, D.R.3
Krogh, F.T.4
-
6
-
-
50249105132
-
Parallel tiled QR factorization for multicore architectures
-
Buttari A, Langou J, Kurzak J, Dongarra JJ,. Parallel tiled QR factorization for multicore architectures. Concurrency and Computation: Practice & Experience 2008; 20 (13): 1573-1590.
-
(2008)
Concurrency and Computation: Practice & Experience
, vol.20
, Issue.13
, pp. 1573-1590
-
-
Buttari, A.1
Langou, J.2
Kurzak, J.3
Dongarra, J.J.4
-
10
-
-
60649117581
-
QR factorization for the CELL processor
-
Kurzak J, Dongarra JJ,. QR factorization for the CELL processor. Scientific Programming 2009; 17 (1-2): 31-42.
-
(2009)
Scientific Programming
, vol.17
, Issue.12
, pp. 31-42
-
-
Kurzak, J.1
Dongarra, J.J.2
-
12
-
-
70450228489
-
The libflame Library for Dense Matrix Computations
-
DOI: 10.1109/MCSE.2009.207
-
Van Zee FG, Chan E, van de Geijn RA, Quintana-Orti ES, Quintana-Orti G,. The libflame Library for Dense Matrix Computations. Computing in Science and Engineering 2009; 11 (6): 56-63. DOI: 10.1109/MCSE.2009.207.
-
(2009)
Computing in Science and Engineering
, vol.11
, Issue.6
, pp. 56-63
-
-
Van Zee, F.G.1
Chan, E.2
Van De Geijn, R.A.3
Quintana-Orti, E.S.4
Quintana-Orti, G.5
-
13
-
-
58149269099
-
A class of parallel tiled linear algebra algorithms for multicore architectures
-
Buttari A, Langou J, Kurzak J, Dongarra JJ,. A class of parallel tiled linear algebra algorithms for multicore architectures. Parallel Computing Systems Application 2009; 35: 38-53.
-
(2009)
Parallel Computing Systems Application
, vol.35
, pp. 38-53
-
-
Buttari, A.1
Langou, J.2
Kurzak, J.3
Dongarra, J.J.4
-
15
-
-
17644368925
-
Parallel out-of-core computation and updating of the QR factorization
-
DOI 10.1145/1055531.1055534
-
Gunter BC, van de Geijn RA,. Parallel out-of-core computation and updating the QR factorization. ACM Transactions on Mathematical Software 2005; 31 (1): 60-78. (Pubitemid 40557862)
-
(2005)
ACM Transactions on Mathematical Software
, vol.31
, Issue.1
, pp. 60-78
-
-
Gunter, B.C.1
Van De Geijn, R.A.2
-
16
-
-
35248843628
-
SuperMatrix out-of-order scheduling of matrix operations for SMP and multi-core architectures
-
DOI 10.1145/1248377.1248397, SPAA'07: Proceedings of the Nineteenth Annual Symposium on Parallelism in Algorithms and Architectures
-
Chan E, Quintana-Orti ES, Quintana-Orti G, van de Geijn R,. Supermatrix Out-of-Order Scheduling of Matrix Operations for SMP and Multi-Core Architectures. Nineteenth Annual ACM Symposium on Parallel Algorithms and Architectures SPAA'07, 2007; 116-125. (Pubitemid 47568560)
-
(2007)
Annual ACM Symposium on Parallelism in Algorithms and Architectures
, pp. 116-125
-
-
Chan, E.1
Quintana-Orti, E.S.2
Quintana-Orti, G.3
Van De Geijn, R.4
-
18
-
-
0003078924
-
A storage-efficient WY representation for products of Householder transformations
-
Schreiber R, van Loan C,. A storage-efficient WY representation for products of Householder transformations. Journal on Scientific and Statistical Computing 1991; 10: 53-57.
-
(1991)
Journal on Scientific and Statistical Computing
, vol.10
, pp. 53-57
-
-
Schreiber, R.1
Van Loan, C.2
-
19
-
-
0003615167
-
-
SIAM: Philadelphia, PA, (Available from:)
-
Blackford LS, Choi J, Cleary A, D'Azevedo E, Demmel J, Dhillon I, Dongarra JJ, Hammarling S, Henry G, Petitet A, Stanley K, Walker D, Whaley RC,. ScaLAPACK Users' Guide. SIAM: Philadelphia, PA, 1997. (Available from:).
-
(1997)
ScaLAPACK Users' Guide
-
-
Blackford, L.S.1
Choi, J.2
Cleary, A.3
D'Azevedo, E.4
Demmel, J.5
Dhillon, I.6
Dongarra, J.J.7
Hammarling, S.8
Henry, G.9
Petitet, A.10
Stanley, K.11
Walker, D.12
Whaley, R.C.13
-
20
-
-
84857683415
-
-
Innovative Computing Laboratory, University of Tennessee
-
YarKhan A, Kurzak J, Dongarra J,. QUARK users' guide: QUeueing And Runtime for Kernels, Technical Report, ICL-UT-11-02, Innovative Computing Laboratory, University of Tennessee, 2011.
-
(2011)
QUARK Users' Guide: QUeueing and Runtime for Kernels, Technical Report, ICL-UT-11-02
-
-
Yarkhan, A.1
Kurzak, J.2
Dongarra, J.3
-
21
-
-
78449238949
-
-
UT-CS-09-643, Innovative Computing Lab, University of Tennessee
-
Kurzak J, Dongarra J,. Fully dynamic scheduler for numerical scheduling on multicore processors. Technical Report LAWN (LAPACK Working Note) 220, UT-CS-09-643, Innovative Computing Lab, University of Tennessee, 2009.
-
(2009)
Fully Dynamic Scheduler for Numerical Scheduling on Multicore Processors. Technical Report LAWN (LAPACK Working Note) 220
-
-
Kurzak, J.1
Dongarra, J.2
-
23
-
-
57949083229
-
A dependency-aware task-based programming environment for multi-core architectures
-
Perez JM, Badia RM, Labarta J,. A dependency-aware task-based programming environment for multi-core architectures. Cluster'08, 2008; 142-151.
-
(2008)
Cluster'08
, pp. 142-151
-
-
Perez, J.M.1
Badia, R.M.2
Labarta, J.3
-
24
-
-
0029191296
-
Cilk: An efficient multithreaded runtime system
-
In: Santa Barbara, California
-
Blumofe RD, Joerg CF, Kuszmaul BC, Leiserson CE, Randall KH, Zhou Y,. Cilk: an efficient multithreaded runtime system. In Proceedings of the fifth ACM SIGPLAN symposium on principles and practice of parallel programming (PPoPP): Santa Barbara, California, 1995; 207-216.
-
(1995)
Proceedings of the Fifth ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP)
, pp. 207-216
-
-
Blumofe, R.D.1
Joerg, C.F.2
Kuszmaul, B.C.3
Leiserson, C.E.4
Randall, K.H.5
Zhou, Y.6
-
25
-
-
77954744880
-
-
Supercomputing Technologies Group, Massachusetts Institute of Technology Laboratory for Computer Science
-
Cilk 5.4.6 reference manual, Supercomputing Technologies Group, Massachusetts Institute of Technology Laboratory for Computer Science, 2001.
-
(2001)
Cilk 5.4.6 Reference Manual
-
-
|