-
1
-
-
84857670376
-
Towards an efficient tile matrix inversion of symmetric positive definite matrices on multicore architectures
-
E. Agullo, H. Bouwmeester, J. Dongarra, J. Kurzak, J. Langou, and L. Rosenberg. Towards an efficient tile matrix inversion of symmetric positive definite matrices on multicore architectures. In VECPAR'10. 9th International Meeting High Performance Computing for Computational Science, Berkeley, CA (USA), June 22-25 2010.
-
VECPAR'10. 9th International Meeting High Performance Computing for Computational Science, Berkeley, CA (USA), June 22-25 2010
-
-
Agullo, E.1
Bouwmeester, H.2
Dongarra, J.3
Kurzak, J.4
Langou, J.5
Rosenberg, L.6
-
2
-
-
0003706460
-
-
Society for Industrial and Applied Mathematics, Philadelphia, Third edition
-
E. Anderson, Z. Bai, C. Bischof, S. L. Blackford, J. W. Demmel, J. J. Dongarra, J. D. Croz, A. Greenbaum, S. Hammarling, A. McKenney, and D. C. Sorensen. LAPACK User's Guide. Society for Industrial and Applied Mathematics, Philadelphia, Third edition, 1999.
-
(1999)
LAPACK User's Guide
-
-
Anderson, E.1
Bai, Z.2
Bischof, C.3
Blackford, S.L.4
Demmel, J.W.5
Dongarra, J.J.6
Croz, J.D.7
Greenbaum, A.8
Hammarling, S.9
McKenney, A.10
Sorensen, D.C.11
-
4
-
-
0003615167
-
-
Society for Industrial and Applied Mathematics, Philadelphia
-
L. S. Blackford, J. Choi, A. Cleary, E. F. D'Azevedo, J. W. Demmel, I. S. Dhillon, J. J. Dongarra, S. Hammarling, G. Henry, A. Petitet, K. Stanley, D. W. Walker, and R. C. Whaley. ScaLAPACK Users' Guide. Society for Industrial and Applied Mathematics, Philadelphia, 1997.
-
(1997)
ScaLAPACK Users' Guide
-
-
Blackford, L.S.1
Choi, J.2
Cleary, A.3
D'Azevedo, E.F.4
Demmel, J.W.5
Dhillon, I.S.6
Dongarra, J.J.7
Hammarling, S.8
Henry, G.9
Petitet, A.10
Stanley, K.11
Walker, D.W.12
Whaley, R.C.13
-
5
-
-
83455220868
-
Flexible development of dense linear algebra algorithms on massively parallel architectures with DPLASMA
-
page to appear, Anchorage, AK, May
-
G. Bosilca, A. Bouteiller, A. Danalis, M. Faverge, A. Haidar, T. Herault, J. b Kurzak, J. Langou, P. Lemarinier, H. Ltaief, P. Luszczek, A. YarKhan, and J. Dongarra. Flexible development of dense linear algebra algorithms on massively parallel architectures with DPLASMA. In 12th IEEE International Workshop on Parallel and Distributed Scientific and Engineering Computing (PDSEC-11), page to appear, Anchorage, AK, May 2011.
-
(2011)
12th IEEE International Workshop on Parallel and Distributed Scientific and Engineering Computing (PDSEC-11)
-
-
Bosilca, G.1
Bouteiller, A.2
Danalis, A.3
Faverge, M.4
Haidar, A.5
Herault, T.6
B Kurzak, J.7
Langou, J.8
Lemarinier, P.9
Ltaief, H.10
Luszczek, P.11
YarKhan, A.12
Dongarra, J.13
-
6
-
-
84857684601
-
Distributed-memory task execution and dependence tracking within DAGuE and the DPLASMA project
-
Technical Report 232, Sept.
-
G. Bosilca, A. Bouteiller, A. Danalis, M. Faverge, H. Haidar, T. Herault, J. Kurzak, J. Langou, P. Le marinier, H. Ltaief, P. Luszczek, A. YarKhan, and J. Dongarra. Distributed-memory task execution and dependence tracking within DAGuE and the DPLASMA project. Technical Report 232, LAPACK Working Note, Sept. 2010.
-
(2010)
LAPACK Working Note
-
-
Bosilca, G.1
Bouteiller, A.2
Danalis, A.3
Faverge, M.4
Haidar, H.5
Herault, T.6
Kurzak, J.7
Langou, J.8
Le Marinier, P.9
Ltaief, H.10
Luszczek, P.11
YarKhan, A.12
Dongarra, J.13
-
7
-
-
80053226363
-
DAGuE: A generic distributed DAG engine for high performance computing
-
Technical Report 231, Sept.
-
G. Bosilca, A. Bouteiller, A. Danalis, T. Herault, P. Lemarinier, and J. Dongarra. DAGuE: A generic distributed DAG engine for high performance computing. Technical Report 231, LAPACK Working Note, Sept. 2010.
-
(2010)
LAPACK Working Note
-
-
Bosilca, G.1
Bouteiller, A.2
Danalis, A.3
Herault, T.4
Lemarinier, P.5
Dongarra, J.6
-
8
-
-
83455173186
-
DAGuE: A generic distributed DAG engine for high performance computing
-
to appear
-
G. Bosilca, A. Bouteiller, A. Danalis, T. Herault, P. Lemarinier, and J. Dongarra. DAGuE: A generic distributed DAG engine for high performance computing. In Proceedings of the 16th International Workshop on High-Level Parallel Programming Models and Supportive Environments (HIPS'11), Anchorage, AL, USA, May, 20 2011. to appear.
-
Proceedings of the 16th International Workshop on High-Level Parallel Programming Models and Supportive Environments (HIPS'11), Anchorage, AL, USA, May, 20 2011
-
-
Bosilca, G.1
Bouteiller, A.2
Danalis, A.3
Herault, T.4
Lemarinier, P.5
Dongarra, J.6
-
9
-
-
83455173186
-
DAGuE: A generic distributed dag engine for high performance computing
-
page to appear, Anchorage, AK, May
-
G. Bosilca, A. Bouteiller, A. Danalis, T. Herault, P. Lemarinier, and J. Dongarra. DAGuE: A generic distributed dag engine for high performance computing. In 16th International Workshop on High-Level Parallel Programming Models and Supportive Environments (HIPS-11), page to appear, Anchorage, AK, May 2011.
-
(2011)
16th International Workshop on High-Level Parallel Programming Models and Supportive Environments (HIPS-11)
-
-
Bosilca, G.1
Bouteiller, A.2
Danalis, A.3
Herault, T.4
Lemarinier, P.5
Dongarra, J.6
-
10
-
-
0031633853
-
Multistage linear DS-CDMA receivers
-
Sept.
-
C. Boulanger and L. Ouvry. Multistage linear DS-CDMA receivers. In Proc. IEEE ISSSTA 98, volume 2, pages 663-667, Sept. 1998.
-
(1998)
Proc. IEEE ISSSTA 98
, vol.2
, pp. 663-667
-
-
Boulanger, C.1
Ouvry, L.2
-
11
-
-
84857684600
-
A critical path approach to analyzing parallelism of algorithmic variants. Application to Cholesky inversion
-
Technical Report arXiv:1010.2000v1 [cs.DC], arXiv online archive, Oct 11 Submitted to Available at arxiv.org/pdf/1010.2000
-
H. Bouwmeester and J. Langou. A critical path approach to analyzing parallelism of algorithmic variants. Application to Cholesky inversion. Technical Report arXiv:1010.2000v1 [cs.DC], arXiv online archive, Oct 11 2010. Submitted to Parallel Computing. Available at arxiv.org/pdf/1010.2000.
-
(2010)
Parallel Computing
-
-
Bouwmeester, H.1
Langou, J.2
-
12
-
-
38249038136
-
Solving the algebraic Riccati equation with the matrix sign function
-
R. Byers. Solving the algebraic Riccati equation with the matrix sign function. Linear Algebra and Appl., 85:267-279, 1987.
-
(1987)
Linear Algebra and Appl.
, vol.85
, pp. 267-279
-
-
Byers, R.1
-
14
-
-
0030564728
-
ScaLAPACK: A portable linear algebra library for distributed memory computers - Design issues and performance
-
PII S0010465596000173
-
J. Choi, J. Demmel, I. Dhillon, J. Dongarra, Ostrouchov, S., A. Petitet, K. Stanley, D. Walker, and R. C. Whaley. ScaLAPACK, a portable linear algebra library for distributed memory computers-design issues and performance. Computer Physics Communications, 97(1-2):1-15, 1996. (Pubitemid 126387751)
-
(1996)
Computer Physics Communications
, vol.97
, Issue.1-2
, pp. 1-15
-
-
Choi, J.1
Demmel, J.2
Dhillon, I.3
Dongarra, J.4
Ostrouchov, S.5
Petitet, A.6
Stanley, K.7
Walker, D.8
Whaley, R.C.9
-
15
-
-
0012087153
-
Stability methods for matrix inversion
-
January
-
J. J. D. Croz and N. J. Higham. Stability methods for matrix inversion. IMA J. Numer. Anal., 12, January 1992.
-
(1992)
IMA J. Numer. Anal.
, pp. 12
-
-
Croz, J.J.D.1
Higham, N.J.2
-
16
-
-
0025401417
-
Set of Level 3 Basic Linear Algebra Subprograms. Model implementation and test programs
-
DOI 10.1145/77626.77627
-
J. Dongarra, J. Du Croz, I. Duff, and S. Hammarling. Algorithm 679: A set of Level 3 Basic Linear Algebra Subprograms. ACM Trans. Math. Soft., 16(1):18-28, March 1990. (Pubitemid 20684795)
-
(1990)
ACM Transactions on Mathematical Software
, vol.16
, Issue.1
, pp. 18-28
-
-
Dongarra, J.J.1
Du, C.J.2
Hammarling, S.3
Duff, I.4
-
18
-
-
0025402476
-
Set of Level 3 Basic Linear Algebra Subprograms
-
DOI 10.1145/77626.79170
-
J. J. Dongarra, I. S. Duff, J. D. Croz, and S. Hammarling. A set of level 3 basic linear algebra subprograms. ACM TOMS, 16(1):1-17, March 1990. (Pubitemid 20684794)
-
(1990)
ACM Transactions on Mathematical Software
, vol.16
, Issue.1
, pp. 1-17
-
-
Dongarra, J.J.1
Croz, J.D.2
Hammarling, S.3
Duff, I.4
-
19
-
-
70350509502
-
Measurement matrix partitioning theorem
-
Dec.
-
S. L. Fagin. Measurement matrix partitioning theorem. IEEE Trans. Autom. Control, AC-14(6):773-774, Dec. 1969.
-
(1969)
IEEE Trans. Autom. Control
, vol.AC-14
, Issue.6
, pp. 773-774
-
-
Fagin, S.L.1
-
20
-
-
0003495930
-
-
Prentice-Hall, Englewood Cliffs, New Jersey
-
G. E. Forsythe, M. A. Malcolm, and C. B. Moler. Computer Methods for Mathematical Computations. Prentice-Hall, Englewood Cliffs, New Jersey, 1977.
-
(1977)
Computer Methods for Mathematical Computations
-
-
Forsythe, G.E.1
Malcolm, M.A.2
Moler, C.B.3
-
22
-
-
77950629423
-
Powerpack: Energy profiling and analysis of high-performance systems and applications
-
May
-
R. Ge, X. Feng, S. Song, H.-C. Chang, D. Li, and K. W. Cameron. Powerpack: Energy profiling and analysis of high-performance systems and applications. IEEE Transactions on Parallel and Distributed Systems, PDS-21(5):658-671, May 2010.
-
(2010)
IEEE Transactions on Parallel and Distributed Systems
, vol.PDS-21
, Issue.5
, pp. 658-671
-
-
Ge, R.1
Feng, X.2
Song, S.3
Chang, H.-C.4
Li, D.5
Cameron, K.W.6
-
23
-
-
84857677215
-
Hardware Locality: Peering under the hood of your server
-
July
-
B. Goglin, J. Squyres, and S. Thibault. Hardware Locality: Peering under the hood of your server. Linux Pro Magazine, 128:28-33, July 2011.
-
(2011)
Linux Pro Magazine
, vol.128
, pp. 28-33
-
-
Goglin, B.1
Squyres, J.2
Thibault, S.3
-
24
-
-
0004236492
-
-
John Hopkins Studies in the Mathematical Sciences. Johns Hopkins University Press, Baltimore, Maryland, third edition
-
G. H. Golub and C. F. Van Loan. Matrix Computation. John Hopkins Studies in the Mathematical Sciences. Johns Hopkins University Press, Baltimore, Maryland, third edition, 1996.
-
(1996)
Matrix Computation
-
-
Golub, G.H.1
Van Loan, C.F.2
-
25
-
-
84857684603
-
Analysis of Dynamically Scheduled Tile Algorithms for Dense Linear Algebra on Multicore Architectures
-
ICL Technical Report UT-CS-11-666, LAPACK working note #243, Submitted to
-
A. Haidar, H. Ltaief, A. YarKhan, and J. J. Dongarra. Analysis of Dynamically Scheduled Tile Algorithms for Dense Linear Algebra on Multicore Architectures. ICL Technical Report UT-CS-11-666, LAPACK working note #243, Submitted to Concurrency and Computations, 2010.
-
(2010)
Concurrency and Computations
-
-
Haidar, A.1
Ltaief, H.2
YarKhan, A.3
Dongarra, J.J.4
-
26
-
-
0001692403
-
Computing the polar decomposition - With applications
-
N. J. Higham. Computing the polar decomposition - with applications. SIAM J. Sci. Stat. Comput., 7:1160-1174, 1986.
-
(1986)
SIAM J. Sci. Stat. Comput.
, vol.7
, pp. 1160-1174
-
-
Higham, N.J.1
-
27
-
-
84947688547
-
Applications of Jordan's procedure for matrix inversion in multiple regression and multivariate distance analysis
-
G. H. Jowett. Applications of Jordan's procedure for matrix inversion in multiple regression and multivariate distance analysis. Journal of the Royal Statistical Society. Series B (Methodological), 25(2):352-357, 1963.
-
(1963)
Journal of the Royal Statistical Society. Series B (Methodological)
, vol.25
, Issue.2
, pp. 352-357
-
-
Jowett, G.H.1
-
28
-
-
0032490773
-
Simplified polynomial-expansion linear detectors for DS-CDMA systems
-
Z. Lei and T. Lim. Simplified polynomial-expansion linear detectors for DS-CDMA receivers. IEEE Elec. Lett., 34(16):1561-1563, 1998. (Pubitemid 128610604)
-
(1998)
Electronics Letters
, vol.34
, Issue.16
, pp. 1561-1563
-
-
Lei, Z.D.1
Lim, T.J.2
-
29
-
-
84857670375
-
Profiling high performance dense linear algebra algorithms on multicore architectures for power and energy efficiency
-
H. Ltaief, P. Luszczek, and J. Dongarra. Profiling high performance dense linear algebra algorithms on multicore architectures for power and energy efficiency. In EnA-HPC 2011: International Conference on Energy-Aware High Performance Computing, Hamburg, Germany, September 07-09 2011.
-
EnA-HPC 2011: International Conference on Energy-Aware High Performance Computing, Hamburg, Germany, September 07-09 2011
-
-
Ltaief, H.1
Luszczek, P.2
Dongarra, J.3
-
31
-
-
84857662501
-
-
Math Kernel Library (MKL).
-
Intel, Math Kernel Library (MKL). http://www.intel.com/software/products/ mkl/.
-
-
-
-
32
-
-
0029733667
-
Multistage linear receivers for DS-CDMA systems
-
S. Moshavi, E. Kanterakis, and D. Schilling. Multistage linear receivers for DS-CDMA systems. Int. J. Wireless Inf. Networks, 3(1):1-17, Jan. 1996. (Pubitemid 126711392)
-
(1996)
International Journal of Wireless Information Networks
, vol.3
, Issue.1
, pp. 1-17
-
-
Moshavi, S.1
Kanterakis, E.G.2
Schilling, D.L.3
-
33
-
-
39549085762
-
Suboptimum search algorithm in conjunction with polynomial expanded multiuser detection for uplink
-
R. T. M. Mozaffaripour. Suboptimum search algorithm in conjunction with polynomial expanded multiuser detection for uplink. Wireless Personnal Comm., 24(1):1-9, 2003.
-
(2003)
Wireless Personnal Comm.
, vol.24
, Issue.1
, pp. 1-9
-
-
Mozaffaripour, R.T.M.1
-
35
-
-
0022026625
-
Analysis of pairwise pivoting in Gaussian elimination
-
DOI: 10.1109/TC.1985.1676570
-
D. C. Sorensen. Analysis of pairwise pivoting in Gaussian elimination. IEEE Transactions on Computers, C-34(3):274-278, 1985. http://dx.doi.org/10. 1109/ TC.1985.1676570DOI: 10.1109/TC.1985.1676570.
-
(1985)
IEEE Transactions on Computers
, vol.C-34
, Issue.3
, pp. 274-278
-
-
Sorensen, D.C.1
-
36
-
-
0342444575
-
Matrix-inversion method: Applications to Möbius inversion deconvolution
-
December
-
Q. Xie and N. xian Chen. Matrix-inversion method: Applications to Möbius inversion deconvolution. Physical Review E, 52(6):6055-6065, December 1995.
-
(1995)
Physical Review E
, vol.52
, Issue.6
, pp. 6055-6065
-
-
Xie, Q.1
Xian Chen, N.2
|