-
6
-
-
38049058008
-
The impact of multicore on math software
-
Lecture Notes in Computer Science Springer
-
A. Buttari, J. Dongarra, J. Kurzak, J. Langou, P. Luszczek, and S. Tomov The impact of multicore on math software Applied Parallel Computing. State of the Art in Scientific Computing, 8th International Workshop, PARA Lecture Notes in Computer Science vol. 4699 2006 Springer 1 10
-
(2006)
Applied Parallel Computing. State of the Art in Scientific Computing, 8th International Workshop, PARA
, vol.4699
, pp. 1-10
-
-
Buttari, A.1
Dongarra, J.2
Kurzak, J.3
Langou, J.4
Luszczek, P.5
Tomov, S.6
-
7
-
-
67650056933
-
Supermatrix: A multithreaded runtime scheduling system for algorithms-by-blocks
-
ACM
-
E. Chan, F.G. Van Zee, P. Bientinesi, E.S. Quintana-Ortí, G. Quintana-Ortí, and R. van de Geijn Supermatrix: a multithreaded runtime scheduling system for algorithms-by-blocks PPoPP '08: Proceedings of the 13th ACM SIGPLAN Symposium on Principles and practice of parallel programming 2008 ACM 123 132
-
(2008)
PPoPP '08: Proceedings of the 13th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming
, pp. 123-132
-
-
Chan, E.1
Van Zee, F.G.2
Bientinesi, P.3
Quintana-Ortí, E.S.4
Quintana-Ortí, G.5
Van De Geijn, R.6
-
8
-
-
77953997924
-
Numerical linear algebra on emerging architectures: The PLASMA and MAGMA projects
-
E. Agullo, J. Demmel, J. Dongarra, B. Hadri, J. Kurzak, J. Langou, H. Ltaief, P. Luszczek, S. Tomov, Numerical linear algebra on emerging architectures: The PLASMA and MAGMA projects, Journal of Physics: Conference Series 180.
-
Journal of Physics: Conference Series
, vol.180
-
-
Agullo, E.1
Demmel, J.2
Dongarra, J.3
Hadri, B.4
Kurzak, J.5
Langou, J.6
Ltaief, H.7
Luszczek, P.8
Tomov, S.9
-
10
-
-
78651103346
-
StarPU: A unified platform for task scheduling on heterogeneous multicore architectures
-
C. Augonnet, S. Thibault, R. Namyst, and P.-A. Wacrenier StarPU: a unified platform for task scheduling on heterogeneous multicore architectures Concurrency and Computation: Practice and Experience 23 2 2011 187 198
-
(2011)
Concurrency and Computation: Practice and Experience
, vol.23
, Issue.2
, pp. 187-198
-
-
Augonnet, C.1
Thibault, S.2
Namyst, R.3
Wacrenier, P.-A.4
-
11
-
-
57949083229
-
A dependency-aware task-based programming environment for multi-core architectures
-
J. Perez, R. Badia, J. Labarta, A dependency-aware task-based programming environment for multi-core architectures, in: IEEE International Conference on Cluster Computing, 2008, pp. 142-151.
-
(2008)
IEEE International Conference on Cluster Computing
, pp. 142-151
-
-
Perez, J.1
Badia, R.2
Labarta, J.3
-
12
-
-
74049102092
-
Dynamic task scheduling for linear algebra algorithms on distributed-memory multicore systems
-
ACM New York, NY, USA
-
F. Song, A. YarKhan, and J. Dongarra Dynamic task scheduling for linear algebra algorithms on distributed-memory multicore systems SC '09: Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis 2009 ACM New York, NY, USA 1 11
-
(2009)
SC '09: Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis
, pp. 1-11
-
-
Song, F.1
Yarkhan, A.2
Dongarra, J.3
-
13
-
-
84655160745
-
StarPU: A Unified Platform for Task Scheduling on Heterogeneous Multicore Architectures
-
LNCS, Delft, The Netherlands
-
C. Augonnet, S. Thibault, R. Namyst, P.-A. Wacrenier, StarPU: A Unified Platform for Task Scheduling on Heterogeneous Multicore Architectures, in: Euro-Par 2009 Proceedings, LNCS, Delft, The Netherlands, 2009.
-
(2009)
Euro-Par 2009 Proceedings
-
-
Augonnet, C.1
Thibault, S.2
Namyst, R.3
Wacrenier, P.-A.4
-
14
-
-
0035266229
-
Automatic parallelization techniques based on compact DAG extraction and symbolic scheduling
-
M. Cosnard, and E. Jeannot Automatic parallelization techniques based on compact DAG extraction and symbolic scheduling Parallel Processing Letters 11 2001 151 168 (Pubitemid 32697656)
-
(2001)
Parallel Processing Letters
, vol.11
, Issue.1
, pp. 151-168
-
-
Cosnard, M.1
Jeannot, E.2
-
15
-
-
10844269765
-
Compact DAG representation and its symbolic scheduling
-
DOI 10.1016/j.jpdc.2004.05.001
-
M. Cosnard, E. Jeannot, and T. Yang Compact DAG representation and its symbolic scheduling Journal of Parallel and Distributed Computing 64 8 2004 921 935 (Pubitemid 40000764)
-
(2004)
Journal of Parallel and Distributed Computing
, vol.64
, Issue.8
, pp. 921-935
-
-
Cosnard, M.1
Jeannot, E.2
Yang, T.3
-
16
-
-
83455228041
-
Automatic multithreaded parallel program generation for message passing multiprocessors using parameterized task graphs
-
E. Jeannot, Automatic multithreaded parallel program generation for message passing multiprocessors using parameterized task graphs, in: International Conference 'Parallel Computing 2001' (ParCo2001), 2001.
-
(2001)
International Conference 'Parallel Computing 2001' (ParCo2001)
-
-
Jeannot, E.1
-
17
-
-
56749169455
-
Multi-threading and one-sided communication in parallel LU factorization
-
B. Verastegui, ACM Press
-
P. Husbands, and K.A. Yelick Multi-threading and one-sided communication in parallel LU factorization B. Verastegui, Proceedings of the ACM/IEEE Conference on High Performance Networking and Computing, SC 2007, November 10-16, 2007, Reno, Nevada, USA 2007 ACM Press
-
(2007)
Proceedings of the ACM/IEEE Conference on High Performance Networking and Computing, SC 2007, November 10-16, 2007, Reno, Nevada, USA
-
-
Husbands, P.1
Yelick, K.A.2
-
19
-
-
0026278958
-
The omega test: A fast and practical integer programming algorithm for dependence analysis
-
New York, NY, USA
-
W. Pugh, The omega test: a fast and practical integer programming algorithm for dependence analysis, in: Supercomputing '91: Proceedings of the 1991 ACM/IEEE Conference on Supercomputing, New York, NY, USA, 1991, pp. 4-13.
-
(1991)
Supercomputing '91: Proceedings of the 1991 ACM/IEEE Conference on Supercomputing
, pp. 4-13
-
-
Pugh, W.1
-
20
-
-
0033645154
-
The data locality of work stealing
-
U.A. Acar, G.E. Blelloch, R.D. Blumofe, The data locality of work stealing, in: SPAA'00, 2000, pp. 1-12.
-
(2000)
SPAA'00
, pp. 1-12
-
-
Acar, U.A.1
Blelloch, G.E.2
Blumofe, R.D.3
-
21
-
-
77952719747
-
Hwloc: A Generic Framework for Managing Hardware Affinities in HPC Applications
-
IEEE (Ed.), Pisa, Italy
-
F. Broquedis, J. Clet Ortega, S. Moreaud, N. Furmento, B. Goglin, G. Mercier, S. Thibault, R. Namyst, hwloc: a Generic Framework for Managing Hardware Affinities in HPC Applications, in: IEEE (Ed.), PDP 2010 - The 18th Euromicro International Conference on Parallel, Distributed and Network-Based Computing, Pisa, Italy, 2010.
-
(2010)
PDP 2010 - The 18th Euromicro International Conference on Parallel, Distributed and Network-Based Computing
-
-
Broquedis, F.1
Clet Ortega, J.2
Moreaud, S.3
Furmento, N.4
Goglin, B.5
Mercier, G.6
Thibault, S.7
Namyst, R.8
-
22
-
-
38049054439
-
Minimal data copy for dense linear algebra factorization
-
LNCS, Umeå, Sweden
-
F.G. Gustavson, J.A. Gunnels, and J.C. Sexton Minimal data copy for dense linear algebra factorization Applied Parallel Computing, State of the Art in Scientific Computing, 8th International Workshop, PARA 2006 vol. 4699 2006 LNCS, Umeå, Sweden 540 549
-
(2006)
Applied Parallel Computing, State of the Art in Scientific Computing, 8th International Workshop, PARA 2006
, vol.4699
, pp. 540-549
-
-
Gustavson, F.G.1
Gunnels, J.A.2
Sexton, J.C.3
-
23
-
-
0003454919
-
-
Philadelphia, PA, USA
-
G.W. Stewart, Matrix algorithms, Society for Industrial and Applied Mathematics, Philadelphia, PA, USA, 2001.
-
(2001)
Matrix Algorithms, Society for Industrial and Applied Mathematics
-
-
Stewart, G.W.1
-
24
-
-
58149269099
-
A class of parallel tiled linear algebra algorithms for multicore architectures
-
A. Buttari, J. Langou, J. Kurzak, and J. Dongarra A class of parallel tiled linear algebra algorithms for multicore architectures Parallel Computing 35 1 2009 38 53
-
(2009)
Parallel Computing
, vol.35
, Issue.1
, pp. 38-53
-
-
Buttari, A.1
Langou, J.2
Kurzak, J.3
Dongarra, J.4
-
25
-
-
50249105132
-
Parallel tiled QR factorization for multicore architectures
-
A. Buttari, J. Langou, J. Kurzak, and J.J. Dongarra Parallel tiled QR factorization for multicore architectures Concurrency and Computation: Practice and Experience 20 13 2008 1573 1590
-
(2008)
Concurrency and Computation: Practice and Experience
, vol.20
, Issue.13
, pp. 1573-1590
-
-
Buttari, A.1
Langou, J.2
Kurzak, J.3
Dongarra, J.J.4
-
26
-
-
0003078924
-
A storage-efficient WY representation for products of Householder transformations
-
R. Schreiber, and C. van Loan A storage-efficient WY representation for products of Householder transformations SIAM J. Sci. Stat. Comput. 10 1991 53 57
-
(1991)
SIAM J. Sci. Stat. Comput.
, vol.10
, pp. 53-57
-
-
Schreiber, R.1
Van Loan, C.2
-
28
-
-
33750205459
-
Grid'5000: A large scale and highly reconfigurable experimental Grid testbed
-
DOI 10.1177/1094342006070078
-
R. Bolze, F. Cappello, E. Caron, M.J. Daydé, F. Desprez, E. Jeannot, Y. Jégou, S. Lanteri, J. Leduc, N. Melab, G. Mornet, R. Namyst, P. Primet, B. Quétier, O. Richard, E.-G. Talbi, and I. Touche Grid'5000: a large scale and highly reconfigurable experimental grid testbed IJHPCA 20 4 2006 481 494 (Pubitemid 44605333)
-
(2006)
International Journal of High Performance Computing Applications
, vol.20
, Issue.4
, pp. 481-494
-
-
Bolze, R.1
Cappello, F.2
Caron, E.3
Dayde, M.4
Desprez, F.5
Jeannot, E.6
Jegou, Y.7
Lanteri, S.8
Leduc, J.9
Melab, N.10
Mornet, G.11
Namyst, R.12
Primet, P.13
Quetier, B.14
Richard, O.15
Talbi, E.-G.16
Touche, I.17
-
29
-
-
27144559253
-
ScaLAPACK: A linear algebra library for message-passing computers
-
SIAM
-
L.S. Blackford, J. Choi, A.J. Cleary, E.F. D'Azevedo, J. Demmel, I.S. Dhillon, J. Dongarra, S. Hammarling, G. Henry, A. Petitet, K. Stanley, D.W. Walker, and R.C. Whaley ScaLAPACK: a linear algebra library for message-passing computers Proceedings of the Eighth SIAM Conference on Parallel Processing for Scientific Computing 1997 SIAM
-
(1997)
Proceedings of the Eighth SIAM Conference on Parallel Processing for Scientific Computing
-
-
Blackford, L.S.1
Choi, J.2
Cleary, A.J.3
D'Azevedo, E.F.4
Demmel, J.5
Dhillon, I.S.6
Dongarra, J.7
Hammarling, S.8
Henry, G.9
Petitet, A.10
Stanley, K.11
Walker, D.W.12
Whaley, R.C.13
-
30
-
-
0042674307
-
The LINPACK benchmark: Past, present and future
-
J.J. Dongarra, P. Luszczek, and A. Petitet The LINPACK benchmark: past, present and future Concurrency and Computation: Practice and Experience 15 9 2003 803 820
-
(2003)
Concurrency and Computation: Practice and Experience
, vol.15
, Issue.9
, pp. 803-820
-
-
Dongarra, J.J.1
Luszczek, P.2
Petitet, A.3
-
31
-
-
0040617241
-
ScaLAPACK: A Portable Linear Algebra Library for Distributed Memory Computers - Design Issues and Performance
-
Applied Parallel Computing: Computations in Physics, Chemistry and Engineering Science
-
J. Choi, J. Demmel, I.S. Dhillon, J. Dongarra, S. Ostrouchov, A. Petitet, K. Stanley, D.W. Walker, and R.C. Whaley ScaLAPACK: a portable linear algebra library for distributed memory computers - design issues and performance J. Dongarra, K. Madsen, J. Wasniewski, Applied Parallel Computing, Computations in Physics, Chemistry and Engineering Science, Second International Workshop, PARA '95, Lyngby, Denmark, August 21-24, 1995, Proceedings Lecture Notes in Computer Science vol. 1041 1995 Springer 95 106 (Pubitemid 126043350)
-
(1996)
Lecture Notes in Computer Science
, vol.1041
, pp. 95-106
-
-
Choi, J.1
Demmel, J.2
Dhillon, I.3
Dongarra, J.4
Ostrouchov, S.5
Petitet, A.6
Stanley, K.7
Walker, D.8
Whaley, R.C.9
-
33
-
-
78649256719
-
The international exascale software project roadmap
-
J. Dongarra, P. Beckman, et al., The international exascale software project roadmap, Tech. rep., IESP, 2011, http://www.exascale.org/iesp.
-
(2011)
Tech. Rep., IESP
-
-
Dongarra, J.1
Beckman, P.2