-
1
-
-
0037834788
-
OpenMP issues arising in the development of parallel BLAS and LAPACK libraries
-
C. Addison, Y. Ren, and M. van Waveren. OpenMP issues arising in the development of parallel BLAS and LAPACK libraries. Scientific Programming, 11(2), 2003.
-
(2003)
Scientific Programming
, vol.11
, Issue.2
-
-
Addison, C.1
Ren, Y.2
Van Waveren, M.3
-
2
-
-
0024891893
-
Vector and parallel algorithms for Cholesky factorization on IBM 3090
-
New York, NY, USA
-
R. C. Agarwal and F. G. Gustavson. Vector and parallel algorithms for Cholesky factorization on IBM 3090. In SC '89: Proceedings of the 1989 ACM/IEEE Conference on Supercomputing, pages 225-233, New York, NY, USA, 1989.
-
(1989)
SC '89: Proceedings of the 1989 ACM/IEEE Conference on Supercomputing
, pp. 225-233
-
-
Agarwal, R.C.1
Gustavson, F.G.2
-
3
-
-
0003706460
-
-
Society for Industrial and Applied Mathematics, Philadelphia, PA, USA
-
E. Anderson, Z. Bai, C. Bischof, L. S. Blackford, J. Demmel, Jack J. Dongarra, J. Du Croz, S. Hammarling, A. Greenbaum, A. McKenney, and D. Sorensen. LAPACK Users' guide (third ed.). Society for Industrial and Applied Mathematics, Philadelphia, PA, USA, 1999.
-
(1999)
LAPACK Users' Guide (Third Ed.)
-
-
Anderson, E.1
Bai, Z.2
Bischof, C.3
Blackford, L.S.4
Demmel, J.5
Dongarra, J.J.6
Du Croz, J.7
Hammarling, S.8
Greenbaum, A.9
McKenney, A.10
Sorensen, D.11
-
4
-
-
34548265764
-
CellSs: A programming model for the Cell BE architecture
-
Tampa, FL, USA, November
-
Pieter Bellens, Josep M. Perez, Rosa M. Badia, and Jesus Labarta. CellSs: A programming model for the Cell BE architecture. In SC '06: Proceedings of the 2006 ACM/IEEE Conference on Supercomputing, pages 5-15, Tampa, FL, USA, November 2006.
-
(2006)
SC '06: Proceedings of the 2006 ACM/IEEE Conference on Supercomputing
, pp. 5-15
-
-
Bellens, P.1
Perez, J.M.2
Badia, R.M.3
Labarta, J.4
-
6
-
-
17644412337
-
The science of deriving dense linear algebra algorithms
-
March
-
Paolo Bientinesi, John A. Gunnels, Margaret E. Myers, Enrique S. Quintana-Orti, and Robert A. van de Geijn. The science of deriving dense linear algebra algorithms. ACM Transactions on Mathematical Software, 31(1):1-26, March 2005.
-
(2005)
ACM Transactions on Mathematical Software
, vol.31
, Issue.1
, pp. 1-26
-
-
Bientinesi, P.1
Gunnels, J.A.2
Myers, M.E.3
Quintana-Orti, E.S.4
Van De Geijn, R.A.5
-
8
-
-
17644370328
-
Representing linear algebra algorithms in code: The FLAME application programming interfaces
-
March
-
Paolo Bientinesi, Enrique S. Quintana-Orti, and Robert A. van de Geijn. Representing linear algebra algorithms in code: The FLAME application programming interfaces. ACM Transactions on Mathematical Software, 31(1):27-59, March 2005.
-
(2005)
ACM Transactions on Mathematical Software
, vol.31
, Issue.1
, pp. 27-59
-
-
Bientinesi, P.1
Quintana-Orti, E.S.2
Van De Geijn, R.A.3
-
9
-
-
35248843628
-
SuperMatrix out-of-order scheduling of matrix operations for SMP and multi-core architectures
-
San Diego, CA, USA, June
-
Ernie Chan, Enrique S. Quintana-Orti, Gregorio Quintana-Orti, and Robert van de Geijn. SuperMatrix out-of-order scheduling of matrix operations for SMP and multi-core architectures. In SPAA '07: Proceedings of the Nineteenth Annual ACM Symposium on Parallelism in Algorithms and Architectures, pages 116-125, San Diego, CA, USA, June 2007.
-
(2007)
SPAA '07: Proceedings of the Nineteenth Annual ACM Symposium on Parallelism in Algorithms and Architectures
, pp. 116-125
-
-
Chan, E.1
Quintana-Orti, E.S.2
Quintana-Orti, G.3
Van De Geijn, R.4
-
10
-
-
51049099053
-
Satisfying your dependencies with SuperMatrix
-
Austin, TX, USA, September
-
Ernie Chan, Field G. Van Zee, Enrique S. Quintana-Ortí, Gregorio Quintana-Orti, and Robert van de Geijn. Satisfying your dependencies with SuperMatrix. In Proceedings of the 2007 IEEE International Conference on Cluster Computing, pages 91-99, Austin, TX, USA, September 2007.
-
(2007)
Proceedings of the 2007 IEEE International Conference on Cluster Computing
, pp. 91-99
-
-
Chan, E.1
Van Zee, F.G.2
Quintana-Ortí, E.S.3
Quintana-Orti, G.4
Van De Geijn, R.5
-
11
-
-
0036870763
-
Recursive array layouts and fast matrix multiplication
-
S. Chatterjee, A. R. Lebeck, P. K. Patnala, and M. Thottethodi. Recursive array layouts and fast matrix multiplication. IEEE Transactions on Parallel and Distributed Systems, 13(11):1105-1123, 2002.
-
(2002)
IEEE Transactions on Parallel and Distributed Systems
, vol.13
, Issue.11
, pp. 1105-1123
-
-
Chatterjee, S.1
Lebeck, A.R.2
Patnala, P.K.3
Thottethodi, M.4
-
12
-
-
0002924772
-
ScaLAPACK: A scalable linear algebra library for distributed memory concurrent computers
-
IEEE Computer Society Press
-
J. Choi, J. J. Dongarra, R. Pozo, and D. W. Walker. ScaLAPACK: A scalable linear algebra library for distributed memory concurrent computers. In Proceedings of the Fourth Symposium on the Frontiers of Massively Parallel Computation, pages 120-127. IEEE Computer Society Press, 1992.
-
(1992)
Proceedings of the Fourth Symposium on the Frontiers of Massively Parallel Computation
, pp. 120-127
-
-
Choi, J.1
Dongarra, J.J.2
Pozo, R.3
Walker, D.W.4
-
15
-
-
0025402476
-
A set of level 3 Basic Linear Algebra Subprograms
-
March
-
Jack J. Dongarra, Jeremy Du Croz, Sven Hammarling, and Iain Duff. A set of level 3 Basic Linear Algebra Subprograms. ACM Transactions on Mathematical Software, 16(1):1-17, March 1990.
-
(1990)
ACM Transactions on Mathematical Software
, vol.16
, Issue.1
, pp. 1-17
-
-
Dongarra, J.J.1
Du Croz, J.2
Hammarling, S.3
Duff, I.4
-
16
-
-
1842832833
-
Recursive blocked algorithms and hybrid data structures for dense matrix library software
-
Erik Elmroth, Fred Gustavson, Isak Jonsson, and Bo Kagstrom. Recursive blocked algorithms and hybrid data structures for dense matrix library software. SIAM Review, 46(1):3-45, 2004.
-
(2004)
SIAM Review
, vol.46
, Issue.1
, pp. 3-45
-
-
Elmroth, E.1
Gustavson, F.2
Jonsson, I.3
Kagstrom, B.4
-
17
-
-
79959386800
-
-
Kazushige Goto. http://www.tacc.utexas.edu/resources/software.
-
-
-
-
19
-
-
0039435412
-
FLAME: Formal linear algebra methods environment
-
December
-
John A. Gunnels, Fred G. Gustavson, Greg M. Henry, and Robert A. van de Geijn. FLAME: Formal linear algebra methods environment. ACM Transactions on Mathematical Software, 27(4):422-455, December 2001.
-
(2001)
ACM Transactions on Mathematical Software
, vol.27
, Issue.4
, pp. 422-455
-
-
Gunnels, J.A.1
Gustavson, F.G.2
Henry, G.M.3
Van De Geijn, R.A.4
-
21
-
-
35248867212
-
BLAS based on block data structures
-
Cornell University, February
-
Greg Henry. BLAS based on block data structures. Theory Center Technical Report CTC92TR89, Cornell University, February 1992.
-
(1992)
Theory Center Technical Report CTC92TR89
-
-
Henry, G.1
-
23
-
-
25844503119
-
Introduction to the Cell multiprocessor
-
September
-
James Kahle, Michael Day, Peter Hofstee, Charles Johns, Theodore Maeurer, and David Shippy. Introduction to the Cell multiprocessor. IBM Journal of Research and Development, 49(4/5):589-604, September 2005.
-
(2005)
IBM Journal of Research and Development
, vol.49
, Issue.4-5
, pp. 589-604
-
-
Kahle, J.1
Day, M.2
Hofstee, P.3
Johns, C.4
Maeurer, T.5
Shippy, D.6
-
24
-
-
50249166476
-
Solving systems of linear equations on the Cell processor using Cholesky factorization
-
Innovative Computing Laboratory, University of Tennessee, April
-
Jakub Kurzak, Alfredo Buttari, and Jack Dongarra. Solving systems of linear equations on the Cell processor using Cholesky factorization. Technical Report UT-CS-07-596, Innovative Computing Laboratory, University of Tennessee, April 2007.
-
(2007)
Technical Report UT-CS-07-596
-
-
Kurzak, J.1
Buttari, A.2
Dongarra, J.3
-
25
-
-
35248868578
-
Implementing linear algebra routines on multi-core processors with pipelining and a look ahead
-
University of Tennessee, September
-
Jakub Kurzak and Jack Dongarra. Implementing linear algebra routines on multi-core processors with pipelining and a look ahead. LAPACK Working Note 178 Technical Report UT-CS-06-581, University of Tennessee, September 2006.
-
(2006)
LAPACK Working Note 178 Technical Report UT-CS-06-581
-
-
Kurzak, J.1
Dongarra, J.2
-
26
-
-
0012525494
-
Programming parallel applications in Cilk
-
Charles Leiserson and Aske Plaat. Programming parallel applications in Cilk. SINEWS: SIAM News, 31, 1998.
-
(1998)
SINEWS: SIAM News
, vol.31
-
-
Leiserson, C.1
Plaat, A.2
-
27
-
-
47349106165
-
An API for manipulating matrices stored by blocks
-
Department of Computer Sciences, The University of Texas at Austin, May
-
Tze Meng Low and Robert van de Geijn. An API for manipulating matrices stored by blocks. FLAME Working Note #12 TR-2004-15, Department of Computer Sciences, The University of Texas at Austin, May 2004.
-
(2004)
FLAME Working Note #12 TR-2004-15
-
-
Low, T.M.1
Van De Geijn, R.2
-
28
-
-
38049132009
-
Toward scalable matrix multiply on multithreaded architectures
-
Rennes, France, August
-
Bryan Marker, Field G. Van Zee, Kazushige Goto, Gregorio Quintana-Ortí, and Robert A. van de Geijn. Toward scalable matrix multiply on multithreaded architectures. In Euro-Par '07: Proceedings of the Thirteenth International European Conference on Parallel and Distributed Computing, pages 748-757, Rennes, France, August 2007.
-
(2007)
Euro-Par '07: Proceedings of the Thirteenth International European Conference on Parallel and Distributed Computing
, pp. 748-757
-
-
Marker, B.1
Van Zee, F.G.2
Goto, K.3
Quintana-Ortí, G.4
Van De Geijn, R.A.5
-
29
-
-
0042235298
-
Tiling, block data layout, and memory hierarchy performance
-
N. Park, B. Hong, and V. K. Prasanna. Tiling, block data layout, and memory hierarchy performance. IEEE Transactions on Parallel and Distributed Systems, 14(7):640-654, 2003.
-
(2003)
IEEE Transactions on Parallel and Distributed Systems
, vol.14
, Issue.7
, pp. 640-654
-
-
Park, N.1
Hong, B.2
Prasanna, V.K.3
-
30
-
-
47349122478
-
Scheduling of QR factorization algorithms on SMP and multi-core architectures
-
Toulouse, France, February To appear
-
Gregorio Quintana-Ortí, Enrique S. Quintana-Ortí, Ernie Chan, Robert A. van de Geijn, and Field G. Van Zee. Scheduling of QR factorization algorithms on SMP and multi-core architectures. In PDP '08: Proceedings of the Sixteenth Euromicro International Conference on Parallel, Distributed and Network-based Processing, Toulouse, France, February 2008. To appear.
-
(2008)
PDP '08: Proceedings of the Sixteenth Euromicro International Conference on Parallel, Distributed and Network-based Processing
-
-
Quintana-Ortí, G.1
Quintana-Ortí, E.S.2
Chan, E.3
Van De Geijn, R.A.4
Van Zee, F.G.5
-
31
-
-
0035003299
-
A comparison of lookahead and algorithmic blocking techniques for parallel matrix factorization
-
June
-
Peter Strazdins. A comparison of lookahead and algorithmic blocking techniques for parallel matrix factorization. International Journal of Parallel and Distributed Systems and Networks, 4(1):26-35, June 2001.
-
(2001)
International Journal of Parallel and Distributed Systems and Networks
, vol.4
, Issue.1
, pp. 26-35
-
-
Strazdins, P.1
-
32
-
-
0003081830
-
An efficient algorithm for exploiting multiple arithmetic units
-
R. Tomasulo. An efficient algorithm for exploiting multiple arithmetic units. IBM Journal of Research and Development, 11(1), 1967.
-
(1967)
IBM Journal of Research and Development
, vol.11
, Issue.1
-
-
Tomasulo, R.1
-
33
-
-
0037173976
-
A framework for high-performance matrix multiplication based on hierarchical abstractions, algorithms and optimized low-level kernels
-
Vinod Valsalam and Anthony Skjellum. A framework for high-performance matrix multiplication based on hierarchical abstractions, algorithms and optimized low-level kernels. Concurrency and Computation: Practice and Experience, 14(10):805-840, 2002.
-
(2002)
Concurrency and Computation: Practice and Experience
, vol.14
, Issue.10
, pp. 805-840
-
-
Valsalam, V.1
Skjellum, A.2
-
35
-
-
41149134895
-
Scalable parallelization of FLAME code via the workqueuing model
-
Field G. Van Zee, Paolo Bientinesi, Tze Meng Low, and Robert A. van de Geijn. Scalable parallelization of FLAME code via the workqueuing model. ACM Transactions on Mathematical Software, 34(2), 2008.
-
(2008)
ACM Transactions on Mathematical Software
, vol.34
, Issue.2
-
-
Van Zee, F.G.1
Bientinesi, P.2
Low, T.M.3
Van De Geijn, R.A.4