-
1
-
-
70449629588
-
Parallel sparse matrix-vector and matrix-transpose-vector multiplication using compressed sparse blocks
-
ACM
-
A. Bulu, J.T. Fineman, M. Frigo, J.R. Gilbert, and C.E. Leiserson Parallel sparse matrix-vector and matrix-transpose-vector multiplication using compressed sparse blocks SPAA '09: Proceedings of the twenty-first annual symposium on Parallelism in algorithms and architectures, New York, NY, USA 2009 ACM 233 244
-
(2009)
SPAA '09: Proceedings of the Twenty-first Annual Symposium on Parallelism in Algorithms and Architectures, New York, NY, USA
, pp. 233-244
-
-
Bulu, A.1
Fineman, J.T.2
Frigo, M.3
Gilbert, J.R.4
Leiserson, C.E.5
-
2
-
-
0033360524
-
Hypergraph-partitioning-based decomposition for parallel sparse-matrix vector multiplication
-
DOI 10.1109/71.780863
-
U.V. atalyürek, and C. Aykanat Hypergraph-partitioning-based decomposition for parallel sparse-matrix vector multiplication IEEE Trans. Parallel Distrib. Syst. 10 1999 673 693 (Pubitemid 30500688)
-
(1999)
IEEE Transactions on Parallel and Distributed Systems
, vol.10
, Issue.7
, pp. 673-693
-
-
Catalyurek, U.V.1
Aykanat, C.2
-
3
-
-
35048838799
-
A fine-grain hypergraph model for 2D decomposition of sparse matrices
-
IEEE Press, Los Alamitos, CA
-
Ü.V. atalyürek, C. Aykanat, A fine-grain hypergraph model for 2D decomposition of sparse matrices, in: Proceedings 8th International Workshop on Solving Irregularly Structured Problems in Parallel, IEEE Press, Los Alamitos, CA, 2001, p. 118.
-
(2001)
Proceedings 8th International Workshop on Solving Irregularly Structured Problems in Parallel
, pp. 118
-
-
Atalyürek, V.1
-
5
-
-
0033350255
-
Cache-oblivious algorithms
-
IEEE Press Washington, DC
-
M. Frigo, C.E. Leiserson, H. Prokop, and S. Ramachandran Cache-oblivious algorithms Proceedings 40th Annual Symposium on Foundations of Computer Science 1999 IEEE Press Washington, DC 285
-
(1999)
Proceedings 40th Annual Symposium on Foundations of Computer Science
, pp. 285
-
-
Frigo, M.1
Leiserson, C.E.2
Prokop, H.3
Ramachandran, S.4
-
6
-
-
1542392269
-
On reducing TLB misses in matrix multiplication
-
University of Texas at Austin, Department of Computer Sciences, 2002. FLAME Working Note #9
-
K. Goto, R. van de Geijn, On reducing TLB misses in matrix multiplication, Tech. Rep. TR-2002-55, University of Texas at Austin, Department of Computer Sciences, 2002. FLAME Working Note #9.
-
Tech. Rep. TR-2002-55
-
-
Goto, K.1
Geijn De R.Van2
-
7
-
-
34250347767
-
A Hilbert-order multiplication scheme for unstructured sparse matrices
-
DOI 10.1080/17445760601122084, PII 779509037
-
G. Haase, M. Liebmann, and G. Plank A Hilbert-order multiplication scheme for unstructured sparse matrices Int. J. Parallel Emer. Distrib. Syst. 22 2007 213 220 (Pubitemid 46925815)
-
(2007)
International Journal of Parallel, Emergent and Distributed Systems
, vol.22
, Issue.4
, pp. 213-220
-
-
Haase, G.1
Liebmann, M.2
Plank, G.3
-
8
-
-
84949647432
-
Optimizing Sparse Matrix Computations for Register Reuse in SPARSITY
-
Computational Science - ICCS 2001
-
E.-J. Im, K. Yelick, Optimizing sparse matrix computations for register reuse in SPARSITY, in: Proceedings International Conference on Computational Science, Part I, Lecture Notes in Computer Science, vol. 2073, 2001, pp. 127-136. (Pubitemid 33285441)
-
(2001)
Lecture Notes in Computer Science
, Issue.2073
, pp. 127-136
-
-
Im, E.-J.1
Yelick, K.2
-
9
-
-
17444432688
-
-
Master's thesis, Utrecht University, Department of Mathematics, July
-
J. Koster, Parallel templates for numerical linear algebra, a high-performance computation library, Master's thesis, Utrecht University, Department of Mathematics, July 2002.
-
(2002)
Parallel Templates for Numerical Linear Algebra, A High-performance Computation Library
-
-
Koster, J.1
-
10
-
-
70449690102
-
Analyzing block locality in Morton-order and Morton-hybrid matrices
-
K.P. Lorton, and D.S. Wise Analyzing block locality in Morton-order and Morton-hybrid matrices SIGARCH Comput. Archit. News 35 2007 6 12
-
(2007)
SIGARCH Comput. Archit. News
, vol.35
, pp. 6-12
-
-
Lorton, K.P.1
Wise, D.S.2
-
11
-
-
0003460690
-
A computer oriented geodetic data base and a new technique in file sequencing
-
IBM, Ottawa, Canada, March
-
G. Morton, A computer oriented geodetic data base and a new technique in file sequencing, Tech. Rep., IBM, Ottawa, Canada, March 1966.
-
(1966)
Tech. Rep.
-
-
Morton, G.1
-
12
-
-
34547744862
-
When cache blocking of sparse matrix vector multiply works and why
-
DOI 10.1007/s00200-007-0038-9
-
R. Nishtala, R.W. Vuduc, J.W. Demmel, and K.A. Yelick When cache blocking of sparse matrix vector multiply works and why Appl. Algebr. Eng. Commun. Comput. 18 2007 297 311 (Pubitemid 47224626)
-
(2007)
Applicable Algebra in Engineering, Communications and Computing
, vol.18
, Issue.3
, pp. 297-311
-
-
Nishtala, R.1
Vuduc, R.W.2
Demmel, J.W.3
Yelick, K.A.4
-
13
-
-
0031269220
-
Improving the memory-system performance of sparse-matrix vector multiplication
-
S. Toledo Improving the memory-system performance of sparse-matrix vector multiplication IBM J. Res. Dev. 41 1997 711 725 (Pubitemid 127557044)
-
(1997)
IBM Journal of Research and Development
, vol.41
, Issue.6
, pp. 711-725
-
-
Toledo, S.1
-
14
-
-
0037173976
-
A framework for high-performance matrix multiplication based on hierarchical abstractions, algorithms and optimized low-level kernels
-
DOI 10.1002/cpe.630
-
V. Valsalam, and A. Skjellum A framework for high-performance matrix multiplication based on hierarchical abstractions, algorithms and optimized low-level kernels Concurrency Comput.: Practice Exp. 14 2002 805 839 (Pubitemid 34965359)
-
(2002)
Concurrency Computation Practice and Experience
, vol.14
, Issue.10
, pp. 805-839
-
-
Valsalam, V.1
Skjellum, A.2
-
15
-
-
40449112015
-
Memory hierarchy in cache-based systems
-
Sun Microsystems, Inc., Santa Clara, CA, Nov.
-
R. van der Pas, Memory hierarchy in cache-based systems, Tech. Rep. 817-0742-10, Sun Microsystems, Inc., Santa Clara, CA, Nov. 2002.
-
(2002)
Tech. Rep. 817-0742-10
-
-
Pas Der R.Van1
-
16
-
-
17444414573
-
A two-dimensional data distribution method for parallel sparse matrix-vector multiplication
-
DOI 10.1137/S0036144502409019
-
B. Vastenhouw, and R.H. Bisseling A two-dimensional data distribution method for parallel sparse matrix-vector multiplication SIAM Rev. 47 2005 67 95 (Pubitemid 40535972)
-
(2005)
SIAM Review
, vol.47
, Issue.1
, pp. 67-95
-
-
Vastenhouw, B.1
Bisseling, R.H.2
-
17
-
-
24344485098
-
OSKI: A library of automatically tuned sparse matrix kernels
-
DOI 10.1088/1742-6596/16/1/071
-
R. Vuduc, J.W. Demmel, and K.A. Yelick OSKI: A library of automatically tuned sparse matrix kernels J. Phys. Conf. Ser. 16 2005 521 530 (Pubitemid 41259393)
-
(2005)
Journal of Physics: Conference Series
, vol.16
, Issue.1
, pp. 521-530
-
-
Vuduc, R.1
Demmel, J.W.2
Yelick, K.A.3
-
18
-
-
0343462141
-
Automated empirical optimizations of software and the ATLAS project
-
DOI 10.1016/S0167-8191(00)00087-9
-
R.C. Whaley, A. Petitet, and J.J. Dongarra Automated empirical optimizations of software and the ATLAS project Parallel Comput. 27 2001 3 35 (Pubitemid 32264775)
-
(2001)
Parallel Computing
, vol.27
, Issue.1-2
, pp. 3-35
-
-
Clint Whaley, R.1
Petitet, A.2
Dongarra, J.J.3
-
19
-
-
84930675361
-
A cache-oblivious sparse matrix-vector multiplication scheme based on the Hilbert curve
-
Springer, in press
-
A.N. Yzelman, R.H. Bisseling, A cache-oblivious sparse matrix-vector multiplication scheme based on the Hilbert curve, in Progress in Industrial Mathematics at ECMI 2010, Springer, in press.
-
(2010)
Progress in Industrial Mathematics at ECMI
-
-
Yzelman, A.N.1
Bisseling, R.H.2
-
20
-
-
77954707501
-
Cache-oblivious sparse matrix-vector multiplication by using sparse matrix partitioning methods
-
A.N. Yzelman, and R.H. Bisseling Cache-oblivious sparse matrix-vector multiplication by using sparse matrix partitioning methods SIAM J. Scientif. Comput. 31 2009 3128 3154
-
(2009)
SIAM J. Scientif. Comput.
, vol.31
, pp. 3128-3154
-
-
Yzelman, A.N.1
Bisseling, R.H.2
|