-
1
-
-
0030491606
-
An approximate minimum degree ordering algorithm
-
Oct.
-
P. R. Amestoy, T. A. Davis, and I. S. Duff. An approximate minimum degree ordering algorithm. SIAM J. Matrix Anal. Appl., 17(4):886-905, Oct. 1996.
-
(1996)
SIAM J. Matrix Anal. Appl.
, vol.17
, Issue.4
, pp. 886-905
-
-
Amestoy, P.R.1
Davis, T.A.2
Duff, I.S.3
-
3
-
-
78650279432
-
Pattern-based sparse matrix representation for memory-efficient SMVM kernels
-
New York, NY, USA
-
M. Belgin, G. Back, and C. J. Ribbens. Pattern-based sparse matrix representation for memory-efficient SMVM kernels. In Proceedings of the 23rd international conference on Supercomputing, ICS'09, pages 100-109, New York, NY, USA, 2009.
-
(2009)
Proceedings of the 23rd International Conference on Supercomputing, ICS'09
, pp. 100-109
-
-
Belgin, M.1
Back, G.2
Ribbens, C.J.3
-
4
-
-
74049143158
-
Implementing sparse matrix-vector multiplication on throughput-oriented processors
-
New York, NY, USA
-
N. Bell and M. Garland. Implementing sparse matrix-vector multiplication on throughput-oriented processors. In Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis, SC'09, pages 18:1-18:11, New York, NY, USA, 2009.
-
(2009)
Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis, SC'09
, pp. 18:1-18:11
-
-
Bell, N.1
Garland, M.2
-
5
-
-
84899687916
-
-
Cusp library
-
N. Bell and M. Garland. Cusp library, 2012. http://cusp-library.googlecode.com.
-
(2012)
-
-
Bell, N.1
Garland, M.2
-
6
-
-
77957679421
-
Model-driven autotuning of sparse matrix-vector multiply on GPUs
-
New York, NY, USA
-
J. W. Choi, A. Singh, and R. W. Vuduc. Model-driven autotuning of sparse matrix-vector multiply on GPUs. In Proceedings of the 15th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPoPP'10, pages 115-126, New York, NY, USA, 2010.
-
(2010)
Proceedings of the 15th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPoPP'10
, pp. 115-126
-
-
Choi, J.W.1
Singh, A.2
Vuduc, R.W.3
-
7
-
-
81355161778
-
The University of Florida sparse matrix collection
-
Dec.
-
T. A. Davis and Y. Hu. The University of Florida sparse matrix collection. ACM Trans. Math. Softw., 38(1):1:1-1:25, Dec. 2011. http://www.cise.ufl.edu/research/sparse/matrices/.
-
(2011)
ACM Trans. Math. Softw.
, vol.38
, Issue.1
, pp. 1:1-1:25
-
-
Davis, T.A.1
Hu, Y.2
-
8
-
-
25144499116
-
Vectorized sparse matrix multiply for compressed row storage format
-
Berlin, Heidelberg
-
E. F. D'Azevedo, M. R. Fahey, and R. T. Mills. Vectorized sparse matrix multiply for compressed row storage format. In Proceedings of the 5th international conference on Computational Science-Volume Part I, ICCS'05, pages 99-106, Berlin, Heidelberg, 2005.
-
(2005)
Proceedings of the 5th International Conference on Computational Science-Volume Part I, ICCS'05
, pp. 99-106
-
-
D'Azevedo, E.F.1
Fahey, M.R.2
Mills, R.T.3
-
10
-
-
84858763464
-
High-performance sparse matrix-vector multiplication on GPUs for structured grid computations
-
New York, NY, USA
-
J. Godwin, J. Holewinski, and P. Sadayappan. High-performance sparse matrix-vector multiplication on GPUs for structured grid computations. In Proceedings of the 5th Annual Workshop on General Purpose Processing with Graphics Processing Units, GPGPU-5, pages 47-56, New York, NY, USA, 2012.
-
(2012)
Proceedings of the 5th Annual Workshop on General Purpose Processing with Graphics Processing Units, GPGPU-5
, pp. 47-56
-
-
Godwin, J.1
Holewinski, J.2
Sadayappan, P.3
-
11
-
-
84864039129
-
Automatically generating and tuning GPU code for sparse matrix-vector multiplication from a high-level representation
-
New York, NY, USA
-
D. Grewe and A. Lokhmotov. Automatically generating and tuning GPU code for sparse matrix-vector multiplication from a high-level representation. In Proceedings of the Fourth Workshop on General Purpose Processing on Graphics Processing Units, GPGPU-4, pages 12:1-12:8, New York, NY, USA, 2011.
-
(2011)
Proceedings of the Fourth Workshop on General Purpose Processing on Graphics Processing Units, GPGPU-4
, pp. 12:1-12:8
-
-
Grewe, D.1
Lokhmotov, A.2
-
13
-
-
84949647432
-
Optimizing sparse matrix computations for register reuse in SPARSITY
-
London, UK
-
E.-J. Im and K. A. Yelick. Optimizing sparse matrix computations for register reuse in SPARSITY. In Proceedings of the International Conference on Computational Sciences-Part I, ICCS'01, pages 127-136, London, UK, 2001.
-
(2001)
Proceedings of the International Conference on Computational Sciences-Part I, ICCS'01
, pp. 127-136
-
-
Im, E.-J.1
Yelick, K.A.2
-
14
-
-
77950369345
-
Data clustering: 50 years beyond k-means
-
June
-
A. K. Jain. Data clustering: 50 years beyond k-means. Pattern Recogn. Lett., 31(8):651-666, June 2010.
-
(2010)
Pattern Recogn. Lett.
, vol.31
, Issue.8
, pp. 651-666
-
-
Jain, A.K.1
-
16
-
-
55849146932
-
Optimizing sparse matrix-vector multiplication using index and value compression
-
New York, NY, USA
-
K. Kourtis, G. Goumas, and N. Koziris. Optimizing sparse matrix-vector multiplication using index and value compression. In Proceedings of the 5th Conference on Computing frontiers, CF'08, pages 87-96, New York, NY, USA, 2008.
-
(2008)
Proceedings of the 5th Conference on Computing Frontiers, CF'08
, pp. 87-96
-
-
Kourtis, K.1
Goumas, G.2
Koziris, N.3
-
17
-
-
2942628343
-
Optimizing sparse matrix-vector product computations using unroll and jam
-
May
-
J. Mellor-Crummey and J. Garvin. Optimizing sparse matrix-vector product computations using unroll and jam. Int. J. High Perform. Comput. Appl., 18(2):225-236, May 2004.
-
(2004)
Int. J. High Perform. Comput. Appl.
, vol.18
, Issue.2
, pp. 225-236
-
-
Mellor-Crummey, J.1
Garvin, J.2
-
18
-
-
77949577730
-
Automatically tuning sparse matrix-vector multiplication for GPU architectures
-
Berlin, Heidelberg
-
A. Monakov, A. Lokhmotov, and A. Avetisyan. Automatically tuning sparse matrix-vector multiplication for GPU architectures. In Proceedings of the 5th international conference on High Performance Embedded Architectures and Compilers, HiPEAC'10, pages 111-125, Berlin, Heidelberg, 2010.
-
(2010)
Proceedings of the 5th International Conference on High Performance Embedded Architectures and Compilers, HiPEAC'10
, pp. 111-125
-
-
Monakov, A.1
Lokhmotov, A.2
Avetisyan, A.3
-
20
-
-
85031264203
-
Improving performance of sparse matrix-vector multiplication
-
New York, NY, USA
-
A. Pinar and M. T. Heath. Improving performance of sparse matrix-vector multiplication. In Proceedings of the 1999 ACM/IEEE conference on Supercomputing, Supercomputing'99, New York, NY, USA, 1999.
-
(1999)
Proceedings of the 1999 ACM/IEEE Conference on Supercomputing, Supercomputing'99
-
-
Pinar, A.1
Heath, M.T.2
-
21
-
-
24144467633
-
Iterative methods for sparse linear systems
-
Philadelphia, PA, USA, 2nd edition
-
Y. Saad. Iterative Methods for Sparse Linear Systems. Society for Industrial and Applied Mathematics, Philadelphia, PA, USA, 2nd edition, 2003.
-
(2003)
Society for Industrial and Applied Mathematics
-
-
Saad, Y.1
-
22
-
-
84864051848
-
clSpMV: A cross-platform OpenCL SpMV framework on GPUs
-
New York, NY, USA
-
B.-Y. Su and K. Keutzer. clSpMV: A cross-platform OpenCL SpMV framework on GPUs. In Proceedings of the 26th ACM international conference on Supercomputing, ICS'12, pages 353-364, New York, NY, USA, 2012.
-
(2012)
Proceedings of the 26th ACM International Conference on Supercomputing, ICS'12
, pp. 353-364
-
-
Su, B.-Y.1
Keutzer, K.2
-
23
-
-
79955614550
-
A new approach for sparse matrix vector product on NVIDIA GPUs
-
June
-
F. Vazquez, J. J. Fernandez, and E. M. Garzon. A new approach for sparse matrix vector product on NVIDIA GPUs. Concurr. Comput.: Pract. Exper., 23(8):815-826, June 2011.
-
(2011)
Concurr. Comput.: Pract. Exper.
, vol.23
, Issue.8
, pp. 815-826
-
-
Vazquez, F.1
Fernandez, J.J.2
Garzon, E.M.3
-
24
-
-
24344485098
-
OSKI: A library of automatically tuned sparse matrix kernels
-
R. Vuduc, J. W. Demmel, and K. A. Yelick. OSKI: A library of automatically tuned sparse matrix kernels. In Proc. SciDAC, J. Physics: Conf. Ser., volume 16, pages 521-530, 2005.
-
(2005)
Proc. SciDAC, J. Physics: Conf. Ser.
, vol.16
, pp. 521-530
-
-
Vuduc, R.1
Demmel, J.W.2
Yelick, K.A.3
-
25
-
-
33646389518
-
Fast sparse matrix-vector multiplication by exploiting variable block structure
-
Berlin, Heidelberg
-
R. W. Vuduc and H.-J. Moon. Fast sparse matrix-vector multiplication by exploiting variable block structure. In Proceedings of the First international conference on High Performance Computing and Communications, HPCC'05, pages 807-816, Berlin, Heidelberg, 2005.
-
(2005)
Proceedings of the First International Conference on High Performance Computing and Communications, HPCC'05
, pp. 807-816
-
-
Vuduc, R.W.1
Moon, H.-J.2
-
26
-
-
34547468948
-
Accelerating sparse matrix computations via data compression
-
New York, NY, USA
-
J. Willcock and A. Lumsdaine. Accelerating sparse matrix computations via data compression. In Proceedings of the 20th annual international conference on Supercomputing, ICS'06, pages 307-316, New York, NY, USA, 2006.
-
(2006)
Proceedings of the 20th Annual International Conference on Supercomputing, ICS'06
, pp. 307-316
-
-
Willcock, J.1
Lumsdaine, A.2
-
27
-
-
56749158843
-
Optimization of sparse matrix-vector multiplication on emerging multicore platforms
-
New York, NY, USA
-
S. Williams, L. Oliker, R. Vuduc, J. Shalf, K. Yelick, and J. Demmel. Optimization of sparse matrix-vector multiplication on emerging multicore platforms. In Proceedings of the 2007 ACM/IEEE conference on Supercomputing, SC'07, pages 38:1-38:12, New York, NY, USA, 2007.
-
(2007)
Proceedings of the 2007 ACM/IEEE Conference on Supercomputing, SC'07
, pp. 38:1-38:12
-
-
Williams, S.1
Oliker, L.2
Vuduc, R.3
Shalf, J.4
Yelick, K.5
Demmel, J.6