-
3
-
-
77956260008
-
Implementing sparse matrix-vector multiplication on throughput-oriented processors
-
ACM, November
-
N. Bell and M. Garland. Implementing Sparse Matrix-Vector Multiplication on Throughput-Oriented Processors. In Proceedings of Supercomputing (SC '09), pages 18:1-18:11. ACM, November 2009.
-
(2009)
Proceedings of Supercomputing (SC '09)
, pp. 181-1811
-
-
Bell, N.1
Garland, M.2
-
4
-
-
84864034684
-
Hierarchical diagonal blocking and precision reduction applied to combinatorial multigrid
-
November
-
G. Blelloch, I. Koutis, G. Miller, and K. Tangwongsan. Hierarchical Diagonal Blocking and Precision Reduction Applied to Combinatorial Multigrid. In Proceedings of Supercomputing (SC '10), pages 1-12, November 2010.
-
(2010)
Proceedings of Supercomputing (SC '10)
, pp. 1-12
-
-
Blelloch, G.1
Koutis, I.2
Miller, G.3
Tangwongsan, K.4
-
6
-
-
70449629588
-
Parallel sparse matrix-vector and matrix-transpose-vector multiplication using compressed sparse blocks
-
ACM
-
A. Buluç, J. T. Fineman, M. Frigo, J. R. Gilbert, and C. E. Leiserson. Parallel Sparse Matrix-Vector and Matrix-Transpose-Vector Multiplication Using Compressed Sparse Blocks. In Proceedings of the 21st Annual Symposium on Parallelism in Algorithms and Architectures, SPAA '09, pages 233-244. ACM, 2009.
-
(2009)
Proceedings of the 21st Annual Symposium on Parallelism in Algorithms and Architectures, SPAA '09
, pp. 233-244
-
-
Buluç, A.1
Fineman, J.T.2
Frigo, M.3
Gilbert, J.R.4
Leiserson, C.E.5
-
7
-
-
80053263342
-
Reduced-bandwidth multithreaded algorithms for sparse matrix-vector multiplication
-
2011 IEEE International May
-
A. Buluç, S. Williams, L. Oliker, and J. Demmel. Reduced-Bandwidth Multithreaded Algorithms for Sparse Matrix-Vector Multiplication. In International Parallel Distributed Processing Symposium (IPDPS), 2011 IEEE International, pages 721-733, May 2011.
-
(2011)
International Parallel Distributed Processing Symposium (IPDPS)
, pp. 721-733
-
-
Buluç, A.1
Williams, S.2
Oliker, L.3
Demmel, J.4
-
9
-
-
84858769083
-
HICAMP: Architectural support for efficient concurrency-safe shared structured data access
-
New York, NY, USA ACM
-
D. R. Cheriton, A. Firoozshahian, A. Solomatnikov, J. P. Stevenson, and O. Azizi. HICAMP: Architectural Support for Efficient Concurrency-Safe Shared Structured Data Access. In Proceedings of the 17th International Conference on Architectural Support for Programming Languages and Operating Systems, ASPLOS '12, pages 287-300, New York, NY, USA, 2012. ACM.
-
(2012)
Proceedings of the 17th International Conference on Architectural Support for Programming Languages and Operating Systems, ASPLOS '12
, pp. 287-300
-
-
Cheriton, D.R.1
Firoozshahian, A.2
Solomatnikov, A.3
Stevenson, J.P.4
Azizi, O.5
-
10
-
-
81355161778
-
The university of florida sparse matrix collection
-
November
-
T. A. Davis and Y. Hu. The University of Florida Sparse Matrix Collection. ACM Trans. Math. Softw., 38:1:1-1:25, November 2011.
-
(2011)
ACM Trans. Math. Softw.
, vol.38
, pp. 11-125
-
-
Davis, T.A.1
Hu, Y.2
-
11
-
-
0033884908
-
Xtensa: A configurable and extensible processor
-
DOI 10.1109/40.848473
-
R. Gonzalez. Xtensa: A Configurable and Extensible Processor. Micro, IEEE, 20(2):60-70, Mar/Apr 2000. (Pubitemid 30585385)
-
(2000)
IEEE Micro
, vol.20
, Issue.2
, pp. 60-70
-
-
Gonzalez, R.E.1
-
12
-
-
1542501019
-
SPARSITY: Framework for optimizing sparse matrix-vector multiply
-
February
-
E.-J. Im, K. A. Yelick, and R. Vuduc. SPARSITY: Framework for Optimizing Sparse Matrix-Vector Multiply. International Journal of High Performance Computing Applications, 18(1):135-158, February 2004.
-
(2004)
International Journal of High Performance Computing Applications
, vol.18
, Issue.1
, pp. 135-158
-
-
Im, E.-J.1
Yelick, K.A.2
Vuduc, R.3
-
13
-
-
55849145179
-
Improving the performance of multithreaded sparse matrix-vector multiplication using index and value compression
-
September
-
K. Kourtis, G. Goumas, and N. Koziris. Improving the Performance of Multithreaded Sparse Matrix-Vector Multiplication Using Index and Value Compression. In 37th International Conference on Parallel Processing (ICPP '08), pages 511-519, September 2008.
-
(2008)
37th International Conference on Parallel Processing (ICPP '08)
, pp. 511-519
-
-
Kourtis, K.1
Goumas, G.2
Koziris, N.3
-
15
-
-
79952786461
-
CSX: An extended compression format for SpMV on shared memory systems
-
February
-
K. Kourtis, V. Karakasis, G. Goumas, and N. Koziris. CSX: An Extended Compression Format for SpMV on Shared Memory Systems. In Proceedings of the 16th ACM Symposium on Principles and Practice of Parallel Programming, pages 247-256, February 2011.
-
(2011)
Proceedings of the 16th ACM Symposium on Principles and Practice of Parallel Programming
, pp. 247-256
-
-
Kourtis, K.1
Karakasis, V.2
Goumas, G.3
Koziris, N.4
-
16
-
-
77949657892
-
Parallel symmetric sparse matrix-vector product on scalar multi-core CPUs
-
M. Krotkiewski and M. Dabrowski. Parallel Symmetric Sparse Matrix-Vector Product on Scalar Multi-Core CPUs. Parallel Computing, 36(4):181 - 198, 2010.
-
(2010)
Parallel Computing
, vol.36
, Issue.4
, pp. 181-198
-
-
Krotkiewski, M.1
Dabrowski, M.2
-
17
-
-
79551537591
-
Use of hybrid recursive CSR/COO data structures in sparse matrix-vector multiplication
-
October
-
M. Martone, S. Filippone, P. Gepner, M. Paprzycki, and S. Tucci. Use of Hybrid Recursive CSR/COO Data Structures in Sparse Matrix-Vector Multiplication. In IMCSIT, pages 327-335, October 2010.
-
(2010)
IMCSIT
, pp. 327-335
-
-
Martone, M.1
Filippone, S.2
Gepner, P.3
Paprzycki, M.4
Tucci, S.5
-
20
-
-
0031269220
-
Improving the memory-system performance of sparse-matrix vector multiplication
-
S. Toledo. Improving the Memory-System Performance of Sparse-Matrix Vector Multiplication. IBM Journal of Research and Development, 41(6):711-725, November 1997. (Pubitemid 127557044)
-
(1997)
IBM Journal of Research and Development
, vol.41
, Issue.6
, pp. 711-725
-
-
Toledo, S.1
-
22
-
-
24344485098
-
OSKI: A library of automatically tuned sparse matrix kernels
-
San Francisco, CA, USA, June Institute of Physics Publishing
-
R. Vuduc, J. W. Demmel, and K. A. Yelick. OSKI: A Library of Automatically Tuned Sparse Matrix Kernels. In Proceedings of SciDAC 2005, Journal of Physics: Conference Series, San Francisco, CA, USA, June 2005. Institute of Physics Publishing.
-
(2005)
Proceedings of SciDAC 2005, Journal of Physics: Conference Series
-
-
Vuduc, R.1
Demmel, J.W.2
Yelick, K.A.3
-
23
-
-
84990830919
-
Performance optimizations and bounds for sparse matrix-vector multiply
-
Baltimore, MD, USA, November
-
R. Vuduc, J. W. Demmel, K. A. Yelick, S. Kamil, R. Nishtala, and B. Lee. Performance Optimizations and Bounds for Sparse Matrix-Vector Multiply. In Proceedings of Supercomputing (SC '02), Baltimore, MD, USA, November 2002.
-
(2002)
Proceedings of Supercomputing (SC '02)
-
-
Vuduc, R.1
Demmel, J.W.2
Yelick, K.A.3
Kamil, S.4
Nishtala, R.5
Lee, B.6
-
24
-
-
34547468948
-
Accelerating sparse matrix computations via data compression
-
DOI 10.1145/1183401.1183444, Proceedings of the 20th Annual International Conference on Supercomputing, ICS 2006
-
J. Willcock and A. Lumsdaine. Accelerating Sparse Matrix Computations via Data Compression. In Proceedings of the 20th International Conference on Supercomputing (ICS '06), pages 307-316. ACM, June 2006. (Pubitemid 47168517)
-
(2006)
Proceedings of the International Conference on Supercomputing
, pp. 307-316
-
-
Willcock, J.1
Lumsdaine, A.2
-
25
-
-
56749158843
-
Optimization of sparse matrix-vector multiplication on emerging multicore platforms
-
ACM, November
-
S. Williams, L. Oliker, R. Vuduc, J. Shalf, K. Yelick, and J. Demmel. Optimization of Sparse Matrix-Vector Multiplication on Emerging Multicore Platforms. In Proceedings of Supercomputing (SC '07), pages 38:1-38:12. ACM, November 2007.
-
(2007)
Proceedings of Supercomputing (SC '07)
, pp. 381-3812
-
-
Williams, S.1
Oliker, L.2
Vuduc, R.3
Shalf, J.4
Yelick, K.5
Demmel, J.6
|