-
1
-
-
35648995516
-
The landscape of parallel computing research: A view from berkeley
-
Univ. of California, Berkeley
-
K. Asanovic, R. Bodik, B.C. Catanzaro, J.J. Gebis, P. Husbands, K. Keutzer, D.A. Patterson, W.L. Plishker, J. Shalf, S.W. Williams, and K.A. Yelick, "The Landscape of Parallel Computing Research: A View from Berkeley," Technical Report UCB/EECS-2006-183, Univ. of California, Berkeley, 2006.
-
(2006)
Technical Report UCB/EECS-2006-2183
-
-
Asanovic, K.1
Bodik, R.2
Catanzaro, B.C.3
Gebis, J.J.4
Husbands, P.5
Keutzer, K.6
Patterson, D.A.7
Plishker, W.L.8
Shalf, J.9
Williams, S.W.10
Yelick, K.A.11
-
2
-
-
70450227686
-
Performance evaluation of the sparse matrix-vector multiplication on modern architectures
-
G. Goumas, K. Kourtis, N. Anastopoulos, V. Karakasis, and N. Koziris, "Performance Evaluation of the Sparse Matrix-Vector Multiplication on Modern Architectures," J. Supercomputing, vol. 50, no. 1, pp. 36-77, 2009.
-
(2009)
J. Supercomputing
, vol.50
, Issue.1
, pp. 36-77
-
-
Goumas, G.1
Kourtis, K.2
Anastopoulos, N.3
Karakasis, V.4
Koziris, N.5
-
3
-
-
56749158843
-
Optimization of sparse matrix-vector multiplication on emerging multicore platforms
-
S. Williams, L. Oilker, R. Vuduc, J. Shalf, K. Yelick, and J. Demmel, "Optimization of Sparse Matrix-Vector Multiplication on Emerging Multicore Platforms," Proc. ACM/IEEE Conf. Supercomputing, 2007.
-
(2007)
Proc. ACM/IEEE Conf. Supercomputing
-
-
Williams, S.1
Oilker, L.2
Vuduc, R.3
Shalf, J.4
Yelick, K.5
Demmel, J.6
-
4
-
-
65949107549
-
Roofline: An insightful visual performance model for multicore architectures
-
Apr
-
S. Williams, A. Waterman, and D. Patterson, "Roofline: An Insightful Visual Performance Model for Multicore Architectures," Comm. ACM-A Direct Path to Dependable Software, vol. 52, no. 4, pp. 65-76, Apr. 2009.
-
(2009)
Comm ACM A Direct Path to Dependable Software
, vol.52
, Issue.4
, pp. 65-76
-
-
Williams, S.1
Waterman, A.2
Patterson, D.3
-
6
-
-
84983621818
-
A high performance algorithm using pre-processing for the sparse matrix-vector multiplication
-
R.C. Agarwal, F.G. Gustavson, and M. Zubair, "A High Performance Algorithm Using Pre-Processing for the Sparse Matrix-Vector Multiplication, " Proc. ACM/IEEE Conf. Supercomputing, pp. 32-41, 1992.
-
(1992)
Proc. ACM/IEEE Conf. Supercomputing
, pp. 32-41
-
-
Agarwal, R.C.1
Gustavson, F.G.2
Zubair, M.3
-
9
-
-
0035370546
-
Towards a fast parallel sparse matrix-vector multiplication
-
R. Geus and S. Rollin, "Towards a Fast Parallel Sparse Matrix-Vector Multiplication," Parallel Computing, vol. 27, pp. 883-896, 2001.
-
(2001)
Parallel Computing
, vol.27
, pp. 883-896
-
-
Geus, R.1
Rollin, S.2
-
14
-
-
78650279432
-
Pattern-based sparse matrix representation for memory-efficient SMVM kernels
-
M. Belgin, G. Back, and C.J. Ribbens, "Pattern-Based Sparse Matrix Representation for Memory-Efficient SMVM Kernels," Proc. 23rd Int'l Conf. Supercomputing (ICS '09), pp. 100-109, 2009.
-
(2009)
Proc. 23rd Int'l Conf. Supercomputing (ICS '09)
, pp. 100-109
-
-
Belgin, M.1
Back, G.2
Ribbens, C.J.3
-
15
-
-
79952786461
-
CSX: An extended compression format for spmv on shared memory systems
-
K. Kourtis, V. Karakasis, G. Goumas, and N. Koziris, "CSX: An Extended Compression Format for SpMV on Shared Memory Systems," Proc. 16th ACM SIGPLAN Ann. Symp. Principles and Practice of Parallel Programming (PPoPP '11), pp. 247-256, 2011.
-
(2011)
Proc. 16th ACM SIGPLAN Ann. Symp. Principles and Practice of Parallel Programming (PPoPP '11)
, pp. 247-256
-
-
Kourtis, K.1
Karakasis, V.2
Goumas, G.3
Koziris, N.4
-
16
-
-
70449913281
-
Exploring the effect of block shapes on the performance of sparse kernels
-
V. Karakasis, G. Goumas, and N. Koziris, "Exploring the Effect of Block Shapes on the Performance of Sparse Kernels," Proc. IEEE Int'l Symp. Parallel and Distributed Processing, pp. 1-8, 2009.
-
(2009)
Proc. IEEE Int'l Symp. Parallel and Distributed Processing
, pp. 1-8
-
-
Karakasis, V.1
Goumas, G.2
Koziris, N.3
-
17
-
-
81355161778
-
The university of florida sparse matrix collection
-
T. Davis and Y. Hu, "The University of Florida Sparse Matrix Collection," ACM Trans. Math. Software, vol. 38, pp. 1-25, 2011.
-
(2011)
ACM Trans. Math. Software
, vol.38
, pp. 1-25
-
-
Davis, T.1
Hu, Y.2
-
18
-
-
84937995839
-
Direct solutions of sparse network equations by optimally ordered triangular factorization
-
Nov
-
W. Tinney and J. Walker, "Direct Solutions of Sparse Network Equations by Optimally Ordered Triangular Factorization," Proc. IEEE, vol. 55, no. 11, pp. 1801-1809, Nov. 1967.
-
(1967)
Proc. IEEE
, vol.55
, Issue.11
, pp. 1801-1809
-
-
Tinney, W.1
Walker, J.2
-
19
-
-
84976809508
-
A survey of indexing techniques for sparse matrices
-
U.W. Pooch and A. Nieder, "A Survey of Indexing Techniques for Sparse Matrices," ACM Computing Surveys, vol. 5, pp. 109-133, 1973.
-
(1973)
ACM Computing Surveys
, vol.5
, pp. 109-133
-
-
Pooch, U.W.1
Nieder, A.2
-
21
-
-
1542501019
-
Sparsity: Optimization framework for sparse matrix kernels
-
E.-J. Im, K. Yelick, and R. Vuduc, "Sparsity: Optimization Framework for Sparse Matrix Kernels," Int'l J. High Performance Computing Applications, vol. 18, pp. 135-158, 2004.
-
(2004)
Int'l J. High Performance Computing Applications
, vol.18
, pp. 135-158
-
-
Im, E.-J.1
Yelick, K.2
Vuduc, R.3
-
23
-
-
84990830919
-
Performance optimizations and bounds for sparse matrix-vector multiply
-
R. Vuduc, J.W. Demmel, K.A. Yelick, S. Kamil, R. Nishtala, and B. Lee, "Performance Optimizations and Bounds for Sparse Matrix-Vector Multiply," Proc. ACM/IEEE Conf. Supercomputing, pp. 1-35, 2002.
-
(2002)
Proc. ACM/IEEE Conf. Supercomputing
, pp. 1-35
-
-
Vuduc, R.1
Demmel, J.W.2
Yelick, K.A.3
Kamil, S.4
Nishtala, R.5
Lee, B.6
-
24
-
-
24344485098
-
OSKI: A library of automatically tuned sparse matrix kernels
-
R. Vuduc, J.W. Demmel, and K.A. Yelick, "OSKI: A Library of Automatically Tuned Sparse Matrix Kernels," J. Physics: Conf. Series, vol. 16, no. 521, 2005.
-
(2005)
J. Physics: Conf. Series
, vol.16
, Issue.521
-
-
Vuduc, R.1
Demmel, J.W.2
Yelick, K.A.3
-
25
-
-
70449629588
-
Parallel sparse matrix-vector and matrix-transpose-vector multiplication using compressed sparse blocks
-
A. Buluç, J.T. Fineman, M. Frigo, J.R. Gilbert, and C.E. Leiserson, "Parallel Sparse Matrix-Vector and Matrix-Transpose-Vector Multiplication Using Compressed Sparse Blocks," Proc. 21st Ann. Symp. Parallelism in Algorithms and Architectures (SPAA '09), pp. 233-244, 2009.
-
(2009)
Proc. 21st Ann. Symp. Parallelism in Algorithms and Architectures (SPAA '09)
, pp. 233-244
-
-
Buluç, A.1
Fineman, J.T.2
Frigo, M.3
Gilbert, J.R.4
Leiserson, C.E.5
-
26
-
-
80053263342
-
Reduced-bandwidth multithreaded algorithms for sparse matrix-vector multiplication
-
A. Buluç, S. Williams, L. Oliker, and J. Demmel, "Reduced-Bandwidth Multithreaded Algorithms for Sparse Matrix-Vector Multiplication," Proc. IEEE Int'l Parallel and Distributed Processing Symp., pp. 721-733, 2011.
-
(2011)
Proc. IEEE Int'l Parallel and Distributed Processing Symp
, pp. 721-733
-
-
Buluç, A.1
Williams, S.2
Oliker, L.3
Demmel, J.4
-
28
-
-
78651410164
-
Exploiting compression opportunities to improve SpMxV performance on shared memory systems
-
article 16
-
K. Kourtis, G. Goumas, and N. Koziris, "Exploiting Compression Opportunities to Improve SpMxV Performance on Shared Memory Systems," ACM Trans. Architecture and Code Optimization, vol. 7, no. 3, article 16, 2010.
-
(2010)
ACM Trans. Architecture and Code Optimization
, vol.7
, Issue.3
-
-
Kourtis, K.1
Goumas, G.2
Koziris, N.3
|