-
2
-
-
78650279432
-
Pattern-based sparse matrix representation for memory-efficient SMVM kernels
-
M. Belgin, G. Back, C.J. Ribbens, Pattern-based sparse matrix representation for memory-efficient SMVM kernels, in: Proc. of the Int. Conf. on Supercomputing, 2009, pp. 100-109.
-
(2009)
Proc. of the Int. Conf. on Supercomputing
, pp. 100-109
-
-
Belgin, M.1
Back, G.2
Ribbens, C.J.3
-
4
-
-
35549013711
-
Performance optimization and modeling of blocked sparse kernels
-
DOI 10.1177/1094342007083801
-
A. Buttari, V. Eijkhout, J. Langou, S. Filippone, Performance optimization and modeling of blocked sparse kernels, Int. J. High Perform. Comput. Appl. 21 (4) (2007) 467-484.
-
(2007)
International Journal of High Performance Computing Applications
, vol.21
, Issue.4
, pp. 467-484
-
-
Buttari, A.1
Eijkhout, V.2
Langou, J.3
Filippone, S.4
-
5
-
-
33645913852
-
Performance comparison of data-reordering algorithms for sparse matrix-vector multiplication in edge-based unstructured grid computations
-
A.L.G.A. Coutinho, M.A.D. Martins, R.M. Sydenstricker, R.N. Elias, Performance comparison of data-reordering algorithms for sparse matrix-vector multiplication in edge-based unstructured grid computations, Internat. J. Numer. Methods Engrg. 66 (3) (2006) 431-460.
-
(2006)
Internat. J. Numer. Methods Engrg.
, vol.66
, Issue.3
, pp. 431-460
-
-
Coutinho, A.L.G.A.1
Martins, M.A.D.2
Sydenstricker, R.M.3
Elias, R.N.4
-
7
-
-
84885951571
-
-
University of Florida Sparse Matrix Collection
-
T. Davis, University of Florida Sparse Matrix Collection, NA Digest 97 (23). http://www.cise.ufl.edu/research/sparse/matrices.
-
NA Digest
, vol.97
, Issue.23
-
-
Davis, T.1
-
8
-
-
70450227686
-
Performance evaluation of the sparse matrix-vector multiplication on modern architectures
-
G. Goumas, K. Kourtis, N. Anastopoulos, V. Karakasis, N. Koziris, Performance evaluation of the sparse matrix-vector multiplication on modern architectures, J. Supercomput. 50 (1) (2009) 36-77.
-
(2009)
J. Supercomput.
, vol.50
, Issue.1
, pp. 36-77
-
-
Goumas, G.1
Kourtis, K.2
Anastopoulos, N.3
Karakasis, V.4
Koziris, N.5
-
9
-
-
80955167944
-
Performance analysis and benchmarking of the Intel SCC
-
P. Gschwandtner, T. Fahringer, R. Prodan, Performance analysis and benchmarking of the Intel SCC, in: IEEE Int. Conf. on Cluster Computing, CLUSTER, 2011, pp. 139-149.
-
(2011)
IEEE Int. Conf. on Cluster Computing, CLUSTER
, pp. 139-149
-
-
Gschwandtner, P.1
Fahringer, T.2
Prodan, R.3
-
12
-
-
84885960296
-
-
Intel, Measuring Processor Power: TDP vs. ACP, white paper, April 2011
-
Intel, Measuring Processor Power: TDP vs. ACP, white paper, April 2011.
-
-
-
-
13
-
-
84885947825
-
-
Intel Labs, SCC External Architecture Specification (EAS), November 2010
-
Intel Labs, SCC External Architecture Specification (EAS), November 2010.
-
-
-
-
14
-
-
84885958699
-
-
Intel Labs, SCC Programmer's Guide, November 2011
-
Intel Labs, SCC Programmer's Guide, November 2011.
-
-
-
-
15
-
-
70749149210
-
A comparative study of blocking storage methods for sparse matrices on multicore architectures
-
V. Karakasis, G. Goumas, N. Koziris, A comparative study of blocking storage methods for sparse matrices on multicore architectures, in: Proc. of IEEE Int. Conf. on Computational Science and Engineering, 2009, pp. 247-256.
-
(2009)
Proc. of IEEE Int. Conf. on Computational Science and Engineering
, pp. 247-256
-
-
Karakasis, V.1
Goumas, G.2
Koziris, N.3
-
17
-
-
81555226707
-
Memory intensive applications on a many-core processor
-
M. Korch, T. Rauber, C. Scholtes, Memory intensive applications on a many-core processor, in: IEEE Int. Conf. on High Performance Computing and Communications, 2011, pp. 126-134.
-
(2011)
IEEE Int. Conf. on High Performance Computing and Communications
, pp. 126-134
-
-
Korch, M.1
Rauber, T.2
Scholtes, C.3
-
18
-
-
55849146932
-
Optimizing sparse matrix-vector multiplication using index and value compression
-
K. Kourtis, G. Goumas, N. Koziris, Optimizing sparse matrix-vector multiplication using index and value compression, in: Proc. of the Conference on Computing Frontiers, 2008, pp. 87-96.
-
(2008)
Proc. of the Conference on Computing Frontiers
, pp. 87-96
-
-
Kourtis, K.1
Goumas, G.2
Koziris, N.3
-
19
-
-
84870486690
-
Investigation of main memory bandwidth on Intel single-chip cloud computer
-
N. Melot, K. Avdic, C. Kessler, J. Keller, Investigation of main memory bandwidth on Intel single-chip cloud computer, in: 3rd Many-core Application Research Community Symposium, 2011, pp. 107-110.
-
(2011)
3rd Many-core Application Research Community Symposium
, pp. 107-110
-
-
Melot, N.1
Avdic, K.2
Kessler, C.3
Keller, J.4
-
20
-
-
0036734103
-
Effects of ordering strategies and programming paradigms on sparse matrix computations
-
L. Oliker, X. Li, P. Husbands, R. Biswas, Effects of ordering strategies and programming paradigms on sparse matrix computations, SIAM Rev. 44 (3) (2002) 373-393.
-
(2002)
SIAM Rev.
, vol.44
, Issue.3
, pp. 373-393
-
-
Oliker, L.1
Li, X.2
Husbands, P.3
Biswas, R.4
-
21
-
-
80054888287
-
Analyzing the execution of sparse matrix-vector product on the Finisterrae SMP-NUMA system
-
J.C. Pichel, J.A. Lorenzo, D.B. Heras, J.C. Cabaleiro, T.F. Pena, Analyzing the execution of sparse matrix-vector product on the Finisterrae SMP-NUMA system, J. Supercomput. 58 (2) (2011) 195-205.
-
(2011)
J. Supercomput.
, vol.58
, Issue.2
, pp. 195-205
-
-
Pichel, J.C.1
Lorenzo, J.A.2
Heras, D.B.3
Cabaleiro, J.C.4
Pena, T.F.5
-
22
-
-
84867432441
-
Experiences with the sparse matrix-vector multiplication on a many-core processor
-
J.C. Pichel, F.F. Rivera, Experiences with the sparse matrix-vector multiplication on a many-core processor, in: Proc. of the IEEE 26th Int. Parallel and Distributed Processing Symposium Workshops, IPDPSW, 2012, pp. 7-15.
-
(2012)
Proc. of the IEEE 26th Int. Parallel and Distributed Processing Symposium Workshops, IPDPSW
, pp. 7-15
-
-
Pichel, J.C.1
Rivera, F.F.2
-
23
-
-
84857332778
-
Optimization of sparse matrix-vector multiplication using reordering techniques on GPUs
-
J.C. Pichel, F.F. Rivera, M. Fernández, A. Rodríguez, Optimization of sparse matrix-vector multiplication using reordering techniques on GPUs, Microprocess. Microsyst. 36 (2) (2012) 65-77.
-
(2012)
Microprocess. Microsyst.
, vol.36
, Issue.2
, pp. 65-77
-
-
Pichel, J.C.1
Rivera, F.F.2
Fernández, M.3
Rodríguez, A.4
-
24
-
-
56349128909
-
Reordering algorithms for increasing locality on multicore processors
-
J.C. Pichel, D.E. Singh, J. Carretero, Reordering algorithms for increasing locality on multicore processors, in: Proc. of the IEEE Int. Conf. on High Performance Computing and Communications, 2008, pp. 123-130.
-
(2008)
Proc. of the IEEE Int. Conf. on High Performance Computing and Communications
, pp. 123-130
-
-
Pichel, J.C.1
Singh, D.E.2
Carretero, J.3
-
25
-
-
3042576437
-
Improving performance of sparse matrix-vector multiplication
-
A. Pinar, M. Heath, Improving performance of sparse matrix-vector multiplication, in: Proc. of Supercomputing, 1999.
-
(1999)
Proc. of Supercomputing
-
-
Pinar, A.1
Heath, M.2
-
28
-
-
33646389518
-
Fast sparse matrix-vector multiplication by exploiting variable block structure
-
Lecture Notes in Computer Science
-
R. Vuduc, H. Moon, Fast sparse matrix-vector multiplication by exploiting variable block structure, in: High Performance Computing and Communications, Lecture Notes in Computer Science, vol. 3726, 2005, pp. 807-816.
-
(2005)
High Performance Computing and Communications
, vol.3726
, pp. 807-816
-
-
Vuduc, R.1
Moon, H.2
-
29
-
-
56749158843
-
Optimization of sparse matrix-vector multiply on emerging multicore platforms
-
12
-
S. Williams, L. Oliker, R. Vuduc, J. Shalf, K. Yelick, J. Demmel, Optimization of sparse matrix-vector multiply on emerging multicore platforms, in: Proc. of the ACM/IEEE Conf. on Supercomputing, 2007, pp. 38:1-38:12.
-
(2007)
Proc. of the ACM/IEEE Conf. on Supercomputing
, pp. 38:1-38:12
-
-
Williams, S.1
Oliker, L.2
Vuduc, R.3
Shalf, J.4
Yelick, K.5
Demmel, J.6