-
1
-
-
35549013711
-
Performance optimization and modeling of blocked sparse kernels
-
DOI 10.1177/1094342007083801
-
Buttari, A., Eijkhout, V., Langou, J., and Filippone, S. (2007). Performance optimization and modeling of blocked sparse kernels. Int. J. High Perform. Comput. Appl. 21 (4). 467-484. (Pubitemid 350011340)
-
(2007)
International Journal of High Performance Computing Applications
, vol.21
, Issue.4
, pp. 467-484
-
-
Buttari, A.1
Eijkhout, V.2
Langou, J.3
Filippone, S.4
-
4
-
-
85031252197
-
Achieving high sustained performance in an unstructured mesh CFD application
-
Anderson, W.K., Gropp, W.D., Kaushik, D.K., Keyes, D.E., and Smith, B.F. (1999). Achieving high sustained performance in an unstructured mesh CFD application. In Proceedings of the ACM/IEEE SC99 Conference on High Performance Networking and Computing. IEEE Computer Society.
-
Proceedings of the ACM/IEEE SC99 Conference on High Performance Networking and Computing
-
-
Anderson, W.K.1
Gropp, W.D.2
Kaushik, D.K.3
Keyes, D.E.4
Smith, B.F.5
-
6
-
-
0036298603
-
POWER4 system microarchitecture. IBM
-
Tendler, J.M., Dodson, J.S., Fields, J.S., Jr., Le, H., and Sinharoy, B. (2002). POWER4 system microarchitecture. IBM J. Res. Dev. 46 (1). 5-25.
-
(2002)
J. Res. Dev
, vol.46
, Issue.1
, pp. 5-25
-
-
Tendler, J.M.1
Dodson, J.S.2
Fields Jr., J.S.3
Le, H.4
Sinharoy, B.5
-
7
-
-
25844437046
-
POWER5 system microarchitecture. IBM
-
Sinharoy, B., Kalla, R.N., Tendler, J.M., Eickemeyer, R.J., and Joyner, J.B. (2005). POWER5 system microarchitecture. IBM J. Res. Dev. 49 (4 / 5). 505-521.
-
(2005)
J. Res. Dev
, vol.49
, Issue.4-5
, pp. 505-521
-
-
Sinharoy, B.1
Kalla, R.N.2
Tendler, J.M.3
Eickemeyer, R.J.4
Joyner, J.B.5
-
8
-
-
37549032725
-
IBM POWER6 microarchitecture
-
Le, H.Q., Starke, W.J., Fields, J.S., O'Connell, F.P., Nguyen, D.Q., Ronchetti, B.J., Sauer, W.M., Schwarz, E.M., and Vaden, M.T. (2007). IBM POWER6 microarchitecture. IBM J. Res. Dev. 51 (6). 639-662.
-
(2007)
IBM J. Res. Dev
, vol.51
, Issue.6
, pp. 639-662
-
-
Le, H.Q.1
Starke, W.J.2
Fields, J.S.3
O'Connell, F.P.4
Nguyen, D.Q.5
Ronchetti, B.J.6
Sauer, W.M.7
Schwarz, E.M.8
Vaden, M.T.9
-
10
-
-
84990830919
-
Performance optimizations and bounds for sparse matrix-vector multiply
-
Vuduc, R., Demmel, W.J., Yelick, A.K., Kamil, S., Nishtala, R., and Lee, B. (2002). Performance optimizations and bounds for sparse matrix-vector multiply. In Supercomputing '02: Proceedings of the 2002 ACM/IEEE Conference on Supercomputing, Baltimore, MD.
-
Supercomputing '02: Proceedings of the 2002 ACM/IEEE Conference on Supercomputing
-
-
Vuduc, R.1
Demmel, W.J.2
Yelick, A.K.3
Kamil, S.4
Nishtala, R.5
Lee, B.6
-
11
-
-
25144499116
-
Vectorized sparse matrix multiply for compressed row storage format
-
D'Azevedo, F.E., Fahey, R.M., and Mills, T.R. (2005). Vectorized sparse matrix multiply for compressed row storage format. In Computational Science - ICCS 2005 (Lecture Notes in Computer Science). pp. 99-106.
-
(2005)
Lecture Notes in Computer Science
, pp. 99-106
-
-
D'Azevedo, F.E.1
Fahey, R.M.2
Mills, T.R.3
-
12
-
-
0029290074
-
Data distributions for sparse matrix vector multiplication
-
Romero, L.F. and Zapata, E.L. (2005). Data distributions for sparse matrix vector multiplication. Parallel Comput. 21, 583-605.
-
(2005)
Parallel Comput
, vol.21
, pp. 583-605
-
-
Romero, L.F.1
Zapata, E.L.2
-
13
-
-
60949098907
-
Optimization of sparse matrix-vector multiplication on emerging multicore platforms
-
Williams, S., Oliker, L., Vuduc, R., Shalf, J., Yelick, K., and Demmel, J. (2009). Optimization of sparse matrix-vector multiplication on emerging multicore platforms. Parallel Comput. 35 (3). 178-194.
-
(2009)
Parallel Comput
, vol.35
, Issue.3
, pp. 178-194
-
-
Williams, S.1
Oliker, L.2
Vuduc, R.3
Shalf, J.4
Yelick, K.5
Demmel, J.6
-
14
-
-
2942628343
-
Optimizing sparse matrix-vector product computations using unroll and jam
-
Mellor-Crummey, J. and Garvin, J. (2004). Optimizing sparse matrix-vector product computations using unroll and jam. Int. J. High Perform. Comput. Appl. 18 (2). 225-236.
-
(2004)
Int. J. High Perform. Comput. Appl
, vol.18
, Issue.2
, pp. 225-236
-
-
Mellor-Crummey, J.1
Garvin, J.2
-
19
-
-
0003660984
-
-
Mathematics and Computer Science Division, Argonne National Laboratory, ANL-95/11 Revision 3.0.0.
-
Balay, S., Buschelman, K., Eijkhout, V., Gropp, W., Kaushik, D., Knepley, M., Curfman McInnes, L., Smith, B., and Zhang, H. (2008). PETSc Users Manual. Mathematics and Computer Science Division, Argonne National Laboratory, ANL-95/11 Revision 3.0.0.
-
(2008)
PETSc Users Manual
-
-
Balay, S.1
Buschelman, K.2
Eijkhout, V.3
Gropp, W.4
Kaushik, D.5
Knepley, M.6
Curfman McInnes, L.7
Smith, B.8
Zhang, H.9
|