-
1
-
-
85031252197
-
Achieving high sustained performance in an unstructured mesh CFD application
-
New York, NY, USA, ACM
-
W. K. Anderson, W. D. Gropp, D. K. Kaushik, D. E. Keyes, and B. F. Smith. Achieving high sustained performance in an unstructured mesh CFD application. In SC '99: Proceedings of the 1999 ACM/IEEE conference on Supercomputing, page 69, New York, NY, USA, 1999. ACM.
-
(1999)
SC '99: Proceedings of the 1999 ACM/IEEE conference on Supercomputing
, pp. 69
-
-
Anderson, W.K.
Gropp, W.D.
Kaushik, D.K.
Keyes, D.E.
Smith, B.F.
-
2
-
-
55849138680
-
-
K. Asanovic et al. The landscape of parallel computing research: A view from Berkeley. Technical Report UCB/EECS-2006-183, EECS Department, University of California, Berkeley, December 18, 2006.
-
-
-
-
3
-
-
0003473816
-
-
SIAM, Philadelphia
-
R. Barrett, M. Berry, T. F. Chan, J. Demmel, J. M. Donato, J. Dongarra, V. Eijkhout, R. Pozo, C. Romine, and H. van der Vorst. Templates for the Solution of Linear Systems: Building Blocks for Iterative Methods. SIAM, Philadelphia, 1994.
-
(1994)
Templates for the Solution of Linear Systems: Building Blocks for Iterative Methods
-
-
Barrett, R.
Berry, M.
Chan, T.F.
Demmel, J.
Donato, J.M.
Dongarra, J.
Eijkhout, V.
Pozo, R.
Romine, C.
van der Vorst, H.
-
4
-
-
34547626759
-
High throughput compression of double-precision floating-point data
-
Washington, DC, USA, IEEE Computer Society
-
M. Burtscher and P. Ratanaworabhan. High throughput compression of double-precision floating-point data. In DCC '07: Proceedings of the 2007 Data Compression Conference, pages 293-302, Washington, DC, USA, 2007. IEEE Computer Society.
-
(2007)
DCC '07: Proceedings of the 2007 Data Compression Conference
, pp. 293-302
-
-
Burtscher, M.
Ratanaworabhan, P.
-
5
-
-
84947911090
-
Decomposing irregularly sparse matrices for parallel matrix-vector multiplication
-
Ü. V. Çatalyürek and C. Aykanat. Decomposing irregularly sparse matrices for parallel matrix-vector multiplication. Lecture Notes in Computer Science, 1117:75-86, 1996.
-
(1996)
Lecture Notes in Computer Science
, vol.1117
, pp. 75-86
-
-
Çatalyürek, Ü.V.
Aykanat, C.
-
7
-
-
0003197949
-
University of Florida sparse matrix collection
-
T. Davis. University of Florida sparse matrix collection. NA Digest, 97(23):7, 1997.
-
(1997)
NA Digest
, vol.97
, Issue.23
, pp. 7
-
-
Davis, T.
-
8
-
-
20344401552
-
Chip makers turn to multicore processors
-
D. Geer. Chip makers turn to multicore processors. IEEE Computer, 38(5):11-13, 2005.
-
(2005)
IEEE Computer
, vol.38
, Issue.5
, pp. 11-13
-
-
Geer, D.
-
10
-
-
47349103843
-
Understanding the performance of sparse matrix-vector multiplication
-
G. Goumas, K. Kourtis, N. Anastopoulos, V. Karakasis, and N. Koziris. Understanding the performance of sparse matrix-vector multiplication. In PDP '08: Proceedings of the 16th Euromicro International Conference on Parallel, Distributed and Network-based Processing, 2008.
-
(2008)
PDP '08: Proceedings of the 16th Euromicro International Conference on Parallel, Distributed and Network-based Processing
-
-
Goumas, G.
Kourtis, K.
Anastopoulos, N.
Karakasis, V.
Koziris, N.
-
12
-
-
42549168687
-
Exploring the cache design space for large scale CMPs
-
L. Hsu, R. Iyer, S. Makineni, S. Reinhardt, and D. Newell. Exploring the cache design space for large scale CMPs. ACM SIGARCH Computer Architecture News, 33(4):24-33, 2005.
-
(2005)
ACM SIGARCH Computer Architecture News
, vol.33
, Issue.4
, pp. 24-33
-
-
Hsu, L.
Iyer, R.
Makineni, S.
Reinhardt, S.
Newell, D.
-
14
-
-
84949647432
-
Optimizing sparse matrix computations for register reuse in SPARSITY
-
E. Im and K. Yelick. Optimizing sparse matrix computations for register reuse in SPARSITY. Lecture Notes in Computer Science, 2073:127-136, 2001.
-
(2001)
Lecture Notes in Computer Science
, vol.2073
, pp. 127-136
-
-
Im, E.
Yelick, K.
-
16
-
-
55849146932
-
Optimizing sparse matrix-vector multiplication using index and value compression
-
New York, NY, USA, ACM
-
K. Kourtis, G. Goumas, and N. Koziris. Optimizing sparse matrix-vector multiplication using index and value compression. In CF '08: Proceedings of the 2008 conference on Computing frontiers, pages 87-96, New York, NY, USA, 2008. ACM.
-
(2008)
CF '08: Proceedings of the 2008 conference on Computing frontiers
, pp. 87-96
-
-
Kourtis, K.
Goumas, G.
Koziris, N.
-
17
-
-
34548206782
-
Exploiting the performance of 32 bit floating point arithmetic in obtaining 64 bit accuracy (revisiting iterative refinement for linear systems)
-
New York, NY, USA, ACM
-
J. Langou, J. Langou, P. Luszczek, J. Kurzak, A. Buttari, and J. Dongarra. Exploiting the performance of 32 bit floating point arithmetic in obtaining 64 bit accuracy (revisiting iterative refinement for linear systems). In SC '06: Proceedings of the 2006 ACM/IEEE conference on Supercomputing, page 113, New York, NY, USA, 2006. ACM.
-
(2006)
SC '06: Proceedings of the 2006 ACM/IEEE conference on Supercomputing
, pp. 113
-
-
Langou, J.
Langou, J.
Luszczek, P.
Kurzak, J.
Buttari, A.
Dongarra, J.
-
18
-
-
10044248780
-
Performance models for evaluation and automatic tuning of symmetric sparse matrix-vector multiply
-
15-18 Aug
-
B. Lee, R. Vuduc, J. Demmel, and K. Yelick. Performance models for evaluation and automatic tuning of symmetric sparse matrix-vector multiply. In ICPP '04: Proceedings of the International Conference on Parallel Processing, pages 169-176 vol.1, 15-18 Aug. 2004.
-
(2004)
ICPP '04: Proceedings of the International Conference on Parallel Processing
, vol.1
, pp. 169-176
-
-
Lee, B.
Vuduc, R.
Demmel, J.
Yelick, K.
-
20
-
-
3042618790
-
-
J. C. Pichel, D. B. Heras, J. C. Cabaleiro, and F. F. Rivera. Improving the locality of the sparse matrix-vector product on shared memory multiprocessors. In PDP, pages 66-71. IEEE Computer Society, 2004.
-
-
-
-
21
-
-
85031264203
-
Improving performance of sparse matrix-vector multiplication
-
Nov
-
A. Pinar and M. T. Heath. Improving performance of sparse matrix-vector multiplication. In Supercomputing'99, Portland, OR, Nov. 1999. ACM SIGARCH and IEEE.
-
(1999)
ACM SIGARCH and IEEE
-
-
Pinar, A.
Heath, M.T.
-
22
-
-
55849126609
-
-
Y. Saad. SPARSKIT: A basic tool kit for sparse matrix computations. Technical report, Computer Science Department, University of Minnesota, Minneapolis, MN 55455, June 1994. Version 2.
-
-
-
-
24
-
-
0031269220
-
Improving the memory-system performance of sparse-matrix vector multiplication
-
S. Toledo. Improving the memory-system performance of sparse-matrix vector multiplication. IBM Journal of Research and Development, 41(6):711-725, 1997.
-
(1997)
IBM Journal of Research and Development
, vol.41
, Issue.6
, pp. 711-725
-
-
Toledo, S.
-
25
-
-
84990830919
-
Performance optimizations and bounds for sparse matrix-vector multiply
-
Baltimore, MD, Nov
-
R. Vuduc, J. Demmel, K. Yelick, S. Kamil, R. Nishtala, and B. Lee. Performance optimizations and bounds for sparse matrix-vector multiply. In Supercomputing, Baltimore, MD, Nov. 2002.
-
(2002)
Supercomputing
-
-
Vuduc, R.
Demmel, J.
Yelick, K.
Kamil, S.
Nishtala, R.
Lee, B.
-
27
-
-
34547468948
-
Accelerating sparse matrix computations via data compression
-
New York, NY, USA, ACM Press
-
J. Willcock and A. Lumsdaine. Accelerating sparse matrix computations via data compression. In ICS '06: Proceedings of the 20th annual International Conference on Supercomputing, pages 307-316, New York, NY, USA, 2006. ACM Press.
-
(2006)
ICS '06: Proceedings of the 20th annual International Conference on Supercomputing
, pp. 307-316
-
-
Willcock, J.
Lumsdaine, A.
-
28
-
-
56749158843
-
Optimization of sparse matrix-vector multiplication on emerging multicore platforms
-
Reno, NV, Nov
-
S. Williams, L. Oliker, R. Vuduc, J. Shalf, K. Yelick, and J. Demmel. Optimization of sparse matrix-vector multiplication on emerging multicore platforms. In SC '07: Proceedings of the 2007 ACM/IEEE conference on Supercomputing, Reno, NV, Nov. 2007.
-
(2007)
SC '07: Proceedings of the 2007 ACM/IEEE conference on Supercomputing
-
-
Williams, S.
Oliker, L.
Vuduc, R.
Shalf, J.
Yelick, K.
Demmel, J.