-
1
-
-
35648995516
-
-
University of Berkeley, Tech. Rep. UCB/EECS-2006-183, December available at
-
K. Asanovic, R. Bodik, B. C. Catanzaro, J. J. Gebis, P. Husbands, K. Keutzer, D. A. Patterson, W. L. Plishker, J. Shalf, S. W. Williams, and K. A. Yelick, "The Landscape of Parallel Computing Research: A View from Berkeley," EECS Department, University of Berkeley, Tech. Rep. UCB/EECS-2006-183, December 2006, available at http://www.eecs.berkeley.edu/ Pubs/TechRpts/2006/EECS-2006-183.html.
-
(2006)
The Landscape of Parallel Computing Research: A View from Berkeley
-
-
Asanovic, K.1
Bodik, R.2
Catanzaro, B.C.3
Gebis, J.J.4
Husbands, P.5
Keutzer, K.6
Patterson, D.A.7
Plishker, W.L.8
Shalf, J.9
Williams, S.W.10
Yelick, K.A.11
-
2
-
-
0003158656
-
Hitting the Memory Wall: Implications of the Obvious
-
March
-
W. A. Wulf and S. A. McKee, "Hitting the Memory Wall: Implications of the Obvious," ACM SIGARCH Computer Architecture News, vol. 23, no. 1, pp. 20-24, March 1995.
-
(1995)
ACM SIGARCH Computer Architecture News
, vol.23
, Issue.1
, pp. 20-24
-
-
Wulf, W.A.1
McKee, S.A.2
-
3
-
-
84870744721
-
-
last accessed 2012-06-08
-
Intel Corporation, "Intel® Xeon® Processor E5-2680," 2011, http://ark.intel.com/products/64583/Intel-Xeon-Processor-E5-2680-(20M- Cache-2 70-GHz-8-00-GTs-Intel-QPI), last accessed 2012-06-08.
-
(2011)
Intel® Xeon® Processor E5-2680
-
-
-
4
-
-
56749158843
-
Optimization of Sparse Matrix-vector Multiplication on Emerging Multicore Platforms
-
S. Williams, L. Oliker, R. Vuduc, J. Shalf, K. Yelick, and J. Demmel, "Optimization of Sparse Matrix-vector Multiplication on Emerging Multicore Platforms," in Proc. of the 2007 ACM/IEEE Conf. on Supercomputing, Reno, NV, November 2007, pp. 38:1-38:12.
-
Proc. of the 2007 ACM/IEEE Conf. on Supercomputing, Reno, NV, November 2007
-
-
Williams, S.1
Oliker, L.2
Vuduc, R.3
Shalf, J.4
Yelick, K.5
Demmel, J.6
-
6
-
-
84949647432
-
Optimizing Sparse Matrix Computations for Register Reuse in SPARSITY
-
San Francisco, CA, May
-
E.-J. Im and K. A. Yelick, "Optimizing Sparse Matrix Computations for Register Reuse in SPARSITY," in Proc. of the Intl. Conf. on Computational Sciences, Part I, vol. 2073, San Francisco, CA, May 2001, pp. 127-136.
-
(2001)
Proc. of the Intl. Conf. on Computational Sciences
, vol.2073
, Issue.PART I
, pp. 127-136
-
-
Im, E.-J.1
Yelick, K.A.2
-
7
-
-
1542501019
-
Sparsity: Optimization Framework for Sparse Matrix Kernels
-
February
-
E.-J. Im, K. Yelick, and R. Vuduc, "Sparsity: Optimization Framework for Sparse Matrix Kernels," Intl. Journal of High Performance Computing Applications, vol. 18, no. 1, pp. 135-158, February 2004.
-
(2004)
Intl. Journal of High Performance Computing Applications
, vol.18
, Issue.1
, pp. 135-158
-
-
Im, E.-J.1
Yelick, K.2
Vuduc, R.3
-
8
-
-
33646389518
-
Fast Sparse Matrix-Vector Multiplication by Exploiting Variable Block Structure
-
R. W. Vuduc and H. J. Moon, "Fast Sparse Matrix-Vector Multiplication by Exploiting Variable Block Structure," in Proc. of the High-Performance Computing and Communications Conf., Sorrento, Italy, September 2005, pp. 807-816.
-
Proc. of the High-Performance Computing and Communications Conf., Sorrento, Italy, September 2005
, pp. 807-816
-
-
Vuduc, R.W.1
Moon, H.J.2
-
9
-
-
34547765053
-
-
University of California, Berkeley, Tech. Rep. UCB/CSD-04-1335, available at
-
R. Nishtala, R. W. Vuduc, J. W. Demmel, and K. A. Yelick, "Performance Modeling and Analysis of Cache Blocking in Sparse Matrix Vector Multiply," EECS Department, University of California, Berkeley, Tech. Rep. UCB/CSD-04-1335, 2004, available at http://www.eecs.berkeley.edu/ Pubs/TechRpts/2004/5535.html.
-
(2004)
Performance Modeling and Analysis of Cache Blocking in Sparse Matrix Vector Multiply
-
-
Nishtala, R.1
Vuduc, R.W.2
Demmel, J.W.3
Yelick, K.A.4
-
10
-
-
77949657892
-
Parallel symmetric sparse matrix-vector product on scalar multi-core cpus
-
Apr. [Online]. Available
-
M. Krotkiewski and M. Dabrowski, "Parallel symmetric sparse matrix-vector product on scalar multi-core cpus," Parallel Comput., vol. 36, no. 4, pp. 181-198, Apr. 2010. [Online]. Available: http://dx.doi.org/10. 1016/j.parco.2010.02.003
-
(2010)
Parallel Comput.
, vol.36
, Issue.4
, pp. 181-198
-
-
Krotkiewski, M.1
Dabrowski, M.2
-
11
-
-
85031264203
-
Improving Performance of Sparse Matrix-Vector Multiplication
-
A. Pinar and M. T. Heath, "Improving Performance of Sparse Matrix-Vector Multiplication," in Proc. of the 1999 ACM/IEEE Conf. on Supercomputing, Portland, OR, November 1999, pp. 30:1-30:9.
-
Proc. of the 1999 ACM/IEEE Conf. on Supercomputing, Portland, OR, November 1999
-
-
Pinar, A.1
Heath, M.T.2
-
12
-
-
80053996235
-
CSX: An Extended Compression Format for SpMV on Shared Memory Systems
-
K. Kourtis, V. Karakasis, G. Goumas, and N. Koziris, "CSX: an Extended Compression Format for SpMV on Shared Memory Systems," in Proc. of the 16th ACM Symp. on Principles and Practice of Parallel Programming, San Antonio, TX, April 2011, pp. 247-256.
-
Proc. of the 16th ACM Symp. on Principles and Practice of Parallel Programming, San Antonio, TX, April 2011
, pp. 247-256
-
-
Kourtis, K.1
Karakasis, V.2
Goumas, G.3
Koziris, N.4
-
13
-
-
78650279432
-
Pattern-based Sparse Matrix Representation for Memory-efficient SMVM Kernels
-
M. Belgin, G. Back, and C. J. Ribbens, "Pattern-based Sparse Matrix Representation for Memory-efficient SMVM Kernels," in Proc. of the 23rd Intl. Conf. on Supercomputing, Yorktown Heights, NY, June 2009, pp. 100-109.
-
Proc. of the 23rd Intl. Conf. on Supercomputing, Yorktown Heights, NY, June 2009
, pp. 100-109
-
-
Belgin, M.1
Back, G.2
Ribbens, C.J.3
-
14
-
-
77954707501
-
Cache-oblivious Sparse Matrix-vector Multiplication by Using Sparse Matrix Partitioning Methods
-
July
-
A. N. Yzelman, Rob, and H. Bisseling, "Cache-oblivious Sparse Matrix-vector Multiplication by Using Sparse Matrix Partitioning Methods," SIAM Journal on Scientific Computing, vol. 31, no. 4, July 2009.
-
(2009)
SIAM Journal on Scientific Computing
, vol.31
, Issue.4
-
-
Yzelman, A.N.1
Rob2
Bisseling, H.3
-
15
-
-
57349185547
-
Adaptive Runtime Tuning of Parallel Sparse Matrix-vector Multiplication on Distributed Memory Systems
-
S. Lee and R. Eigenmann, "Adaptive Runtime Tuning of Parallel Sparse Matrix-vector Multiplication on Distributed Memory Systems," in Proc. of the 22nd Intl. Conf. on Supercomputing, Island of Kos, Greece, June 2008, pp. 195- 204.
-
Proc. of the 22nd Intl. Conf. on Supercomputing, Island of Kos, Greece, June 2008
, pp. 195-204
-
-
Lee, S.1
Eigenmann, R.2
-
18
-
-
80053263342
-
Reduced-bandwidth multithreaded algorithms for sparse matrix-vector multiplication
-
Proceedings of the 2011 IEEE International Parallel & Distributed Processing Symposium, ser. Washington, DC, USA: IEEE Computer Society, [Online]. Available
-
A. Buluc, S. Williams, L. Oliker, and J. Demmel, "Reduced-bandwidth multithreaded algorithms for sparse matrix-vector multiplication," in Proceedings of the 2011 IEEE International Parallel & Distributed Processing Symposium, ser. IPDPS '11. Washington, DC, USA: IEEE Computer Society, 2011, pp. 721-733. [Online]. Available: http://dx.doi.org/10.1109/IPDPS.2011.73
-
(2011)
IPDPS '11
, pp. 721-733
-
-
Buluc, A.1
Williams, S.2
Oliker, L.3
Demmel, J.4
-
20
-
-
81355161778
-
The University of Florida Sparse Matrix Collection
-
December
-
T. Davis and Y. Hu, "The University of Florida Sparse Matrix Collection," ACM Transactions on Mathmatical Software, vol. 38, no. 11, pp. 1:1-1:25, December 2011.
-
(2011)
ACM Transactions on Mathmatical Software
, vol.38
, Issue.11
-
-
Davis, T.1
Hu, Y.2
|