SCOPUS Information Retrieval Platform

Proceedings - 2012 11th International Symposium on Parallel and Distributed Computing, ISPDC 2012

Volume , Issue , 2012, Pages 3-10

Performance of a structure-detecting SpMV using the CSR matrix representation

(3) Pabst, Hans ᵃ; Bachmayer, Bev ᵃ; Klemm, Michael ᵃ

ᵃ Intel Corporation (United States)

Author keywords

CRS; CSR; runtime optimization; sparse matrix vector multiplication; SpMV; structure detection

Indexed keywords

CRS; CSR; RUNTIME OPTIMIZATION; SPARSE MATRIX-VECTOR MULTIPLICATION; SPMV; STRUCTURE DETECTION; ALGORITHMS; DISTRIBUTED COMPUTER SYSTEMS; MATRIX ALGEBRA

EID: 84870731723
PISSN: None
EISSN: None
Source Type: Conference Proceeding
DOI: 10.1109/ISPDC.2012.9
Document Type: Conference Paper

Times cited : (3)

References (20)

1
- 35648995516
- K. Asanovic, R. Bodik, B. C. Catanzaro, J. J. Gebis, P. Husbands, K. Keutzer, D. A. Patterson, W. L. Plishker, J. Shalf, S. W. Williams, and K. A. Yelick, "The Landscape of Parallel Computing Research: A View from Berkeley," EECS Department, University of Berkeley, Tech. Rep. UCB/EECS-2006-183, December 2006, available at http://www.eecs.berkeley.edu/Pubs/TechRpts/2006/EECS-2006-183.html.
- (2006) The Landscape of Parallel Computing Research: A View from Berkeley
- Asanovic, K.¹ Bodik, R.² Catanzaro, B.C.³ Gebis, J.J.⁴ Husbands, P.⁵ Keutzer, K.⁶ Patterson, D.A.⁷ Plishker, W.L.⁸ Shalf, J.⁹ Williams, S.W.¹⁰ Yelick, K.A.¹¹

2
- 0003158656
- Hitting the Memory Wall: Implications of the Obvious
- W. A. Wulf and S. A. McKee, "Hitting the Memory Wall: Implications of the Obvious," ACM SIGARCH Computer Architecture News, vol. 23, no. 1, pp. 20-24, March 1995.
- (1995) ACM SIGARCH Computer Architecture News , vol.23 , Issue.1 , pp. 20-24
- Wulf, W.A.¹ McKee, S.A.²

3
- 84870744721
- Intel Corporation, "Intel® Xeon® Processor E5-2680," 2011, http://ark.intel.com/products/64583/Intel-Xeon-Processor-E5-2680-(20M-Cache-2 70-GHz-8-00-GTs-Intel-QPI), last accessed 2012-06-08.
- (2011) Intel® Xeon® Processor E5-2680

4
- 56749158843
- Optimization of Sparse Matrix-vector Multiplication on Emerging Multicore Platforms
- S. Williams, L. Oliker, R. Vuduc, J. Shalf, K. Yelick, and J. Demmel, "Optimization of Sparse Matrix-vector Multiplication on Emerging Multicore Platforms," in Proc. of the 2007 ACM/IEEE Conf. on Supercomputing, Reno, NV, November 2007, pp. 38:1-38:12.
- Proc. of the 2007 ACM/IEEE Conf. on Supercomputing, Reno, NV, November 2007
- Williams, S.¹ Oliker, L.² Vuduc, R.³ Shalf, J.⁴ Yelick, K.⁵ Demmel, J.⁶

5
- 79551492089
- Intel Corporation, "Intel® Advanced Vector Extensions Programming Reference," June 2011, document number 319433-011.
- (2011) Intel® Advanced Vector Extensions Programming Reference

6
- 84949647432
- Optimizing Sparse Matrix Computations for Register Reuse in SPARSITY
- E.-J. Im and K. A. Yelick, "Optimizing Sparse Matrix Computations for Register Reuse in SPARSITY," in Proc. of the Intl. Conf. on Computational Sciences, Part I, vol. 2073, San Francisco, CA, May 2001, pp. 127-136.
- (2001) Proc. of the Intl. Conf. on Computational Sciences , vol.2073 , Issue.PART I , pp. 127-136
- Im, E.-J.¹ Yelick, K.A.²

7
- 1542501019
- Sparsity: Optimization Framework for Sparse Matrix Kernels
- E.-J. Im, K. Yelick, and R. Vuduc, "Sparsity: Optimization Framework for Sparse Matrix Kernels," Intl. Journal of High Performance Computing Applications, vol. 18, no. 1, pp. 135-158, February 2004.
- (2004) Intl. Journal of High Performance Computing Applications , vol.18 , Issue.1 , pp. 135-158
- Im, E.-J.¹ Yelick, K.² Vuduc, R.³

8
- 33646389518
- Fast Sparse Matrix-Vector Multiplication by Exploiting Variable Block Structure
- R. W. Vuduc and H. J. Moon, "Fast Sparse Matrix-Vector Multiplication by Exploiting Variable Block Structure," in Proc. of the High-Performance Computing and Communications Conf., Sorrento, Italy, September 2005, pp. 807-816.
- Proc. of the High-Performance Computing and Communications Conf., Sorrento, Italy, September 2005 , pp. 807-816
- Vuduc, R.W.¹ Moon, H.J.²

9
- 34547765053
- R. Nishtala, R. W. Vuduc, J. W. Demmel, and K. A. Yelick, "Performance Modeling and Analysis of Cache Blocking in Sparse Matrix Vector Multiply," EECS Department, University of California, Berkeley, Tech. Rep. UCB/CSD-04-1335, 2004, available at http://www.eecs.berkeley.edu/Pubs/TechRpts/2004/5535.html.
- (2004) Performance Modeling and Analysis of Cache Blocking in Sparse Matrix Vector Multiply
- Nishtala, R.¹ Vuduc, R.W.² Demmel, J.W.³ Yelick, K.A.⁴

10
- 77949657892
- Parallel symmetric sparse matrix-vector product on scalar multi-core cpus
- M. Krotkiewski and M. Dabrowski, "Parallel symmetric sparse matrix-vector product on scalar multi-core cpus," Parallel Comput., vol. 36, no. 4, pp. 181-198, Apr. 2010. [Online]. Available: http://dx.doi.org/10.1016/j.parco.2010.02.003
- (2010) Parallel Comput. , vol.36 , Issue.4 , pp. 181-198
- Krotkiewski, M.¹ Dabrowski, M.²

11
- 85031264203
- Improving Performance of Sparse Matrix-Vector Multiplication
- A. Pinar and M. T. Heath, "Improving Performance of Sparse Matrix-Vector Multiplication," in Proc. of the 1999 ACM/IEEE Conf. on Supercomputing, Portland, OR, November 1999, pp. 30:1-30:9.
- Proc. of the 1999 ACM/IEEE Conf. on Supercomputing, Portland, OR, November 1999
- Pinar, A.¹ Heath, M.T.²

12
- 80053996235
- CSX: An Extended Compression Format for SpMV on Shared Memory Systems
- K. Kourtis, V. Karakasis, G. Goumas, and N. Koziris, "CSX: an Extended Compression Format for SpMV on Shared Memory Systems," in Proc. of the 16th ACM Symp. on Principles and Practice of Parallel Programming, San Antonio, TX, April 2011, pp. 247-256.
- Proc. of the 16th ACM Symp. on Principles and Practice of Parallel Programming, San Antonio, TX, April 2011 , pp. 247-256
- Kourtis, K.¹ Karakasis, V.² Goumas, G.³ Koziris, N.⁴

13
- 78650279432
- Pattern-based Sparse Matrix Representation for Memory-efficient SMVM Kernels
- M. Belgin, G. Back, and C. J. Ribbens, "Pattern-based Sparse Matrix Representation for Memory-efficient SMVM Kernels," in Proc. of the 23rd Intl. Conf. on Supercomputing, Yorktown Heights, NY, June 2009, pp. 100-109.
- Proc. of the 23rd Intl. Conf. on Supercomputing, Yorktown Heights, NY, June 2009 , pp. 100-109
- Belgin, M.¹ Back, G.² Ribbens, C.J.³

14
- 77954707501
- Cache-oblivious Sparse Matrix-vector Multiplication by Using Sparse Matrix Partitioning Methods
- A. N. Yzelman and R. H. Bisseling, "Cache-oblivious Sparse Matrix-vector Multiplication by Using Sparse Matrix Partitioning Methods," SIAM Journal on Scientific Computing, vol. 31, no. 4, July 2009.
- (2009) SIAM Journal on Scientific Computing , vol.31 , Issue.4
- Yzelman, A.N.¹ Bisseling, R.H.²

15
- 57349185547
- Adaptive Runtime Tuning of Parallel Sparse Matrix-vector Multiplication on Distributed Memory Systems
- S. Lee and R. Eigenmann, "Adaptive Runtime Tuning of Parallel Sparse Matrix-vector Multiplication on Distributed Memory Systems," in Proc. of the 22nd Intl. Conf. on Supercomputing, Island of Kos, Greece, June 2008, pp. 195-204.
- Proc. of the 22nd Intl. Conf. on Supercomputing, Island of Kos, Greece, June 2008 , pp. 195-204
- Lee, S.¹ Eigenmann, R.²

16
- 0005924935
- ISO/IEC, "Information Technology - Programming Languages - C++," 2011, ISO/IEC 14882-2011.
- (2011) Information Technology - Programming Languages - C++

17
- 84866644305
- Intel Corporation, "Intel® C++ Composer XE 12.1 User and Reference Guides," 2011, document number 323272-121US.
- (2011) Intel® C++ Composer XE 12.1 User and Reference Guides

18
- 80053263342
- Reduced-bandwidth multithreaded algorithms for sparse matrix-vector multiplication
- A. Buluc, S. Williams, L. Oliker, and J. Demmel, "Reduced-bandwidth multithreaded algorithms for sparse matrix-vector multiplication," in Proceedings of the 2011 IEEE International Parallel & Distributed Processing Symposium, ser. IPDPS '11. Washington, DC, USA: IEEE Computer Society, 2011, pp. 721-733. [Online]. Available: http://dx.doi.org/10.1109/IPDPS.2011.73
- (2011) IPDPS '11 , pp. 721-733
- Buluc, A.¹ Williams, S.² Oliker, L.³ Demmel, J.⁴

19
- 84863937520
- Intel Corporation, "Intel® Architecture Instruction Set Extensions Programming Reference," February 2012, document number 319433-012A.
- (2012) Intel® Architecture Instruction Set Extensions Programming Reference

20
- 81355161778
- The University of Florida Sparse Matrix Collection
- T. Davis and Y. Hu, "The University of Florida Sparse Matrix Collection," ACM Transactions on Mathematical Software, vol. 38, no. 11, pp. 1:1-1:25, December 2011.
- (2011) ACM Transactions on Mathematical Software , vol.38 , Issue.11
- Davis, T.¹ Hu, Y.²

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.