SCOPUS Information Retrieval Platform

Journal of Parallel and Distributed Computing

Volume 73, Issue 12, 2013, Pages 1539-1550

Sparse matrix-vector multiplication on the Single-Chip Cloud Computer many-core processor

(2) Pichel, Juan C. (a); Rivera, Francisco F. (a)

(a) Universidade de Santiago de Compostela (Spain)

Author keywords

Many core; Optimization; Performance; Power efficiency; Sparse matrix

Indexed keywords

DATA COMPRESSION TECHNIQUES; MANY-CORE; OPTIMIZATION TECHNIQUES; PERFORMANCE; POWER EFFICIENCY; SINGLE-CHIP CLOUD COMPUTERS; SPARSE MATRICES; SPARSE MATRIX-VECTOR MULTIPLICATION;

CLOUD COMPUTING; OPTIMIZATION; PROGRAM PROCESSORS;

EFFICIENCY;

EID: 84885948161; PISSN: 07437315; EISSN: None; Source Type: Journal
DOI: 10.1016/j.jpdc.2013.07.017; Document Type: Article

Times cited: 12

References (29)

1
- 0030491606
- An approximate minimum degree ordering algorithm
- P.R. Amestoy, T.A. Davis, and I.S. Duff An approximate minimum degree ordering algorithm SIAM J. Matrix Anal. Appl. 17 4 1996 886 905
- (1996) SIAM J. Matrix Anal. Appl. , vol.17 , Issue.4 , pp. 886-905
- Amestoy, P.R.¹ Davis, T.A.² Duff, I.S.³

2
- 78650279432
- Pattern-based sparse matrix representation for memory-efficient SMVM kernels
- M. Belgin, G. Back, C.J. Ribbens, Pattern-based sparse matrix representation for memory-efficient SMVM kernels, in: Proc. of the Int. Conf. on Supercomputing, 2009, pp. 100-109.
- (2009) Proc. of the Int. Conf. on Supercomputing , pp. 100-109
- Belgin, M.¹ Back, G.² Ribbens, C.J.³

3
- 70350368872
- N. Bell, and M. Garland Efficient sparse matrix-vector multiplication on CUDA, Tech. Rep., NVIDIA 2008
- (2008) Efficient Sparse Matrix-vector Multiplication on CUDA, Tech. Rep., NVIDIA
- Bell, N.¹ Garland, M.²

4
- 35549013711
- Performance optimization and modeling of blocked sparse kernels
- DOI 10.1177/1094342007083801
- A. Buttari, V. Eijkhout, J. Langou, and S. Filippone Performance optimization and modeling of blocked sparse kernels Int. J. High Perform. Comput. Appl. 21 4 2007 467 484 (Pubitemid 350011340)
- (2007) International Journal of High Performance Computing Applications , vol.21 , Issue.4 , pp. 467-484
- Buttari, A.¹ Eijkhout, V.² Langou, J.³ Filippone, S.⁴

5
- 33645913852
- Performance comparison of data-reordering algorithms for sparse matrix-vector multiplication in edge-based unstructured grid computations
- A.L.G.A. Coutinho, M.A.D. Martins, R.M. Sydenstricker, and R.N. Elias Performance comparison of data-reordering algorithms for sparse matrix-vector multiplication in edge-based unstructured grid computations Internat. J. Numer. Methods Engrg. 66 3 2006 431 460
- (2006) Internat. J. Numer. Methods Engrg. , vol.66 , Issue.3 , pp. 431-460
- Coutinho, A.L.G.A.¹ Martins, M.A.D.² Sydenstricker, R.M.³ Elias, R.N.⁴

6
- 0001803542
- Rose and Willoughby
- E. Cuthill, and J. McKee Several Strategies for Reducing the Bandwidth of Matrices 1972 Rose and Willoughby
- (1972) Several Strategies for Reducing the Bandwidth of Matrices
- Cuthill, E.¹ McKee, J.²

7
- 84885951571
- University of Florida Sparse Matrix Collection
- T. Davis, University of Florida Sparse Matrix Collection, NA Digest 97 (23). http://www.cise.ufl.edu/research/sparse/matrices.
- NA Digest , vol.97 , Issue.23
- Davis, T.¹

8
- 70450227686
- Performance evaluation of the sparse matrix-vector multiplication on modern architectures
- G. Goumas, K. Kourtis, N. Anastopoulos, V. Karakasis, and N. Koziris Performance evaluation of the sparse matrix-vector multiplication on modern architectures J. Supercomput. 50 1 2009 36 77
- (2009) J. Supercomput. , vol.50 , Issue.1 , pp. 36-77
- Goumas, G.¹ Kourtis, K.² Anastopoulos, N.³ Karakasis, V.⁴ Koziris, N.⁵

9
- 80955167944
- Performance analysis and benchmarking of the Intel SCC
- P. Gschwandtner, T. Fahringer, R. Prodan, Performance analysis and benchmarking of the Intel SCC, in: IEEE Int. Conf. on Cluster Computing, CLUSTER, 2011, pp. 139-149.
- (2011) IEEE Int. Conf. on Cluster Computing, CLUSTER , pp. 139-149
- Gschwandtner, P.¹ Fahringer, T.² Prodan, R.³

10
- 38149066662
- Optimizing sparse matrix vector multiplication on SMPs
- E.J. Im, K. Yelick, Optimizing sparse matrix vector multiplication on SMPs, in: Proc. of the 10th SIAM Conf. on Parallel Processing for Scientific Computing, 1999.
- (1999) Proc. of the 10th SIAM Conf. on Parallel Processing for Scientific Computing
- Im, E.J.¹ Yelick, K.²

11
- 1542501019
- SPARSITY: Optimization framework for sparse matrix kernels
- E.J. Im, K.A. Yelick, and R. Vuduc SPARSITY: optimization framework for sparse matrix kernels Int. J. High Perform. Comput. Appl. 18 1 2004 135 158
- (2004) Int. J. High Perform. Comput. Appl. , vol.18 , Issue.1 , pp. 135-158
- Im, E.J.¹ Yelick, K.A.² Vuduc, R.³

12
- 84885960296
- Intel, Measuring Processor Power: TDP vs. ACP, white paper, April 2011
- Intel, Measuring Processor Power: TDP vs. ACP, white paper, April 2011.

13
- 84885947825
- Intel Labs, SCC External Architecture Specification (EAS), November 2010
- Intel Labs, SCC External Architecture Specification (EAS), November 2010.

14
- 84885958699
- Intel Labs, SCC Programmer's Guide, November 2011
- Intel Labs, SCC Programmer's Guide, November 2011.

15
- 70749149210
- A comparative study of blocking storage methods for sparse matrices on multicore architectures
- V. Karakasis, G. Goumas, N. Koziris, A comparative study of blocking storage methods for sparse matrices on multicore architectures, in: Proc. of IEEE Int. Conf. on Computational Science and Engineering, 2009, pp. 247-256.
- (2009) Proc. of IEEE Int. Conf. on Computational Science and Engineering , pp. 247-256
- Karakasis, V.¹ Goumas, G.² Koziris, N.³

16
- 0003734628
- G. Karypis, V. Kumar, METIS: a software package for partitioning unstructured graphs, partitioning meshes, and computing fill-reducing orderings of sparse matrices, 1997.
- (1997) METIS: A Software Package for Partitioning Unstructured Graphs, Partitioning Meshes, and Computing Fill-reducing Orderings of Sparse Matrices
- Karypis, G.¹ Kumar, V.²

17
- 81555226707
- Memory intensive applications on a many-core processor
- M. Korch, T. Rauber, C. Scholtes, Memory intensive applications on a many-core processor, in: IEEE Int. Conf. on High Performance Computing and Communications, 2011, pp. 126-134.
- (2011) IEEE Int. Conf. on High Performance Computing and Communications , pp. 126-134
- Korch, M.¹ Rauber, T.² Scholtes, C.³

18
- 55849146932
- Optimizing sparse matrix-vector multiplication using index and value compression
- K. Kourtis, G. Goumas, N. Koziris, Optimizing sparse matrix-vector multiplication using index and value compression, in: Proc. of the Conference on Computing Frontiers, 2008, pp. 87-96.
- (2008) Proc. of the Conference on Computing Frontiers , pp. 87-96
- Kourtis, K.¹ Goumas, G.² Koziris, N.³

19
- 84870486690
- Investigation of main memory bandwidth on Intel single-chip cloud computer
- N. Melot, K. Avdic, C. Kessler, J. Keller, Investigation of main memory bandwidth on Intel single-chip cloud computer, in: 3rd Many-core Application Research Community Symposium, 2011, pp. 107-110.
- (2011) 3rd Many-core Application Research Community Symposium , pp. 107-110
- Melot, N.¹ Avdic, K.² Kessler, C.³ Keller, J.⁴

20
- 0036734103
- Effects of ordering strategies and programming paradigms on sparse matrix computations
- L. Oliker, X. Li, P. Husbands, and R. Biswas Effects of ordering strategies and programming paradigms on sparse matrix computations SIAM Rev. 44 3 2002 373 393
- (2002) SIAM Rev. , vol.44 , Issue.3 , pp. 373-393
- Oliker, L.¹ Li, X.² Husbands, P.³ Biswas, R.⁴

21
- 80054888287
- Analyzing the execution of sparse matrix-vector product on the Finisterrae SMP-NUMA system
- J.C. Pichel, J.A. Lorenzo, D.B. Heras, J.C. Cabaleiro, and T.F. Pena Analyzing the execution of sparse matrix-vector product on the Finisterrae SMP-NUMA system J. Supercomput. 58 2 2011 195 205
- (2011) J. Supercomput. , vol.58 , Issue.2 , pp. 195-205
- Pichel, J.C.¹ Lorenzo, J.A.² Heras, D.B.³ Cabaleiro, J.C.⁴ Pena, T.F.⁵

22
- 84867432441
- Experiences with the sparse matrix-vector multiplication on a many-core processor
- J.C. Pichel, F.F. Rivera, Experiences with the sparse matrix-vector multiplication on a many-core processor, in: Proc. of the IEEE 26th Int. Parallel and Distributed Processing Symposium Workshops, IPDPSW, 2012, pp. 7-15.
- (2012) Proc. of the IEEE 26th Int. Parallel and Distributed Processing Symposium Workshops, IPDPSW , pp. 7-15
- Pichel, J.C.¹ Rivera, F.F.²

23
- 84857332778
- Optimization of sparse matrix-vector multiplication using reordering techniques on GPUs
- J.C. Pichel, F.F. Rivera, M. Fernández, and A. Rodríguez Optimization of sparse matrix-vector multiplication using reordering techniques on GPUs Microprocess. Microsyst. 36 2 2012 65 77
- (2012) Microprocess. Microsyst. , vol.36 , Issue.2 , pp. 65-77
- Pichel, J.C.¹ Rivera, F.F.² Fernández, M.³ Rodríguez, A.⁴

24
- 56349128909
- Reordering algorithms for increasing locality on multicore processors
- J.C. Pichel, D.E. Singh, J. Carretero, Reordering algorithms for increasing locality on multicore processors, in: Proc. of the IEEE Int. Conf. on High Performance Computing and Communications, 2008, pp. 123-130.
- (2008) Proc. of the IEEE Int. Conf. on High Performance Computing and Communications , pp. 123-130
- Pichel, J.C.¹ Singh, D.E.² Carretero, J.³

25
- 3042576437
- Improving performance of sparse matrix-vector multiplication
- A. Pinar, M. Heath, Improving performance of sparse matrix-vector multiplication, in: Proc. of Supercomputing, 1999.
- (1999) Proc. of Supercomputing
- Pinar, A.¹ Heath, M.²

26
- 0031269220
- Improving memory-system performance of sparse matrix-vector multiplication
- S. Toledo, Improving memory-system performance of sparse matrix-vector multiplication, in: Proc. of the 8th SIAM Conf. on Parallel Processing for Scientific Computing, 1997, pp. 711-726.
- (1997) Proc. of the 8th SIAM Conf. on Parallel Processing for Scientific Computing , pp. 711-726
- Toledo, S.¹

27
- 84856529095
- Light-weight communications on Intel's single-chip cloud computer processor
- R.F. van der Wijngaart, T. Mattson, and W. Haas Light-weight communications on Intel's single-chip cloud computer processor ACM SIGOPS Oper. Syst. Rev. 45 2011 73 83
- (2011) ACM SIGOPS Oper. Syst. Rev. , vol.45 , pp. 73-83
- Van Der Wijngaart, R.F.¹ Mattson, T.² Haas, W.³

28
- 33646389518
- Fast sparse matrix-vector multiplication by exploiting variable block structure
- Lecture Notes in Computer Science
- R. Vuduc, and H. Moon Fast sparse matrix-vector multiplication by exploiting variable block structure High Performance Computing and Communications Lecture Notes in Computer Science vol. 3726 2005 807 816
- (2005) High Performance Computing and Communications , vol.3726 , pp. 807-816
- Vuduc, R.¹ Moon, H.²

29
- 56749158843
- Optimization of sparse matrix-vector multiply on emerging multicore platforms
- S. Williams, L. Oliker, R. Vuduc, J. Shalf, K. Yelick, J. Demmel, Optimization of sparse matrix-vector multiply on emerging multicore platforms, in: Proc. of the ACM/IEEE Conf. on Supercomputing, 2007, pp. 38:1-38:12.
- (2007) Proc. of the ACM/IEEE Conf. on Supercomputing , pp. 38:1-38:12
- Williams, S.¹ Oliker, L.² Vuduc, R.³ Shalf, J.⁴ Yelick, K.⁵ Demmel, J.⁶

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.