



Volume 36, Issue 4, 2010, Pages 181-198

Parallel symmetric sparse matrix-vector product on scalar multi-core CPUs

Author keywords

Memory bounded algorithms; Multi core architectures; Symmetric sparse matrix vector multiplication; Unstructured meshes

Indexed keywords

BOUNDED ALGORITHMS; CORE PERFORMANCE; DEGREES OF FREEDOM; DIFFUSION PROBLEMS; DISCRETIZATIONS; MATRIX; MECHANICAL ELEMENTS; MEMORY BANDWIDTHS; MESH NODES; MULTI CORE; MULTICORE ARCHITECTURES; OPTERON; PARALLEL EFFICIENCY; PARALLEL IMPLEMENTATIONS; PREFETCHING; SPARSE MATRICES; SPARSE MATRIX-VECTOR MULTIPLICATION; STORAGE REQUIREMENTS; SUSTAINED PERFORMANCE; SYMMETRIC MATRICES; UNSTRUCTURED MESHES; VARIABLE STRUCTURES;

EID: 77949657892     PISSN: 01678191     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.parco.2010.02.003     Document Type: Article
Times cited: 40

References (35)
  • 2
    • 27844448666
    • Parallel iterative solvers for finite-element methods using an OpenMP/MPI hybrid programming model on the Earth Simulator
    • Nakajima K. Parallel iterative solvers for finite-element methods using an OpenMP/MPI hybrid programming model on the Earth Simulator. Parallel Computing 31 10-12 (2005) 1048-1065
    • (2005) Parallel Computing , vol.31 , Issue.10-12 , pp. 1048-1065
    • Nakajima, K.1
  • 4
    • 17444414573
    • A two-dimensional data distribution method for parallel sparse matrix-vector multiplication
    • Vastenhouw B., and Bisseling R.H. A two-dimensional data distribution method for parallel sparse matrix-vector multiplication. SIAM Review 47 1 (2005) 67-95
    • (2005) SIAM Review , vol.47 , Issue.1 , pp. 67-95
    • Vastenhouw, B.1    Bisseling, R.H.2
  • 5
    • 0036734103
    • Effects of ordering strategies and programming paradigms on sparse matrix computations
    • Oliker L., et al. Effects of ordering strategies and programming paradigms on sparse matrix computations. SIAM Review 44 3 (2002) 373-393
    • (2002) SIAM Review , vol.44 , Issue.3 , pp. 373-393
    • Oliker, L.1
  • 6
    • 60949098907
    • Optimization of sparse matrix-vector multiplication on emerging multicore platforms
    • Williams S., et al. Optimization of sparse matrix-vector multiplication on emerging multicore platforms. Parallel Computing 35 3 (2008) 178-194
    • (2008) Parallel Computing , vol.35 , Issue.3 , pp. 178-194
    • Williams, S.1
  • 7
    • 77949653520
    • P. Kogge et al., ExaScale Computing Study: Technology Challenges in Achieving Exascale Systems, DARPA, AFRL.
  • 8
    • 10044248780
    • Performance models for evaluation and automatic tuning of symmetric sparse matrix-vector multiply
    • B.C. Lee et al., Performance models for evaluation and automatic tuning of symmetric sparse matrix-vector multiply, in: International Conference on Parallel Processing, Proceedings, 2004, pp. 169-176.
    • (2004) International Conference on Parallel Processing, Proceedings , pp. 169-176
    • Lee, B.C.1
  • 9
    • 84873549540
    • S. Balay et al., PETSC Webpage, 2001 [cited; Available from: .
    • (2001) PETSC Webpage
    • Balay, S.1
  • 12
    • 0031269220
    • Improving the memory-system performance of sparse-matrix vector multiplication
    • Toledo S. Improving the memory-system performance of sparse-matrix vector multiplication. IBM Journal of Research and Development 41 6 (1997) 711-725
    • (1997) IBM Journal of Research and Development , vol.41 , Issue.6 , pp. 711-725
    • Toledo, S.1
  • 16
    • 44349158351
    • PT-SCOTCH: a tool for efficient parallel graph ordering
    • Chevalier C., and Pellegrini F. PT-SCOTCH: a tool for efficient parallel graph ordering. Parallel Computing 34 6-8 (2008) 318-331
    • (2008) Parallel Computing , vol.34 , Issue.6-8 , pp. 318-331
    • Chevalier, C.1    Pellegrini, F.2
  • 17
    • 0035370546
    • Towards a fast parallel sparse symmetric matrix-vector multiplication
    • Geus R., and Rollin S. Towards a fast parallel sparse symmetric matrix-vector multiplication. Parallel Computing 27 7 (2001) 883-896
    • (2001) Parallel Computing , vol.27 , Issue.7 , pp. 883-896
    • Geus, R.1    Rollin, S.2
  • 19
    • 3843067315
    • Hybrid method for generation of quadrilateral meshes
    • Rypl D., and Bittnar A. Hybrid method for generation of quadrilateral meshes. Engineering Mechanics 9 1/2 (2002) 49-64
    • (2002) Engineering Mechanics , vol.9 , Issue.1-2 , pp. 49-64
    • Rypl, D.1    Bittnar, A.2
  • 24
    • 35548992612
    • Using mixed precision for sparse matrix computations to enhance the performance while achieving 64-bit accuracy
    • Buttari A., et al. Using mixed precision for sparse matrix computations to enhance the performance while achieving 64-bit accuracy. ACM Transactions on Mathematical Software 34 4 (2008)
    • (2008) ACM Transactions on Mathematical Software , vol.34 , Issue.4
    • Buttari, A.1
  • 25
    • 84877043342
    • High resolution forward and inverse earthquake modeling on terascale computers
    • IEEE Computer Society
    • V. Akçelik et al., High resolution forward and inverse earthquake modeling on terascale computers, in: Proceedings of the 2003 ACM/IEEE Conference on Supercomputing, IEEE Computer Society, 2003, p. 52.
    • (2003) Proceedings of the 2003 ACM/IEEE Conference on Supercomputing , pp. 52
    • Akçelik, V.1
  • 27
    • 0002075716
    • Time-lapse seismic reservoir monitoring
    • Lumley D.E. Time-lapse seismic reservoir monitoring. Geophysics 66 1 (2001) 50-53
    • (2001) Geophysics , vol.66 , Issue.1 , pp. 50-53
    • Lumley, D.E.1
  • 28
    • 77949658941
    • Multiscale methods and streamline simulation for rapid reservoir performance prediction
    • Aarnes J.E., Kippe V., and Lie K.A. Multiscale methods and streamline simulation for rapid reservoir performance prediction. Progress in Industrial Mathematics at ECMI 2004 8 (2006) 399-403
    • (2006) Progress in Industrial Mathematics at ECMI 2004 , vol.8 , pp. 399-403
    • Aarnes, J.E.1    Kippe, V.2    Lie, K.A.3
  • 29
    • 84881038794
    • A next-generation parallel reservoir simulator for giant reservoirs
    • Society of Petroleum Engineers, The Woodlands, Texas
    • Ali H. Dogru et al., A next-generation parallel reservoir simulator for giant reservoirs, in: Reservoir Simulation Symposium, Society of Petroleum Engineers, The Woodlands, Texas, 2009.
    • (2009) Reservoir Simulation Symposium
    • Dogru, A.H.1
  • 31
    • 0031574568
    • Adaptive local refinement with octree load-balancing for the parallel solution of three-dimensional conservation laws
    • Flaherty J.E., et al. Adaptive local refinement with octree load-balancing for the parallel solution of three-dimensional conservation laws. Journal of Parallel and Distributed Computing (1997)
    • (1997) Journal of Parallel and Distributed Computing
    • Flaherty, J.E.1
  • 32
    • 34547483006
    • A numerical evaluation of sparse direct solvers for the solution of large sparse symmetric linear systems of equations
    • Gould N.I.M., Scott J.A., and Hu Y. A numerical evaluation of sparse direct solvers for the solution of large sparse symmetric linear systems of equations. ACM Transactions on Mathematical Software 33 2 (2007)
    • (2007) ACM Transactions on Mathematical Software , vol.33 , Issue.2
    • Gould, N.I.M.1    Scott, J.A.2    Hu, Y.3
  • 33
    • 77949656908
    • A. Gupta, S. Koric, T. George, Sparse matrix factorization on massively parallel computers, in: SC09, ACM, Portland, OR, USA, 2009.


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.