메뉴 건너뛰기




Volumn 37, Issue 12, 2011, Pages 806-819

Two-dimensional cache-oblivious sparse matrix-vector multiplication

Author keywords

Cache oblivious; Fine grain; Matrix vector multiplication; Parallel computing; Recursive bipartitioning; Sparse matrix

Indexed keywords

CACHE-OBLIVIOUS; FINE-GRAIN; MATRIX VECTOR MULTIPLICATION; RECURSIVE BIPARTITIONING; SPARSE MATRICES;

EID: 81355148805     PISSN: 01678191     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.parco.2011.08.004     Document Type: Conference Paper
Times cited : (30)

References (20)
  • 2
    • 0033360524 scopus 로고    scopus 로고
    • Hypergraph-partitioning-based decomposition for parallel sparse-matrix vector multiplication
    • DOI 10.1109/71.780863
    • U.V. atalyürek, and C. Aykanat Hypergraph-partitioning-based decomposition for parallel sparse-matrix vector multiplication IEEE Trans. Parallel Distrib. Syst. 10 1999 673 693 (Pubitemid 30500688)
    • (1999) IEEE Transactions on Parallel and Distributed Systems , vol.10 , Issue.7 , pp. 673-693
    • Catalyurek, U.V.1    Aykanat, C.2
  • 6
    • 1542392269 scopus 로고    scopus 로고
    • On reducing TLB misses in matrix multiplication
    • University of Texas at Austin, Department of Computer Sciences, 2002. FLAME Working Note #9
    • K. Goto, R. van de Geijn, On reducing TLB misses in matrix multiplication, Tech. Rep. TR-2002-55, University of Texas at Austin, Department of Computer Sciences, 2002. FLAME Working Note #9.
    • Tech. Rep. TR-2002-55
    • Goto, K.1    Geijn De R.Van2
  • 8
    • 84949647432 scopus 로고    scopus 로고
    • Optimizing Sparse Matrix Computations for Register Reuse in SPARSITY
    • Computational Science - ICCS 2001
    • E.-J. Im, K. Yelick, Optimizing sparse matrix computations for register reuse in SPARSITY, in: Proceedings International Conference on Computational Science, Part I, Lecture Notes in Computer Science, vol. 2073, 2001, pp. 127-136. (Pubitemid 33285441)
    • (2001) Lecture Notes in Computer Science , Issue.2073 , pp. 127-136
    • Im, E.-J.1    Yelick, K.2
  • 10
    • 70449690102 scopus 로고    scopus 로고
    • Analyzing block locality in Morton-order and Morton-hybrid matrices
    • K.P. Lorton, and D.S. Wise Analyzing block locality in Morton-order and Morton-hybrid matrices SIGARCH Comput. Archit. News 35 2007 6 12
    • (2007) SIGARCH Comput. Archit. News , vol.35 , pp. 6-12
    • Lorton, K.P.1    Wise, D.S.2
  • 11
    • 0003460690 scopus 로고
    • A computer oriented geodetic data base and a new technique in file sequencing
    • IBM, Ottawa, Canada, March
    • G. Morton, A computer oriented geodetic data base and a new technique in file sequencing, Tech. Rep., IBM, Ottawa, Canada, March 1966.
    • (1966) Tech. Rep.
    • Morton, G.1
  • 13
    • 0031269220 scopus 로고    scopus 로고
    • Improving the memory-system performance of sparse-matrix vector multiplication
    • S. Toledo Improving the memory-system performance of sparse-matrix vector multiplication IBM J. Res. Dev. 41 1997 711 725 (Pubitemid 127557044)
    • (1997) IBM Journal of Research and Development , vol.41 , Issue.6 , pp. 711-725
    • Toledo, S.1
  • 14
    • 0037173976 scopus 로고    scopus 로고
    • A framework for high-performance matrix multiplication based on hierarchical abstractions, algorithms and optimized low-level kernels
    • DOI 10.1002/cpe.630
    • V. Valsalam, and A. Skjellum A framework for high-performance matrix multiplication based on hierarchical abstractions, algorithms and optimized low-level kernels Concurrency Comput.: Practice Exp. 14 2002 805 839 (Pubitemid 34965359)
    • (2002) Concurrency Computation Practice and Experience , vol.14 , Issue.10 , pp. 805-839
    • Valsalam, V.1    Skjellum, A.2
  • 15
    • 40449112015 scopus 로고    scopus 로고
    • Memory hierarchy in cache-based systems
    • Sun Microsystems, Inc., Santa Clara, CA, Nov.
    • R. van der Pas, Memory hierarchy in cache-based systems, Tech. Rep. 817-0742-10, Sun Microsystems, Inc., Santa Clara, CA, Nov. 2002.
    • (2002) Tech. Rep. 817-0742-10
    • Pas Der R.Van1
  • 16
    • 17444414573 scopus 로고    scopus 로고
    • A two-dimensional data distribution method for parallel sparse matrix-vector multiplication
    • DOI 10.1137/S0036144502409019
    • B. Vastenhouw, and R.H. Bisseling A two-dimensional data distribution method for parallel sparse matrix-vector multiplication SIAM Rev. 47 2005 67 95 (Pubitemid 40535972)
    • (2005) SIAM Review , vol.47 , Issue.1 , pp. 67-95
    • Vastenhouw, B.1    Bisseling, R.H.2
  • 17
    • 24344485098 scopus 로고    scopus 로고
    • OSKI: A library of automatically tuned sparse matrix kernels
    • DOI 10.1088/1742-6596/16/1/071
    • R. Vuduc, J.W. Demmel, and K.A. Yelick OSKI: A library of automatically tuned sparse matrix kernels J. Phys. Conf. Ser. 16 2005 521 530 (Pubitemid 41259393)
    • (2005) Journal of Physics: Conference Series , vol.16 , Issue.1 , pp. 521-530
    • Vuduc, R.1    Demmel, J.W.2    Yelick, K.A.3
  • 18
    • 0343462141 scopus 로고    scopus 로고
    • Automated empirical optimizations of software and the ATLAS project
    • DOI 10.1016/S0167-8191(00)00087-9
    • R.C. Whaley, A. Petitet, and J.J. Dongarra Automated empirical optimizations of software and the ATLAS project Parallel Comput. 27 2001 3 35 (Pubitemid 32264775)
    • (2001) Parallel Computing , vol.27 , Issue.1-2 , pp. 3-35
    • Clint Whaley, R.1    Petitet, A.2    Dongarra, J.J.3
  • 19
    • 84930675361 scopus 로고    scopus 로고
    • A cache-oblivious sparse matrix-vector multiplication scheme based on the Hilbert curve
    • Springer, in press
    • A.N. Yzelman, R.H. Bisseling, A cache-oblivious sparse matrix-vector multiplication scheme based on the Hilbert curve, in Progress in Industrial Mathematics at ECMI 2010, Springer, in press.
    • (2010) Progress in Industrial Mathematics at ECMI
    • Yzelman, A.N.1    Bisseling, R.H.2
  • 20
    • 77954707501 scopus 로고    scopus 로고
    • Cache-oblivious sparse matrix-vector multiplication by using sparse matrix partitioning methods
    • A.N. Yzelman, and R.H. Bisseling Cache-oblivious sparse matrix-vector multiplication by using sparse matrix partitioning methods SIAM J. Scientif. Comput. 31 2009 3128 3154
    • (2009) SIAM J. Scientif. Comput. , vol.31 , pp. 3128-3154
    • Yzelman, A.N.1    Bisseling, R.H.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.