SCOPUS Information Retrieval Platform

Parallel Computing

Volume 37, Issue 12, 2011, Pages 806-819

Two-dimensional cache-oblivious sparse matrix-vector multiplication

(2) Yzelman, A. N.ᵃ; Bisseling, Rob H.ᵃ

ᵃ Utrecht University (Netherlands)

Author keywords

Cache oblivious; Fine grain; Matrix vector multiplication; Parallel computing; Recursive bipartitioning; Sparse matrix

Indexed keywords

CACHE-OBLIVIOUS; FINE-GRAIN; MATRIX VECTOR MULTIPLICATION; RECURSIVE BIPARTITIONING; SPARSE MATRICES;

CACHE MEMORY; DATA STRUCTURES; PARALLEL ARCHITECTURES; PARALLEL PROCESSING SYSTEMS; TWO DIMENSIONAL;

MATRIX ALGEBRA;

EID: 81355148805 PISSN: 01678191 EISSN: None Source Type: Journal
DOI: 10.1016/j.parco.2011.08.004 Document Type: Conference Paper

Times cited: (30)

References (20)

1
- 70449629588
- Parallel sparse matrix-vector and matrix-transpose-vector multiplication using compressed sparse blocks
- ACM
- A. Buluç, J.T. Fineman, M. Frigo, J.R. Gilbert, and C.E. Leiserson Parallel sparse matrix-vector and matrix-transpose-vector multiplication using compressed sparse blocks SPAA '09: Proceedings of the twenty-first annual symposium on Parallelism in algorithms and architectures, New York, NY, USA 2009 ACM 233 244
- (2009) SPAA '09: Proceedings of the Twenty-first Annual Symposium on Parallelism in Algorithms and Architectures, New York, NY, USA , pp. 233-244
- Buluç, A.¹ Fineman, J.T.² Frigo, M.³ Gilbert, J.R.⁴ Leiserson, C.E.⁵

2
- 0033360524
- Hypergraph-partitioning-based decomposition for parallel sparse-matrix vector multiplication
- DOI 10.1109/71.780863
- Ü.V. Çatalyürek and C. Aykanat Hypergraph-partitioning-based decomposition for parallel sparse-matrix vector multiplication IEEE Trans. Parallel Distrib. Syst. 10 1999 673 693 (Pubitemid 30500688)
- (1999) IEEE Transactions on Parallel and Distributed Systems , vol.10 , Issue.7 , pp. 673-693
- Çatalyürek, Ü.V.¹ Aykanat, C.²

3
- 35048838799
- A fine-grain hypergraph model for 2D decomposition of sparse matrices
- IEEE Press, Los Alamitos, CA
- Ü.V. Çatalyürek, C. Aykanat, A fine-grain hypergraph model for 2D decomposition of sparse matrices, in: Proceedings 8th International Workshop on Solving Irregularly Structured Problems in Parallel, IEEE Press, Los Alamitos, CA, 2001, p. 118.
- (2001) Proceedings 8th International Workshop on Solving Irregularly Structured Problems in Parallel , pp. 118
- Çatalyürek, Ü.V.¹ Aykanat, C.²

4
- 81355161778
- The University of Florida Sparse Matrix Collection
- in press
- T.A. Davis, Y. Hu, The University of Florida Sparse Matrix Collection, ACM Transactions on Mathematical Software 38 (2011), in press.
- (2011) ACM Transactions on Mathematical Software , vol.38
- Davis, T.A.¹ Hu, Y.²

5
- 0033350255
- Cache-oblivious algorithms
- IEEE Press Washington, DC
- M. Frigo, C.E. Leiserson, H. Prokop, and S. Ramachandran Cache-oblivious algorithms Proceedings 40th Annual Symposium on Foundations of Computer Science 1999 IEEE Press Washington, DC 285
- (1999) Proceedings 40th Annual Symposium on Foundations of Computer Science , pp. 285
- Frigo, M.¹ Leiserson, C.E.² Prokop, H.³ Ramachandran, S.⁴

6
- 1542392269
- On reducing TLB misses in matrix multiplication
- University of Texas at Austin, Department of Computer Sciences, 2002. FLAME Working Note #9
- K. Goto, R. van de Geijn, On reducing TLB misses in matrix multiplication, Tech. Rep. TR-2002-55, University of Texas at Austin, Department of Computer Sciences, 2002. FLAME Working Note #9.
- Tech. Rep. TR-2002-55
- Goto, K.¹ van de Geijn, R.²

7
- 34250347767
- A Hilbert-order multiplication scheme for unstructured sparse matrices
- DOI 10.1080/17445760601122084, PII 779509037
- G. Haase, M. Liebmann, and G. Plank A Hilbert-order multiplication scheme for unstructured sparse matrices Int. J. Parallel Emer. Distrib. Syst. 22 2007 213 220 (Pubitemid 46925815)
- (2007) International Journal of Parallel, Emergent and Distributed Systems , vol.22 , Issue.4 , pp. 213-220
- Haase, G.¹ Liebmann, M.² Plank, G.³

8
- 84949647432
- Optimizing Sparse Matrix Computations for Register Reuse in SPARSITY
- Computational Science - ICCS 2001
- E.-J. Im, K. Yelick, Optimizing sparse matrix computations for register reuse in SPARSITY, in: Proceedings International Conference on Computational Science, Part I, Lecture Notes in Computer Science, vol. 2073, 2001, pp. 127-136. (Pubitemid 33285441)
- (2001) Lecture Notes in Computer Science , Issue.2073 , pp. 127-136
- Im, E.-J.¹ Yelick, K.²

9
- 17444432688
- Master's thesis, Utrecht University, Department of Mathematics, July
- J. Koster, Parallel templates for numerical linear algebra, a high-performance computation library, Master's thesis, Utrecht University, Department of Mathematics, July 2002.
- (2002) Parallel Templates for Numerical Linear Algebra, A High-performance Computation Library
- Koster, J.¹

10
- 70449690102
- Analyzing block locality in Morton-order and Morton-hybrid matrices
- K.P. Lorton, and D.S. Wise Analyzing block locality in Morton-order and Morton-hybrid matrices SIGARCH Comput. Archit. News 35 2007 6 12
- (2007) SIGARCH Comput. Archit. News , vol.35 , pp. 6-12
- Lorton, K.P.¹ Wise, D.S.²

11
- 0003460690
- A computer oriented geodetic data base and a new technique in file sequencing
- IBM, Ottawa, Canada, March
- G. Morton, A computer oriented geodetic data base and a new technique in file sequencing, Tech. Rep., IBM, Ottawa, Canada, March 1966.
- (1966) Tech. Rep.
- Morton, G.¹

12
- 34547744862
- When cache blocking of sparse matrix vector multiply works and why
- DOI 10.1007/s00200-007-0038-9
- R. Nishtala, R.W. Vuduc, J.W. Demmel, and K.A. Yelick When cache blocking of sparse matrix vector multiply works and why Appl. Algebr. Eng. Commun. Comput. 18 2007 297 311 (Pubitemid 47224626)
- (2007) Applicable Algebra in Engineering, Communications and Computing , vol.18 , Issue.3 , pp. 297-311
- Nishtala, R.¹ Vuduc, R.W.² Demmel, J.W.³ Yelick, K.A.⁴

13
- 0031269220
- Improving the memory-system performance of sparse-matrix vector multiplication
- S. Toledo Improving the memory-system performance of sparse-matrix vector multiplication IBM J. Res. Dev. 41 1997 711 725 (Pubitemid 127557044)
- (1997) IBM Journal of Research and Development , vol.41 , Issue.6 , pp. 711-725
- Toledo, S.¹

14
- 0037173976
- A framework for high-performance matrix multiplication based on hierarchical abstractions, algorithms and optimized low-level kernels
- DOI 10.1002/cpe.630
- V. Valsalam, and A. Skjellum A framework for high-performance matrix multiplication based on hierarchical abstractions, algorithms and optimized low-level kernels Concurrency Comput.: Practice Exp. 14 2002 805 839 (Pubitemid 34965359)
- (2002) Concurrency Computation Practice and Experience , vol.14 , Issue.10 , pp. 805-839
- Valsalam, V.¹ Skjellum, A.²

15
- 40449112015
- Memory hierarchy in cache-based systems
- Sun Microsystems, Inc., Santa Clara, CA, Nov.
- R. van der Pas, Memory hierarchy in cache-based systems, Tech. Rep. 817-0742-10, Sun Microsystems, Inc., Santa Clara, CA, Nov. 2002.
- (2002) Tech. Rep. 817-0742-10
- van der Pas, R.¹

16
- 17444414573
- A two-dimensional data distribution method for parallel sparse matrix-vector multiplication
- DOI 10.1137/S0036144502409019
- B. Vastenhouw, and R.H. Bisseling A two-dimensional data distribution method for parallel sparse matrix-vector multiplication SIAM Rev. 47 2005 67 95 (Pubitemid 40535972)
- (2005) SIAM Review , vol.47 , Issue.1 , pp. 67-95
- Vastenhouw, B.¹ Bisseling, R.H.²

17
- 24344485098
- OSKI: A library of automatically tuned sparse matrix kernels
- DOI 10.1088/1742-6596/16/1/071
- R. Vuduc, J.W. Demmel, and K.A. Yelick OSKI: A library of automatically tuned sparse matrix kernels J. Phys. Conf. Ser. 16 2005 521 530 (Pubitemid 41259393)
- (2005) Journal of Physics: Conference Series , vol.16 , Issue.1 , pp. 521-530
- Vuduc, R.¹ Demmel, J.W.² Yelick, K.A.³

18
- 0343462141
- Automated empirical optimizations of software and the ATLAS project
- DOI 10.1016/S0167-8191(00)00087-9
- R.C. Whaley, A. Petitet, and J.J. Dongarra Automated empirical optimizations of software and the ATLAS project Parallel Comput. 27 2001 3 35 (Pubitemid 32264775)
- (2001) Parallel Computing , vol.27 , Issue.1-2 , pp. 3-35
- Whaley, R.C.¹ Petitet, A.² Dongarra, J.J.³

19
- 84930675361
- A cache-oblivious sparse matrix-vector multiplication scheme based on the Hilbert curve
- Springer, in press
- A.N. Yzelman, R.H. Bisseling, A cache-oblivious sparse matrix-vector multiplication scheme based on the Hilbert curve, in Progress in Industrial Mathematics at ECMI 2010, Springer, in press.
- (2010) Progress in Industrial Mathematics at ECMI
- Yzelman, A.N.¹ Bisseling, R.H.²

20
- 77954707501
- Cache-oblivious sparse matrix-vector multiplication by using sparse matrix partitioning methods
- A.N. Yzelman, and R.H. Bisseling Cache-oblivious sparse matrix-vector multiplication by using sparse matrix partitioning methods SIAM J. Scientif. Comput. 31 2009 3128 3154
- (2009) SIAM J. Scientif. Comput. , vol.31 , pp. 3128-3154
- Yzelman, A.N.¹ Bisseling, R.H.²

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.