SCOPUS Information Retrieval Platform

IEEE Transactions on Parallel and Distributed Systems

Volume 24, Issue 10, 2013, Pages 1930-1940

An extended compression format for the optimization of sparse matrix-vector multiplication

(5) Karakasis, Vasileios a Gkountouvas, Theodoros a Kourtis, Kornilios b Goumas, Georgios a Koziris, Nectarios a

a NATIONAL TECHNICAL UNIVERSITY OF ATHENS NTUA (Greece)

b ETH ZURICH (Switzerland)

Author keywords

data compression; multicore optimizations; Sparse Matrix Vector Multiplication

Indexed keywords

COMPRESSION TECHNIQUES; INDEXING STRUCTURES; MEMORY FOOTPRINT; MEMORY SUBSYSTEMS; MULTI CORE; MULTIPHYSICS SIMULATIONS; SPARSE MATRIX-VECTOR MULTIPLICATION; STABLE PERFORMANCE;

COMPUTER SOFTWARE; DATA COMPRESSION; OPTIMIZATION;

MATRIX ALGEBRA;

EID: 84883314318 PISSN: 10459219 EISSN: None Source Type: Journal
DOI: 10.1109/TPDS.2012.290 Document Type: Article

Times cited: (36)

References (28)

1
- 35648995516
- The landscape of parallel computing research: A view from Berkeley
- Univ. of California, Berkeley
- K. Asanovic, R. Bodik, B.C. Catanzaro, J.J. Gebis, P. Husbands, K. Keutzer, D.A. Patterson, W.L. Plishker, J. Shalf, S.W. Williams, and K.A. Yelick, "The Landscape of Parallel Computing Research: A View from Berkeley," Technical Report UCB/EECS-2006-183, Univ. of California, Berkeley, 2006.
- (2006) Technical Report UCB/EECS-2006-183
- Asanovic, K.¹ Bodik, R.² Catanzaro, B.C.³ Gebis, J.J.⁴ Husbands, P.⁵ Keutzer, K.⁶ Patterson, D.A.⁷ Plishker, W.L.⁸ Shalf, J.⁹ Williams, S.W.¹⁰ Yelick, K.A.¹¹

2
- 70450227686
- Performance evaluation of the sparse matrix-vector multiplication on modern architectures
- G. Goumas, K. Kourtis, N. Anastopoulos, V. Karakasis, and N. Koziris, "Performance Evaluation of the Sparse Matrix-Vector Multiplication on Modern Architectures," J. Supercomputing, vol. 50, no. 1, pp. 36-77, 2009.
- (2009) J. Supercomputing , vol.50 , Issue.1 , pp. 36-77
- Goumas, G.¹ Kourtis, K.² Anastopoulos, N.³ Karakasis, V.⁴ Koziris, N.⁵

3
- 56749158843
- Optimization of sparse matrix-vector multiplication on emerging multicore platforms
- S. Williams, L. Oliker, R. Vuduc, J. Shalf, K. Yelick, and J. Demmel, "Optimization of Sparse Matrix-Vector Multiplication on Emerging Multicore Platforms," Proc. ACM/IEEE Conf. Supercomputing, 2007.
- (2007) Proc. ACM/IEEE Conf. Supercomputing
- Williams, S.¹ Oliker, L.² Vuduc, R.³ Shalf, J.⁴ Yelick, K.⁵ Demmel, J.⁶

4
- 65949107549
- Roofline: An insightful visual performance model for multicore architectures
- Apr
- S. Williams, A. Waterman, and D. Patterson, "Roofline: An Insightful Visual Performance Model for Multicore Architectures," Comm. ACM-A Direct Path to Dependable Software, vol. 52, no. 4, pp. 65-76, Apr. 2009.
- (2009) Comm ACM A Direct Path to Dependable Software , vol.52 , Issue.4 , pp. 65-76
- Williams, S.¹ Waterman, A.² Patterson, D.³

5
- 0003521777
- Manchester Univ. Press
- Y. Saad, Numerical Methods for Large Eigenvalue Problems. Manchester Univ. Press, 1992.
- (1992) Numerical Methods for Large Eigenvalue Problems
- Saad, Y.¹

6
- 84983621818
- A high performance algorithm using pre-processing for the sparse matrix-vector multiplication
- R.C. Agarwal, F.G. Gustavson, and M. Zubair, "A High Performance Algorithm Using Pre-Processing for the Sparse Matrix-Vector Multiplication," Proc. ACM/IEEE Conf. Supercomputing, pp. 32-41, 1992.
- (1992) Proc. ACM/IEEE Conf. Supercomputing , pp. 32-41
- Agarwal, R.C.¹ Gustavson, F.G.² Zubair, M.³

7
- 85031264203
- Improving performance of sparse matrix-vector multiplication
- A. Pinar and M.T. Heath, "Improving Performance of Sparse Matrix-Vector Multiplication," Proc. ACM/IEEE Conf. Supercomputing, 1999.
- (1999) Proc. ACM/IEEE Conf. Supercomputing
- Pinar, A.¹ Heath, M.T.²

8
- 84949647432
- Optimizing sparse matrix computations for register reuse in SPARSITY
- E.-J. Im and K.A. Yelick, "Optimizing Sparse Matrix Computations for Register Reuse in SPARSITY," Proc. Int'l Conf. Computational Sciences-Part I, pp. 127-136, 2001.
- (2001) Proc. Int'l Conf. Computational Sciences-Part I , pp. 127-136
- Im, E.-J.¹ Yelick, K.A.²

9
- 0035370546
- Towards a fast parallel sparse matrix-vector multiplication
- R. Geus and S. Rollin, "Towards a Fast Parallel Sparse Matrix-Vector Multiplication," Parallel Computing, vol. 27, pp. 883-896, 2001.
- (2001) Parallel Computing , vol.27 , pp. 883-896
- Geus, R.¹ Rollin, S.²

10
- 55849146932
- Optimizing sparse matrix-vector multiplication using index and value compression
- K. Kourtis, G. Goumas, and N. Koziris, "Optimizing Sparse Matrix-Vector Multiplication Using Index and Value Compression," Proc. Fifth Conf. Computing Frontiers, 2008.
- (2008) Proc. Fifth Conf. Computing Frontiers
- Kourtis, K.¹ Goumas, G.² Koziris, N.³

11
- 3042658703
- LLVM: A compilation framework for lifelong program analysis & transformation
- C. Lattner and V. Adve, "LLVM: A Compilation Framework for Lifelong Program Analysis & Transformation," Proc. Int'l Symp. Code Generation and Optimization (CGO '04), http://www.llvm.org/, 2004.
- (2004) Proc. Int'l Symp. Code Generation and Optimization (CGO '04)
- Lattner, C.¹ Adve, V.²

12
- 84874208056
- ELMER-A finite element solver for multiphysics
- M. Lyly, J. Ruokolainen, and E. Jarvinen, "ELMER-A Finite Element Solver for Multiphysics," CSC Report Scientific Computing, http://www.csc.fi/english/pages/elmer, 1999.
- (1999) CSC Report Scientific Computing
- Lyly, M.¹ Ruokolainen, J.² Jarvinen, E.³

13
- 1842829625
- SIAM
- Y. Saad, Iterative Methods for Sparse Linear Systems. SIAM, 2003.
- (2003) Iterative Methods for Sparse Linear Systems
- Saad, Y.¹

14
- 78650279432
- Pattern-based sparse matrix representation for memory-efficient SMVM kernels
- M. Belgin, G. Back, and C.J. Ribbens, "Pattern-Based Sparse Matrix Representation for Memory-Efficient SMVM Kernels," Proc. 23rd Int'l Conf. Supercomputing (ICS '09), pp. 100-109, 2009.
- (2009) Proc. 23rd Int'l Conf. Supercomputing (ICS '09) , pp. 100-109
- Belgin, M.¹ Back, G.² Ribbens, C.J.³

15
- 79952786461
- CSX: An extended compression format for SpMV on shared memory systems
- K. Kourtis, V. Karakasis, G. Goumas, and N. Koziris, "CSX: An Extended Compression Format for SpMV on Shared Memory Systems," Proc. 16th ACM SIGPLAN Ann. Symp. Principles and Practice of Parallel Programming (PPoPP '11), pp. 247-256, 2011.
- (2011) Proc. 16th ACM SIGPLAN Ann. Symp. Principles and Practice of Parallel Programming (PPoPP '11) , pp. 247-256
- Kourtis, K.¹ Karakasis, V.² Goumas, G.³ Koziris, N.⁴

16
- 70449913281
- Exploring the effect of block shapes on the performance of sparse kernels
- V. Karakasis, G. Goumas, and N. Koziris, "Exploring the Effect of Block Shapes on the Performance of Sparse Kernels," Proc. IEEE Int'l Symp. Parallel and Distributed Processing, pp. 1-8, 2009.
- (2009) Proc. IEEE Int'l Symp. Parallel and Distributed Processing , pp. 1-8
- Karakasis, V.¹ Goumas, G.² Koziris, N.³

17
- 81355161778
- The University of Florida sparse matrix collection
- T. Davis and Y. Hu, "The University of Florida Sparse Matrix Collection," ACM Trans. Math. Software, vol. 38, pp. 1-25, 2011.
- (2011) ACM Trans. Math. Software , vol.38 , pp. 1-25
- Davis, T.¹ Hu, Y.²

18
- 84937995839
- Direct solutions of sparse network equations by optimally ordered triangular factorization
- Nov
- W. Tinney and J. Walker, "Direct Solutions of Sparse Network Equations by Optimally Ordered Triangular Factorization," Proc. IEEE, vol. 55, no. 11, pp. 1801-1809, Nov. 1967.
- (1967) Proc. IEEE , vol.55 , Issue.11 , pp. 1801-1809
- Tinney, W.¹ Walker, J.²

19
- 84976809508
- A survey of indexing techniques for sparse matrices
- U.W. Pooch and A. Nieder, "A Survey of Indexing Techniques for Sparse Matrices," ACM Computing Surveys, vol. 5, pp. 109-133, 1973.
- (1973) ACM Computing Surveys , vol.5 , pp. 109-133
- Pooch, U.W.¹ Nieder, A.²

20
- 0003550735
- Y. Saad, "SPARSKIT: A Basic Tool Kit for Sparse Matrix Computations," 1994.
- (1994) SPARSKIT: A Basic Tool Kit for Sparse Matrix Computations
- Saad, Y.¹

21
- 1542501019
- Sparsity: Optimization framework for sparse matrix kernels
- E.-J. Im, K. Yelick, and R. Vuduc, "Sparsity: Optimization Framework for Sparse Matrix Kernels," Int'l J. High Performance Computing Applications, vol. 18, pp. 135-158, 2004.
- (2004) Int'l J. High Performance Computing Applications , vol.18 , pp. 135-158
- Im, E.-J.¹ Yelick, K.² Vuduc, R.³

22
- 33646389518
- Fast sparse matrix-vector multiplication by exploiting variable block structure
- R.W. Vuduc and H.-J. Moon, "Fast Sparse Matrix-Vector Multiplication by Exploiting Variable Block Structure," Proc. First Int'l Conf. High Performance Computing and Comm., pp. 807-816, 2005.
- (2005) Proc. First Int'l Conf. High Performance Computing and Comm , pp. 807-816
- Vuduc, R.W.¹ Moon, H.-J.²

23
- 84990830919
- Performance optimizations and bounds for sparse matrix-vector multiply
- R. Vuduc, J.W. Demmel, K.A. Yelick, S. Kamil, R. Nishtala, and B. Lee, "Performance Optimizations and Bounds for Sparse Matrix-Vector Multiply," Proc. ACM/IEEE Conf. Supercomputing, pp. 1-35, 2002.
- (2002) Proc. ACM/IEEE Conf. Supercomputing , pp. 1-35
- Vuduc, R.¹ Demmel, J.W.² Yelick, K.A.³ Kamil, S.⁴ Nishtala, R.⁵ Lee, B.⁶

24
- 24344485098
- OSKI: A library of automatically tuned sparse matrix kernels
- R. Vuduc, J.W. Demmel, and K.A. Yelick, "OSKI: A Library of Automatically Tuned Sparse Matrix Kernels," J. Physics: Conf. Series, vol. 16, no. 521, 2005.
- (2005) J. Physics: Conf. Series , vol.16 , Issue.521
- Vuduc, R.¹ Demmel, J.W.² Yelick, K.A.³

25
- 70449629588
- Parallel sparse matrix-vector and matrix-transpose-vector multiplication using compressed sparse blocks
- A. Buluç, J.T. Fineman, M. Frigo, J.R. Gilbert, and C.E. Leiserson, "Parallel Sparse Matrix-Vector and Matrix-Transpose-Vector Multiplication Using Compressed Sparse Blocks," Proc. 21st Ann. Symp. Parallelism in Algorithms and Architectures (SPAA '09), pp. 233-244, 2009.
- (2009) Proc. 21st Ann. Symp. Parallelism in Algorithms and Architectures (SPAA '09) , pp. 233-244
- Buluç, A.¹ Fineman, J.T.² Frigo, M.³ Gilbert, J.R.⁴ Leiserson, C.E.⁵

26
- 80053263342
- Reduced-bandwidth multithreaded algorithms for sparse matrix-vector multiplication
- A. Buluç, S. Williams, L. Oliker, and J. Demmel, "Reduced-Bandwidth Multithreaded Algorithms for Sparse Matrix-Vector Multiplication," Proc. IEEE Int'l Parallel and Distributed Processing Symp., pp. 721-733, 2011.
- (2011) Proc. IEEE Int'l Parallel and Distributed Processing Symp , pp. 721-733
- Buluç, A.¹ Williams, S.² Oliker, L.³ Demmel, J.⁴

27
- 34547468948
- Accelerating sparse matrix computations via data compression
- J. Willcock and A. Lumsdaine, "Accelerating Sparse Matrix Computations via Data Compression," Proc. 20th Ann. Int'l Conf. Supercomputing, pp. 307-316, 2006.
- (2006) Proc. 20th Ann. Int'l Conf. Supercomputing , pp. 307-316
- Willcock, J.¹ Lumsdaine, A.²

28
- 78651410164
- Exploiting compression opportunities to improve SpMxV performance on shared memory systems
- article 16
- K. Kourtis, G. Goumas, and N. Koziris, "Exploiting Compression Opportunities to Improve SpMxV Performance on Shared Memory Systems," ACM Trans. Architecture and Code Optimization, vol. 7, no. 3, article 16, 2010.
- (2010) ACM Trans. Architecture and Code Optimization , vol.7 , Issue.3
- Kourtis, K.¹ Goumas, G.² Koziris, N.³

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.