SCOPUS 정보 검색 플랫폼

Proceedings of the International Conference on Supercomputing

Volumn , Issue , 2012, Pages 195-204

Sparse matrix-vector multiply on the HICAMP architecture

(5) Stevenson, John P a Firoozshahian, Amin b Solomatnikov, Alex b Horowitz, Mark a Cheriton, David a

a STANFORD UNIVERSITY (United States)

b Hicamp Systems (United States)

Author keywords

Deduplication; HICAMP; SpMV

Indexed keywords

CRITICAL TASKS; DATA REUSE; DEDUPLICATION; FULLY COMPATIBLE; HICAMP; INNER LOOPS; LINEAR SYSTEM SOLVER; MAIN-MEMORY; MATRIX VECTOR MULTIPLY; NON-SYMMETRIC MATRICES; PERFORMANCE BOUNDS; PROCESSOR CACHE; RANDOM PATTERN; SPARSE MATRICES; SPECIALIZED HARDWARE; SPMV; STORAGE FORMATS; SYMMETRIC MATRICES; THREAD SYNCHRONIZATION;

COMPUTER ARCHITECTURE; INTELLIGENT CONTROL; LINEAR SYSTEMS;

MATRIX ALGEBRA;

EID: 84864047435 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1145/2304576.2304603 Document Type: Conference Paper

Times cited : (9)

References (25)

1
- 84864027319
- Technologies for data-intensive computing
- October
- A. Bechtolsheim. Technologies for Data-Intensive Computing. In Proceedings of the 13th International Workshop on High Performance Transaction Systems. HPTS, October 2009.
- (2009) Proceedings of the 13th International Workshop on High Performance Transaction Systems. HPTS
- Bechtolsheim, A.¹

2
- 78650279432
- Pattern-based sparse matrix representation for memory-efficient SMVM kernels
- ACM, June
- M. Belgin, G. Back, and C. J. Ribbens. Pattern-based Sparse Matrix Representation for Memory-Efficient SMVM Kernels. In Proceedings of the 23rd International Conference on Supercomputing (ICS '09), pages 100-109. ACM, June 2009.
- (2009) Proceedings of the 23rd International Conference on Supercomputing (ICS '09) , pp. 100-109
- Belgin, M.¹ Back, G.² Ribbens, C.J.³

3
- 77956260008
- Implementing sparse matrix-vector multiplication on throughput-oriented processors
- ACM, November
- N. Bell and M. Garland. Implementing Sparse Matrix-Vector Multiplication on Throughput-Oriented Processors. In Proceedings of Supercomputing (SC '09), pages 18:1-18:11. ACM, November 2009.
- (2009) Proceedings of Supercomputing (SC '09) , pp. 181-1811
- Bell, N.¹ Garland, M.²

4
- 84864034684
- Hierarchical diagonal blocking and precision reduction applied to combinatorial multigrid
- November
- G. Blelloch, I. Koutis, G. Miller, and K. Tangwongsan. Hierarchical Diagonal Blocking and Precision Reduction Applied to Combinatorial Multigrid. In Proceedings of Supercomputing (SC '10), pages 1-12, November 2010.
- (2010) Proceedings of Supercomputing (SC '10) , pp. 1-12
- Blelloch, G.¹ Koutis, I.² Miller, G.³ Tangwongsan, K.⁴

5
- 79955370378
- The future of microprocessors
- May
- S. Borkar and A. A. Chien. The Future of Microprocessors. Communications of the ACM, 54:67-77, May 2011.
- (2011) Communications of the ACM , vol.54 , pp. 67-77
- Borkar, S.¹ Chien, A.A.²

6
- 70449629588
- Parallel sparse matrix-vector and matrix-transpose-vector multiplication using compressed sparse blocks
- ACM
- A. Buluç, J. T. Fineman, M. Frigo, J. R. Gilbert, and C. E. Leiserson. Parallel Sparse Matrix-Vector and Matrix-Transpose-Vector Multiplication Using Compressed Sparse Blocks. In Proceedings of the 21st Annual Symposium on Parallelism in Algorithms and Architectures, SPAA '09, pages 233-244. ACM, 2009.
- (2009) Proceedings of the 21st Annual Symposium on Parallelism in Algorithms and Architectures, SPAA '09 , pp. 233-244
- Buluç, A.¹ Fineman, J.T.² Frigo, M.³ Gilbert, J.R.⁴ Leiserson, C.E.⁵

7
- 80053263342
- Reduced-bandwidth multithreaded algorithms for sparse matrix-vector multiplication
- 2011 IEEE International May
- A. Buluç, S. Williams, L. Oliker, and J. Demmel. Reduced-Bandwidth Multithreaded Algorithms for Sparse Matrix-Vector Multiplication. In International Parallel Distributed Processing Symposium (IPDPS), 2011 IEEE International, pages 721-733, May 2011.
- (2011) International Parallel Distributed Processing Symposium (IPDPS) , pp. 721-733
- Buluç, A.¹ Williams, S.² Oliker, L.³ Demmel, J.⁴

8
- 84864026220
- January U.S. Patent 7650460
- D. R. Cheriton. Hierarchical Immutable Content-Addressable Memory Processor, January 2010. U.S. Patent 7650460.
- (2010) Hierarchical Immutable Content-addressable Memory Processor
- Cheriton, D.R.¹

9
- 84858769083
- HICAMP: Architectural support for efficient concurrency-safe shared structured data access
- New York, NY, USA ACM
- D. R. Cheriton, A. Firoozshahian, A. Solomatnikov, J. P. Stevenson, and O. Azizi. HICAMP: Architectural Support for Efficient Concurrency-Safe Shared Structured Data Access. In Proceedings of the 17th International Conference on Architectural Support for Programming Languages and Operating Systems, ASPLOS '12, pages 287-300, New York, NY, USA, 2012. ACM.
- (2012) Proceedings of the 17th International Conference on Architectural Support for Programming Languages and Operating Systems, ASPLOS '12 , pp. 287-300
- Cheriton, D.R.¹ Firoozshahian, A.² Solomatnikov, A.³ Stevenson, J.P.⁴ Azizi, O.⁵

10
- 81355161778
- The university of florida sparse matrix collection
- November
- T. A. Davis and Y. Hu. The University of Florida Sparse Matrix Collection. ACM Trans. Math. Softw., 38:1:1-1:25, November 2011.
- (2011) ACM Trans. Math. Softw. , vol.38 , pp. 11-125
- Davis, T.A.¹ Hu, Y.²

11
- 0033884908
- Xtensa: A configurable and extensible processor
- DOI 10.1109/40.848473
- R. Gonzalez. Xtensa: A Configurable and Extensible Processor. Micro, IEEE, 20(2):60-70, Mar/Apr 2000. (Pubitemid 30585385)
- (2000) IEEE Micro , vol.20 , Issue.2 , pp. 60-70
- Gonzalez, R.E.¹

12
- 1542501019
- SPARSITY: Framework for optimizing sparse matrix-vector multiply
- February
- E.-J. Im, K. A. Yelick, and R. Vuduc. SPARSITY: Framework for Optimizing Sparse Matrix-Vector Multiply. International Journal of High Performance Computing Applications, 18(1):135-158, February 2004.
- (2004) International Journal of High Performance Computing Applications , vol.18 , Issue.1 , pp. 135-158
- Im, E.-J.¹ Yelick, K.A.² Vuduc, R.³

13
- 55849145179
- Improving the performance of multithreaded sparse matrix-vector multiplication using index and value compression
- September
- K. Kourtis, G. Goumas, and N. Koziris. Improving the Performance of Multithreaded Sparse Matrix-Vector Multiplication Using Index and Value Compression. In 37th International Conference on Parallel Processing (ICPP '08), pages 511-519, September 2008.
- (2008) 37th International Conference on Parallel Processing (ICPP '08) , pp. 511-519
- Kourtis, K.¹ Goumas, G.² Koziris, N.³

14
- 55849146932
- Optimizing sparse matrix-vector multiplication using index and value compression
- ACM, May
- K. Kourtis, G. Goumas, and N. Koziris. Optimizing Sparse Matrix-Vector Multiplication Using Index and Value Compression. In Proceedings of the 5th Conference on Computing Frontiers (CF '08), pages 87-96. ACM, May 2008.
- (2008) Proceedings of the 5th Conference on Computing Frontiers (CF '08) , pp. 87-96
- Kourtis, K.¹ Goumas, G.² Koziris, N.³

15
- 79952786461
- CSX: An extended compression format for SpMV on shared memory systems
- February
- K. Kourtis, V. Karakasis, G. Goumas, and N. Koziris. CSX: An Extended Compression Format for SpMV on Shared Memory Systems. In Proceedings of the 16th ACM Symposium on Principles and Practice of Parallel Programming, pages 247-256, February 2011.
- (2011) Proceedings of the 16th ACM Symposium on Principles and Practice of Parallel Programming , pp. 247-256
- Kourtis, K.¹ Karakasis, V.² Goumas, G.³ Koziris, N.⁴

16
- 77949657892
- Parallel symmetric sparse matrix-vector product on scalar multi-core CPUs
- M. Krotkiewski and M. Dabrowski. Parallel Symmetric Sparse Matrix-Vector Product on Scalar Multi-Core CPUs. Parallel Computing, 36(4):181 - 198, 2010.
- (2010) Parallel Computing , vol.36 , Issue.4 , pp. 181-198
- Krotkiewski, M.¹ Dabrowski, M.²

17
- 79551537591
- Use of hybrid recursive CSR/COO data structures in sparse matrix-vector multiplication
- October
- M. Martone, S. Filippone, P. Gepner, M. Paprzycki, and S. Tucci. Use of Hybrid Recursive CSR/COO Data Structures in Sparse Matrix-Vector Multiplication. In IMCSIT, pages 327-335, October 2010.
- (2010) IMCSIT , pp. 327-335
- Martone, M.¹ Filippone, S.² Gepner, P.³ Paprzycki, M.⁴ Tucci, S.⁵

18
- 0038998034
- Memory bandwidth and machine balance in current high performance computers
- December
- J. D. McCalpin. Memory Bandwidth and Machine Balance in Current High Performance Computers. IEEE Computer Society Technical Committee on Computer Architecture (TCCA) Newsletter, pages 19-25, December 1995.
- (1995) IEEE Computer Society Technical Committee on Computer Architecture (TCCA) Newsletter , pp. 19-25
- McCalpin, J.D.¹

19
- 79958758626
- A sparse matrix personality for the convey HC-1
- IEEE, May
- K. K. Nagar and J. D. Bakos. A Sparse Matrix Personality for the Convey HC-1. In Proceedings of the 19th Annual Symposium on Field-Programmable Custom Computing Machines (FCCM '11), pages 1-8. IEEE, May 2011.
- (2011) Proceedings of the 19th Annual Symposium on Field-programmable Custom Computing Machines (FCCM '11) , pp. 1-8
- Nagar, K.K.¹ Bakos, J.D.²

20
- 0031269220
- Improving the memory-system performance of sparse-matrix vector multiplication
- S. Toledo. Improving the Memory-System Performance of Sparse-Matrix Vector Multiplication. IBM Journal of Research and Development, 41(6):711-725, November 1997. (Pubitemid 127557044)
- (1997) IBM Journal of Research and Development , vol.41 , Issue.6 , pp. 711-725
- Toledo, S.¹

21
- 78649844813
- LIKWID: A lightweight performance-oriented tool suite for x86 multicore environments
- J. Treibig, G. Hager, and G. Wellein. LIKWID: A Lightweight Performance-Oriented Tool Suite for x86 Multicore Environments. In Proceedings the International Workshop on Parallel Software Tools and Tool Infrastructures, 2010.
- (2010) Proceedings the International Workshop on Parallel Software Tools and Tool Infrastructures
- Treibig, J.¹ Hager, G.² Wellein, G.³

22
- 24344485098
- OSKI: A library of automatically tuned sparse matrix kernels
- San Francisco, CA, USA, June Institute of Physics Publishing
- R. Vuduc, J. W. Demmel, and K. A. Yelick. OSKI: A Library of Automatically Tuned Sparse Matrix Kernels. In Proceedings of SciDAC 2005, Journal of Physics: Conference Series, San Francisco, CA, USA, June 2005. Institute of Physics Publishing.
- (2005) Proceedings of SciDAC 2005, Journal of Physics: Conference Series
- Vuduc, R.¹ Demmel, J.W.² Yelick, K.A.³

23
- 84990830919
- Performance optimizations and bounds for sparse matrix-vector multiply
- Baltimore, MD, USA, November
- R. Vuduc, J. W. Demmel, K. A. Yelick, S. Kamil, R. Nishtala, and B. Lee. Performance Optimizations and Bounds for Sparse Matrix-Vector Multiply. In Proceedings of Supercomputing (SC '02), Baltimore, MD, USA, November 2002.
- (2002) Proceedings of Supercomputing (SC '02)
- Vuduc, R.¹ Demmel, J.W.² Yelick, K.A.³ Kamil, S.⁴ Nishtala, R.⁵ Lee, B.⁶

24
- 34547468948
- Accelerating sparse matrix computations via data compression
- DOI 10.1145/1183401.1183444, Proceedings of the 20th Annual International Conference on Supercomputing, ICS 2006
- J. Willcock and A. Lumsdaine. Accelerating Sparse Matrix Computations via Data Compression. In Proceedings of the 20th International Conference on Supercomputing (ICS '06), pages 307-316. ACM, June 2006. (Pubitemid 47168517)
- (2006) Proceedings of the International Conference on Supercomputing , pp. 307-316
- Willcock, J.¹ Lumsdaine, A.²

25
- 56749158843
- Optimization of sparse matrix-vector multiplication on emerging multicore platforms
- ACM, November
- S. Williams, L. Oliker, R. Vuduc, J. Shalf, K. Yelick, and J. Demmel. Optimization of Sparse Matrix-Vector Multiplication on Emerging Multicore Platforms. In Proceedings of Supercomputing (SC '07), pages 38:1-38:12. ACM, November 2007.
- (2007) Proceedings of Supercomputing (SC '07) , pp. 381-3812
- Williams, S.¹ Oliker, L.² Vuduc, R.³ Shalf, J.⁴ Yelick, K.⁵ Demmel, J.⁶

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.