메뉴 건너뛰기




Volumn 21, Issue 15, 2009, Pages 1838-1856

Increasing data reuse of sparse algebra codes on simultaneous multithreading architectures

Author keywords

Data reuse; Irregular codes; Locality; Multithreading; Sparse algebra codes; Sparse matrix

Indexed keywords

MATRIX ALGEBRA; MEMORY ARCHITECTURE; MULTITASKING;

EID: 70349124898     PISSN: 15320626     EISSN: 15320634     Source Type: Journal    
DOI: 10.1002/cpe.1404     Document Type: Article
Times cited : (5)

References (35)
  • 1
    • 0029666641 scopus 로고    scopus 로고
    • Exploiting choice: Instruction fetch and issue on an implementable simultaneous multithreading processor
    • Philadelphia, U.S.A
    • Tullsen DM, Eggers SJ, Emer JS, Levy HM, Lo JL, Stamm RL. Exploiting choice: Instruction fetch and issue on an implementable simultaneous multithreading processor. ISCA, Philadelphia, U.S.A., 1996; 191-202.
    • (1996) ISCA , pp. 191-202
    • Tullsen, D.M.1    Eggers, S.J.2    Emer, J.S.3    Levy, H.M.4    Lo, J.L.5    Stamm, R.L.6
  • 2
    • 0031199614 scopus 로고    scopus 로고
    • Converting thread-level parallelism to instruction-level parallelism via simultaneous multithreading
    • Lo JL, Emer JS, Levy HM, Stamm RL, Tullsen DM. Converting thread-level parallelism to instruction-level parallelism via simultaneous multithreading. ACM Transactions on Computer Systems 1997; 15(3):322-354.
    • (1997) ACM Transactions on Computer Systems , vol.15 , Issue.3 , pp. 322-354
    • Lo, J.L.1    Emer, J.S.2    Levy, H.M.3    Stamm, R.L.4    Tullsen, D.M.5
  • 6
    • 5144222846 scopus 로고    scopus 로고
    • The block lanczos method for linear systems with multiple right-hand sides
    • Guennounia AE, Jbilou K, Sadok H. The block lanczos method for linear systems with multiple right-hand sides. Applied Numerical Mathematics 2004; 51(2-3):243-256.
    • (2004) Applied Numerical Mathematics , vol.51 , Issue.2-3 , pp. 243-256
    • Guennounia, A.E.1    Jbilou, K.2    Sadok, H.3
  • 9
    • 0001803542 scopus 로고
    • Several strategies for reducing the bandwidth of matrices
    • Rose DJ, Willoughby RA (eds.). Plenum Press: New York
    • Cuthill E, McKee J. Several strategies for reducing the bandwidth of matrices. Sparse Matrices and their Applications, Rose DJ, Willoughby RA (eds.). Plenum Press: New York, 1972.
    • (1972) Sparse Matrices and Their Applications
    • Cuthill, E.1    McKee, J.2
  • 11
    • 0036734103 scopus 로고    scopus 로고
    • Effects of ordering strategies and programming paradigms on sparse matrix computations
    • Oliker L, Li X, Husbands P, Biswas R. Effects of ordering strategies and programming paradigms on sparse matrix computations. SIAM Review 2002; 44(3):373-393.
    • (2002) SIAM Review , vol.44 , Issue.3 , pp. 373-393
    • Oliker, L.1    Li, X.2    Husbands, P.3    Biswas, R.4
  • 12
    • 0033189408 scopus 로고    scopus 로고
    • Memory hierarchy performance prediction for blocked sparse algorithms
    • Fraguela BB, Doallo R, Zapata EL. Memory hierarchy performance prediction for blocked sparse algorithms. Parallel Processing Letters 1999; 9(3):347-360.
    • (1999) Parallel Processing Letters , vol.9 , Issue.3 , pp. 347-360
    • Fraguela, B.B.1    Doallo, R.2    Zapata, E.L.3
  • 15
    • 3042576437 scopus 로고    scopus 로고
    • Improving performance of sparse matrix-vector multiplication
    • Portland, OR
    • Pinar A, Heath M. Improving performance of sparse matrix-vector multiplication. Proceedings of Supercomputing, Portland, OR, 1999.
    • (1999) Proceedings of Supercomputing
    • Pinar, A.1    Heath, M.2
  • 17
    • 25644439819 scopus 로고    scopus 로고
    • Performance optimization of irregular codes based on the combination of reordering and blocking techniques
    • Pichel JC, Heras DB, Cabaleiro JC, Rivera FF. Performance optimization of irregular codes based on the combination of reordering and blocking techniques. Parallel Computing 2005; 31(8-9):858-876.
    • (2005) Parallel Computing , vol.31 , Issue.8-9 , pp. 858-876
    • Pichel, J.C.1    Heras, D.B.2    Cabaleiro, J.C.3    Rivera, F.F.4
  • 20
    • 0242370926 scopus 로고    scopus 로고
    • Code and data transformations for improving shared cache performance on SMT processors
    • Tokyo-Odaiba, Japan
    • Nikolopoulos DS. Code and data transformations for improving shared cache performance on SMT processors. International Symposium on High Performance Computing, Tokyo-Odaiba, Japan, 2003; 54-69.
    • (2003) International Symposium on High Performance Computing , pp. 54-69
    • Nikolopoulos, D.S.1
  • 23
    • 0035370397 scopus 로고    scopus 로고
    • Modeling data locality for the sparse matrix-vector product using distance measures
    • Heras DB, Cabaleiro JC, Rivera FF. Modeling data locality for the sparse matrix-vector product using distance measures. Parallel Computing 2001; 27:897-912.
    • (2001) Parallel Computing , vol.27 , pp. 897-912
    • Heras, D.B.1    Cabaleiro, J.C.2    Rivera, F.F.3
  • 26
    • 0042415671 scopus 로고    scopus 로고
    • An overview of the sparse basic linear algebra subprograms: The new standard from the BLAS technical forum
    • Duff I, Heroux M, Pozo R. An overview of the sparse basic linear algebra subprograms: The new standard from the BLAS technical forum. ACM Transactions on Mathematical Software 2002; 28(2):239-267.
    • (2002) ACM Transactions on Mathematical Software , vol.28 , Issue.2 , pp. 239-267
    • Duff, I.1    Heroux, M.2    Pozo, R.3
  • 31
    • 0003197949 scopus 로고    scopus 로고
    • University of Florida sparse matrix collection
    • 15 October 2007
    • Davis T. University of Florida Sparse Matrix Collection. NA Digest 1997; 97(23). http://www.cise.ufl.edu/research/ sparse/matrices [15 October 2007].
    • (1997) NA Digest , vol.97 , pp. 23
    • Davis, T.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.