메뉴 건너뛰기




Volumn 2141 LNCS, Issue , 2001, Pages 26-38

Fractal matrix multiplication: A case study on portability of cache performance

Author keywords

[No Author keywords available]

Indexed keywords

ALGORITHM ENGINEERING; CACHE PERFORMANCE; FLOATING POINTS; MATRIX MULTIPLICATION; MEMORY HIERARCHY; MEMORY SYSTEMS; MULTIPLICATION ALGORITHMS; ONE-MACHINE; PRIORI KNOWLEDGE; THEORETICAL INVESTIGATIONS;

EID: 78650730334     PISSN: 03029743     EISSN: 16113349     Source Type: Book Series    
DOI: 10.1007/3-540-44688-5_3     Document Type: Conference Paper
Times cited : (6)

References (43)
  • 3
    • 0028483922 scopus 로고
    • The uniform memory hierarchy model of computation
    • B. Alpern, L. Carter, E. Feig and T. Selker: The uniform memory hierarchy model of computation. In Algorithmica, vol. 12, (1994), 72-129.
    • (1994) Algorithmica , vol.12 , pp. 72-129
    • Alpern, B.1    Carter, L.2    Feig, E.3    Selker, T.4
  • 6
    • 0040115155 scopus 로고    scopus 로고
    • Processor-time tradeoffs under bounded-speed message propagation. Part II: Lower bounds
    • G. Bilardi and F. P. Preparata: Processor-time tradeoffs under bounded-speed message propagation. Part II: lower bounds. Theory of Computing Systems, Vol. 32, 531-559, 1999.
    • (1999) Theory of Computing Systems , vol.32 , pp. 531-559
    • Bilardi, G.1    Preparata, F.P.2
  • 18
    • 78650753554 scopus 로고    scopus 로고
    • Engineering and Scientific Subroutine Library
    • Engineering and Scientific Subroutine Library. http://www.rs6000.ibm.com/ resource/aix-resource/sp-books/essl/
  • 20
    • 0030688479 scopus 로고    scopus 로고
    • Auto-blocking matrix-multiplication or tracking BLAS3 performance from source code
    • Proc. 1997 ACM Symp. on Principles and Practice of Parallel Programming, July
    • J. D. Frens and D. S. Wise: Auto-blocking matrix-multiplication or tracking BLAS3 performance from source code. Proc. 1997 ACM Symp. on Principles and Practice of Parallel Programming, SIGPLAN Not. 32, 7 (July 1997), 206-216.
    • (1997) SIGPLAN Not , vol.32 , Issue.7 , pp. 206-216
    • Frens, J.D.1    Wise, D.S.2
  • 23
    • 0027764718 scopus 로고    scopus 로고
    • To copy or not to copy: A compile-time technique for assessing when data copying should be used to eliminate cache conflicts
    • E. D. Granston, W. Jalby and O. Teman: To copy or not to copy: a compile-time technique for assessing when data copying should be used to eliminate cache conflicts. Proceedings of Supercomputing Nov 1993, pg.410-419.
    • Proceedings of Supercomputing Nov 1993 , pp. 410-419
    • Granston, E.D.1    Jalby, W.2    Teman, O.3
  • 25
    • 0031273280 scopus 로고    scopus 로고
    • Recursion leads to automatic variable blocking for dense linear algebra algorithms
    • November
    • F. G. Gustavson: Recursion leads to automatic variable blocking for dense linear algebra algorithms. Journal of Research and Development Volume 41, Number 6, November 1997.
    • (1997) Journal of Research and Development , vol.41 , Issue.6
    • Gustavson, F.G.1
  • 26
    • 84947926251 scopus 로고    scopus 로고
    • Recursive blocked data formats and BLAS's for dense linear algebra algorithms
    • B. Kagstrom et al (eds), Applied Parallel Computing. Large Scale Scientific and Industrial Problems, PARA'98 Proceedings. Springer Verlag
    • F. Gustavson, A. Henriksson, I. Jonsson, P. Ling, and B. Kagstrom: Recursive blocked data formats and BLAS's for dense linear algebra algorithms. In B. Kagstrom et al (eds), Applied Parallel Computing. Large Scale Scientific and Industrial Problems, PARA'98 Proceedings. Lecture Notes in Computing Science, No. 1541, p. 195-206, Springer Verlag, 1998.
    • (1998) Lecture Notes in Computing Science , Issue.1541 , pp. 195-206
    • Gustavson, F.1    Henriksson, A.2    Jonsson, I.3    Ling, P.4    Kagstrom, B.5
  • 27
    • 54249111153 scopus 로고    scopus 로고
    • Accuracy and stability of numerical algorithms
    • ed.
    • N. J. Higham: Accuracy and stability of numerical algorithms ed. SIAM 1996
    • SIAM 1996
    • Higham, N.J.1
  • 29
    • 0032155342 scopus 로고    scopus 로고
    • Algorithm 784: GEMM-based level 3 BLAS: Portability and optimization issues
    • Sept.
    • B. Kȧgström, P. Ling and C. Van Loan: Algorithm 784: GEMM-based level 3 BLAS: portability and optimization issues. ACM transactions on Mathematical Software, Vol24, No.3, Sept. 1998, pages 303-316
    • (1998) ACM Transactions on Mathematical Software , vol.24 , Issue.3 , pp. 303-316
    • Kagström, B.1    Ling, P.2    Van Loan, C.3
  • 30
    • 0032155271 scopus 로고    scopus 로고
    • GEMM-based level 3 BLAS: High-performance model implementations and performance evaluation benchmark
    • Sept.
    • B. Kȧgström, P. Ling and C. Van Loan: GEMM-based level 3 BLAS: high-performance model implementations and performance evaluation benchmark. ACM transactions on Mathematical Software, Vol24, No.3, Sept. 1998, pages 268-302.
    • (1998) ACM Transactions on Mathematical Software , vol.24 , Issue.3 , pp. 268-302
    • Kagström, B.1    Ling, P.2    Van Loan, C.3
  • 36
    • 34250487811 scopus 로고
    • Gaussian elimination is not optimal
    • V. Strassen: Gaussian elimination is not optimal. Numerische Mathematik 14(3):354-356, 1969.
    • (1969) Numerische Mathematik , vol.14 , Issue.3 , pp. 354-356
    • Strassen, V.1
  • 37
    • 0031496750 scopus 로고    scopus 로고
    • Locality of reference in LU decomposition with partial pivoting
    • Oct.
    • S. Toledo: Locality of reference in LU decomposition with partial pivoting. SIAM J.Matrix Anal. Appl. Vol.18, No. 4, pp.1065-1081, Oct.1997
    • (1997) SIAM J.Matrix Anal. Appl. , vol.18 , Issue.4 , pp. 1065-1081
    • Toledo, S.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.