메뉴 건너뛰기




Volumn 4, Issue , 1999, Pages 3-

Matrix Multiplication: A Case Study of Enhanced Data Cache Utilization

Author keywords

Algorithms; BLAS; Blocking; Cache; Matrix Multiplication; Performance; Prefetching

Indexed keywords


EID: 84957880482     PISSN: 10846654     EISSN: 10846654     Source Type: Journal    
DOI: 10.1145/347792.347806     Document Type: Article
Times cited : (8)

References (17)
  • 1
    • 0028427170 scopus 로고
    • Improving performance of linear algebra algorithms for dense matrices, using algorithmic prefetch
    • Agarwal, R. C., Gustavson, F. G., and Zubair, M. 1994. Improving performance of linear algebra algorithms for dense matrices, using algorithmic prefetch. IBM J. of Research and Development 38, 3, 265-275.
    • (1994) IBM J. of Research and Development , vol.38 , Issue.3 , pp. 265-275
    • Agarwal, R.C.1    Gustavson, F.G.2    Zubair, M.3
  • 2
    • 0026137116 scopus 로고
    • The cache performance and optimizations of blocked algorithms
    • Callahan, D., Kennedy, K., and Porterfield, A. 1991a. The cache performance and optimizations of blocked algorithms. In Proceedings of ASPLOS’91 (1991), pp. 63-74.
    • (1991) Proceedings of ASPLOS’91 , pp. 63-74
    • Callahan, D.1    Kennedy, K.2    Porterfield, A.3
  • 5
    • 0942292873 scopus 로고    scopus 로고
    • Matrix multiplication: A case study of algorithm engineering
    • (August 1998) Max-Plank-Institut für Informatik
    • Eiron, N., Rodeh, M., and Steinwarts, I. 1998. Matrix multiplication: A case study of algorithm engineering. In Proceedings of WAE’98 (August 1998), pp. 40-52. Max-Plank-Institut für Informatik.
    • (1998) Proceedings of WAE’98 , pp. 40-52
    • Eiron, N.1    Rodeh, M.2    Steinwarts, I.3
  • 6
    • 0028261367 scopus 로고
    • Complexity/performance tradeoffs with non-blocking loads
    • (1994)
    • Farkas, K. and Jouppi, N. 1994. Complexity/performance tradeoffs with non-blocking loads. In Proceedings of JSCA’94 (1994), pp. 211-222.
    • (1994) Proceedings of JSCA’94 , pp. 211-222
    • Farkas, K.1    Jouppi, N.2
  • 7
    • 85024292364 scopus 로고    scopus 로고
    • Personal Communication
    • Gustavson, F. G. 1998. Personal Communication.
    • (1998)
    • Gustavson, F.G.1
  • 11
    • 0042650298 scopus 로고
    • Software pipelining: An effective technique for vliw machines
    • (1988)
    • Lam, M. S. 1988. Software pipelining: An effective technique for vliw machines. In Proceedings of SIGPLAN’88 (1988), pp. 318-328.
    • (1988) Proceedings of SIGPLAN’88 , pp. 318-328
    • Lam, M.S.1
  • 14
    • 84976833735 scopus 로고
    • Design and evaluation of a compiler algorithm for data prefetching
    • (1992)
    • Mowry, T. C., Lam, M. S., and Gupta, A. 1992. Design and evaluation of a compiler algorithm for data prefetching. In Proceedings of ASPLOS’92 (1992), pp. 62-73.
    • (1992) Proceedings of ASPLOS’92 , pp. 62-73
    • Mowry, T.C.1    Lam, M.S.2    Gupta, A.3
  • 15
    • 0029714323 scopus 로고
    • Data prefetching and multilevel blocking for linear algebra operations
    • (1992)
    • Navarro, J. J., Gaecía-Diego, E., and Heeeeeo, J. R. 1992. Data prefetching and multilevel blocking for linear algebra operations. In Proceedings of ICS’96 (1992), pp. 109-116.
    • (1992) Proceedings of ICS’96 , pp. 109-116
    • Navarro, J.J.1    Gaecía-Diego, E.2    Heeeeeo, J.R.3
  • 17
    • 0027764718 scopus 로고
    • To copy or not to copy: A compile-time technique for assessing when data copying should be used to eliminate cache conflicts
    • (1993)
    • Temam, O., Granston, E. D., and Jalby, W. 1993. To copy or not to copy: A compile-time technique for assessing when data copying should be used to eliminate cache conflicts. In Proceedings of SUPERCOMPUTING’93 (1993), pp. 410-419.
    • (1993) Proceedings of SUPERCOMPUTING’93 , pp. 410-419
    • Temam, O.1    Granston, E.D.2    Jalby, W.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.