-
1
-
-
0028427170
-
Improving performance of linear algebra algorithms for dense matrices, using algorithmic prefetch
-
Agarwal, R. C., Gustavson, F. G., and Zubair, M. 1994. Improving performance of linear algebra algorithms for dense matrices, using algorithmic prefetch. IBM J. of Research and Development 38, 3, 265-275.
-
(1994)
IBM J. of Research and Development
, vol.38
, Issue.3
, pp. 265-275
-
-
Agarwal, R.C.1
Gustavson, F.G.2
Zubair, M.3
-
2
-
-
0026137116
-
The cache performance and optimizations of blocked algorithms
-
Callahan, D., Kennedy, K., and Porterfield, A. 1991a. The cache performance and optimizations of blocked algorithms. In Proceedings of ASPLOS’91 (1991), pp. 63-74.
-
(1991)
Proceedings of ASPLOS’91
, pp. 63-74
-
-
Callahan, D.1
Kennedy, K.2
Porterfield, A.3
-
3
-
-
84976722352
-
Software prefetching
-
Callahan, D., Kennedy, K., and Porterfield, A. 1991b. Software prefetching. In Proceedings of ASPLOS’91 (1991), pp. 40-52.
-
(1991)
Proceedings of ASPLOS’91
, pp. 40-52
-
-
Callahan, D.1
Kennedy, K.2
Porterfield, A.3
-
5
-
-
0942292873
-
Matrix multiplication: A case study of algorithm engineering
-
(August 1998) Max-Planck-Institut für Informatik
-
Eiron, N., Rodeh, M., and Steinwarts, I. 1998. Matrix multiplication: A case study of algorithm engineering. In Proceedings of WAE’98 (August 1998), pp. 40-52. Max-Planck-Institut für Informatik.
-
(1998)
Proceedings of WAE’98
, pp. 40-52
-
-
Eiron, N.1
Rodeh, M.2
Steinwarts, I.3
-
6
-
-
0028261367
-
Complexity/performance tradeoffs with non-blocking loads
-
(1994)
-
Farkas, K. and Jouppi, N. 1994. Complexity/performance tradeoffs with non-blocking loads. In Proceedings of ISCA’94 (1994), pp. 211-222.
-
(1994)
Proceedings of ISCA’94
, pp. 211-222
-
-
Farkas, K.1
Jouppi, N.2
-
7
-
-
85024292364
-
-
Personal Communication
-
Gustavson, F. G. 1998. Personal Communication.
-
(1998)
-
-
Gustavson, F.G.1
-
11
-
-
0042650298
-
Software pipelining: An effective technique for VLIW machines
-
(1988)
-
Lam, M. S. 1988. Software pipelining: An effective technique for VLIW machines. In Proceedings of SIGPLAN’88 (1988), pp. 318-328.
-
(1988)
Proceedings of SIGPLAN’88
, pp. 318-328
-
-
Lam, M.S.1
-
12
-
-
0009053589
-
Reducing cache conflicts in data cache prefetching
-
Lee, J. H., Lee, M. Y., Choi, S. U., and Park, M. S. 1994. Reducing cache conflicts in data cache prefetching. Computer Architecture News 22, 4, 71-77.
-
(1994)
Computer Architecture News
, vol.22
, Issue.4
, pp. 71-77
-
-
Lee, J.H.1
Lee, M.Y.2
Choi, S.U.3
Park, M.S.4
-
14
-
-
84976833735
-
Design and evaluation of a compiler algorithm for data prefetching
-
(1992)
-
Mowry, T. C., Lam, M. S., and Gupta, A. 1992. Design and evaluation of a compiler algorithm for data prefetching. In Proceedings of ASPLOS’92 (1992), pp. 62-73.
-
(1992)
Proceedings of ASPLOS’92
, pp. 62-73
-
-
Mowry, T.C.1
Lam, M.S.2
Gupta, A.3
-
15
-
-
0029714323
-
Data prefetching and multilevel blocking for linear algebra operations
-
(1996)
-
Navarro, J. J., García-Diego, E., and Herrero, J. R. 1996. Data prefetching and multilevel blocking for linear algebra operations. In Proceedings of ICS’96 (1996), pp. 109-116.
-
(1996)
Proceedings of ICS’96
, pp. 109-116
-
-
Navarro, J.J.1
García-Diego, E.2
Herrero, J.R.3
-
17
-
-
0027764718
-
To copy or not to copy: A compile-time technique for assessing when data copying should be used to eliminate cache conflicts
-
(1993)
-
Temam, O., Granston, E. D., and Jalby, W. 1993. To copy or not to copy: A compile-time technique for assessing when data copying should be used to eliminate cache conflicts. In Proceedings of SUPERCOMPUTING’93 (1993), pp. 410-419.
-
(1993)
Proceedings of SUPERCOMPUTING’93
, pp. 410-419
-
-
Temam, O.1
Granston, E.D.2
Jalby, W.3