-
1
-
-
0026267802
-
An effective on-chip preloading scheme to reduce data access penalty
-
J.-L. Baer and T.-F. Chen, “An effective on-chip preloading scheme to reduce data access penalty,” Proc. Supercomputing '91, pp. 176–186, 1991.
-
(1991)
Proc. Supercomputing '91
, pp. 176-186
-
-
Baer, J.-L.1
Chen, T.-F.2
-
2
-
-
0024682679
-
Multilevel cache hierarchies: Organizations, protocols, and performance
-
J.-L. Baer and W.-H. Wang, “Multilevel cache hierarchies: Organizations, protocols, and performance,” J. Parallel and Distributed Computing, vol. 6, no. 3, pp. 451–476, 1989.
-
(1989)
J. Parallel and Distributed Computing
, vol.6
, Issue.3
, pp. 451-476
-
-
Baer, J.-L.1
Wang, W.-H.2
-
3
-
-
84908201792
-
-
Technical Report #1137, Computer Science Dept., Univ. of Wis.-Madison, Feb.
-
T. Ball and J.R. Larus, “Branch prediction for free,” Technical Report #1137, Computer Science Dept., Univ. of Wis.-Madison, Feb. 1993.
-
(1993)
Branch prediction for free
-
-
Ball, T.1
Larus, J.R.2
-
6
-
-
84944799568
-
Data access microarchitectures for superscalar processors with compiler-assisted data prefetching
-
W.Y. Chen, S.A. Mahlke, P.P. Chang, and W.-M. Hwu, “Data access microarchitectures for superscalar processors with compiler-assisted data prefetching,” Proc. 24th Int'l Symp. Microarchitecture, 1991.
-
(1991)
Proc. 24th Int'l Symp. Microarchitecture
-
-
Chen, W.Y.1
Mahlke, S.A.2
Chang, P.P.3
Hwu, W.-M.4
-
9
-
-
0001366267
-
Strategies for cache and local memory management by global program transformation
-
Oct.
-
D. Gannon, W. Jalby, and K. Gallivan, “Strategies for cache and local memory management by global program transformation,” J. Parallel and Distributed Computing, vol. 5, no. 5, pp. 587–616, Oct. 1988.
-
(1988)
J. Parallel and Distributed Computing
, vol.5
, Issue.5
, pp. 587-616
-
-
Gannon, D.1
Jalby, W.2
Gallivan, K.3
-
10
-
-
0025146693
-
Compiler-directed data prefetching in multiprocessors with memory hierarchies
-
1990.
-
E. Gornish, E. Granston, and A. Veidenbaum, “Compiler-directed data prefetching in multiprocessors with memory hierarchies,” Proc. 1990 Int'l Conf. Supercomputing, pp. 354–368, 1990.
-
(1990)
Proc. 1990 Int'l Conf. Supercomputing
, pp. 354-368
-
-
Gornish, E.1
Granston, E.2
Veidenbaum, A.3
-
11
-
-
0025429331
-
Improving direct-mapped cache performance by the addition of a small fully-associative cache and prefetch buffers
-
May
-
N.P. Jouppi, “Improving direct-mapped cache performance by the addition of a small fully-associative cache and prefetch buffers,” Proc. 17th Ann. Int'l Symp. Computer Architecture, pp. 364–373, May 1990.
-
(1990)
Proc. 17th Ann. Int'l Symp. Computer Architecture
, pp. 364-373
-
-
Jouppi, N.P.1
-
14
-
-
0021204160
-
Branch prediction strategies and branch target buffer design
-
Jan.
-
J.K.F. Lee and A.J. Smith, “Branch prediction strategies and branch target buffer design,” Computer, pp. 6–22, Jan. 1984.
-
(1984)
Computer
, pp. 6-22
-
-
Lee, J.K.F.1
Smith, A.J.2
-
15
-
-
0023586486
-
Data prefetching in shared memory multiprocessors
-
R.L. Lee, P.-C. Yew, and D.H. Lawrie, “Data prefetching in shared memory multiprocessors,” Proc. Int'l Conf. Parallel Processing, pp. 28–31, 1987.
-
(1987)
Proc. Int'l Conf. Parallel Processing
, pp. 28-31
-
-
Lee, R.L.1
Yew, P.-C.2
Lawrie, D.H.3
-
16
-
-
0023169552
-
Multiprocessor cache design considerations
-
R.L. Lee, P.-C. Yew, and D.H. Lawrie, “Multiprocessor cache design considerations,” Proc. 14th Ann. Int'l Symp. Computer Architecture, pp. 253–262, 1987.
-
(1987)
Proc. 14th Ann. Int'l Symp. Computer Architecture
, pp. 253-262
-
-
Lee, R.L.1
Yew, P.-C.2
Lawrie, D.H.3
-
17
-
-
0002031606
-
Tolerating latency through software-controlled prefetching in shared-memory multiprocessors
-
June
-
T. Mowry and A. Gupta, “Tolerating latency through software-controlled prefetching in shared-memory multiprocessors,” J. Parallel and Distributed Computing, vol. 12, no. 2, pp. 87–106, June 1991.
-
(1991)
J. Parallel and Distributed Computing
, vol.12
, Issue.2
, pp. 87-106
-
-
Mowry, T.1
Gupta, A.2
-
18
-
-
0026918402
-
Design and evaluation of a compiler algorithm for prefetching
-
T. Mowry, M.S. Lam, and A. Gupta, “Design and evaluation of a compiler algorithm for prefetching,” Proc. Fifth Int'l Conf. Architectural Support for Programming Languages and Operating Systems, pp. 62–73, 1992.
-
(1992)
Proc. Fifth Int'l Conf. Architectural Support for Programming Languages and Operating Systems
, pp. 62-73
-
-
Mowry, T.1
Lam, M.S.2
Gupta, A.3
-
19
-
-
0026918390
-
Improving the accuracy of dynamic branch prediction using branch correlation
-
S.-T. Pan, K. So, and J.T. Rahmeh, “Improving the accuracy of dynamic branch prediction using branch correlation,” Proc. Fifth Int'l Conf. Architectural Support for Programming Languages and Operating Systems, pp. 76–84, 1992.
-
(1992)
Proc. Fifth Int'l Conf. Architectural Support for Programming Languages and Operating Systems
, pp. 76-84
-
-
Pan, S.-T.1
So, K.2
Rahmeh, J.T.3
-
20
-
-
84939749407
-
-
Technical Report UCB/CSD 89/552, Univ. of Calif., Berkeley, Dec.
-
C.H. Perleberg and A.J. Smith, “Branch target buffer design and optimization,” Technical Report UCB/CSD 89/552, Univ. of Calif., Berkeley, Dec. 1989.
-
(1989)
Branch target buffer design and optimization
-
-
Perleberg, C.H.1
Smith, A.J.2
-
23
-
-
0344300562
-
Prefetch unit for vector operations on scalar computers
-
Sept.
-
I. Sklenar, “Prefetch unit for vector operations on scalar computers,” Computer Architecture News, vol. 20, no. 4, pp. 31–37, Sept. 1992.
-
(1992)
Computer Architecture News
, vol.20
, Issue.4
, pp. 31-37
-
-
Sklenar, I.1
-
24
-
-
0020177251
-
Cache memories
-
Sept.
-
A.J. Smith, “Cache memories,” ACM Computing Surveys, vol. 14, no. 3, pp. 473–530, Sept. 1982.
-
(1982)
ACM Computing Surveys
, vol.14
, Issue.3
, pp. 473-530
-
-
Smith, A.J.1