-
1
-
-
33646389168
-
Compiler-directed content-aware prefetching for dynamic data structures
-
New Orleans, LA. IEEE Computer Society Press, Los Alamitos, CA
-
AL-SUKHNI, H., BRATT, I., AND CONNORS, D. A. 2003. Compiler-directed content-aware prefetching for dynamic data structures. In Proceedings of the 12th Annual International Conference on Parallel Architectures and Compilation Techniques (New Orleans, LA). IEEE Computer Society Press, Los Alamitos, CA.
-
(2003)
Proceedings of the 12th Annual International Conference on Parallel Architectures and Compilation Techniques
-
-
Al-Sukhni, H.1
Bratt, I.2
Connors, D.A.3
-
2
-
-
0003465202
-
The SimpleScalar tool set, version 2.0
-
University of Wisconsin-Madison, Madison, WI
-
BURGER, D. AND AUSTIN, T. M. 1997. The SimpleScalar tool set, version 2.0. Tech. rep. CS TR 1342, University of Wisconsin-Madison, Madison, WI.
-
(1997)
Tech. Rep.
, vol.CS TR 1342
-
-
Burger, D.1
Austin, T.M.2
-
3
-
-
0026138044
-
Software prefetching
-
Santa Clara, CA. ACM Press, New York, NY
-
CALLAHAN, D., KENNEDY, K., AND PORTERFIELD, A. 1991. Software prefetching. In Proceedings of the 4th International Conference on Architectural Support for Programming Languages and Operating Systems (Santa Clara, CA). ACM Press, New York, NY, 40-52.
-
(1991)
Proceedings of the 4th International Conference on Architectural Support for Programming Languages and Operating Systems
, pp. 40-52
-
-
Callahan, D.1
Kennedy, K.2
Porterfield, A.3
-
4
-
-
0003758490
-
Generalized correlation based hardware prefetching
-
CHARNEY, M. J. AND REEVES, A. P. 1995. Generalized correlation based hardware prefetching. Tech. rep. EE CEG 95-100.
-
(1995)
Tech. Rep.
, vol.EE CEG 95-100
-
-
Charney, M.J.1
Reeves, A.P.2
-
5
-
-
0029511258
-
An effective programmable prefetch engine for on-chip caches
-
Ann Arbor, MI. IEEE Computer Society Press, Los Alamitos, CA
-
CHEN, T.-F. 1995. An effective programmable prefetch engine for on-chip caches. In Proceedings of the 28th Annual Symposium on Microarchitecture (Ann Arbor, MI). IEEE Computer Society Press, Los Alamitos, CA, 237-242.
-
(1995)
Proceedings of the 28th Annual Symposium on Microarchitecture
, pp. 237-242
-
-
Chen, T.-F.1
-
6
-
-
0029308368
-
Effective hardware-based data prefetching for high-performance processors
-
CHEN, T.-F. AND BAER, J.-L. 1995. Effective hardware-based data prefetching for high-performance processors. ACM Trans. Comput. 44, 5 (May), 609-623.
-
(1995)
ACM Trans. Comput.
, vol.44
, Issue.5 MAY
, pp. 609-623
-
-
Chen, T.-F.1
Baer, J.-L.2
-
7
-
-
0028757333
-
Sunder: A programmable hardware prefetch architecture for numerical loops
-
ACM Press, New York, NY
-
CHIUEH, T. 1994. Sunder: A programmable hardware prefetch architecture for numerical loops. In Proceedings of Supercomputing '94. ACM Press, New York, NY, 488-497.
-
(1994)
Proceedings of Supercomputing '94
, pp. 488-497
-
-
Chiueh, T.1
-
8
-
-
0035691709
-
Dynamic speculative precomputation
-
Austin, TX. IEEE Computer Society Press, Los Alamitos, CA
-
COLLINS, J. D., TULLSEN, D. M., WANG, H., AND SEEN, J. P. 2001a. Dynamic speculative precomputation. In Proceedings of the 34th International Symposium on Microarchitecture (Austin, TX). IEEE Computer Society Press, Los Alamitos, CA.
-
(2001)
Proceedings of the 34th International Symposium on Microarchitecture
-
-
Collins, J.D.1
Tullsen, D.M.2
Wang, H.3
Seen, J.P.4
-
9
-
-
0034839033
-
Speculative precomputation: Long-range prefetching of delinquent loads
-
Goteborg, Sweden. ACM Press, New York, NY
-
COLLINS, J. D., WANG, H., TULLSEN, D. M., HUGHES, C., LEE, Y.-F., LAVERY, D., AND SHEN, J. P. 2001b. Speculative precomputation: Long-range prefetching of delinquent loads. In Proceedings of the 28th Annual International Symposium on Computer Architecture (Goteborg, Sweden). ACM Press, New York, NY.
-
(2001)
Proceedings of the 28th Annual International Symposium on Computer Architecture
-
-
Collins, J.D.1
Wang, H.2
Tullsen, D.M.3
Hughes, C.4
Lee, Y.-F.5
Lavery, D.6
Shen, J.P.7
-
10
-
-
0026962180
-
Stride directed prefetching in scalar processors
-
Portland, OR. IEEE Computer Society Press, Los Alamitos, CA
-
Fu, J. W. C., PATEL, J. H., AND JANSSENS, B. L. 1992. Stride directed prefetching in scalar processors. In Proceedings of the 25th Annual International Symposium on Microarchitecture (Portland, OR). IEEE Computer Society Press, Los Alamitos, CA, 102-110.
-
(1992)
Proceedings of the 25th Annual International Symposium on Microarchitecture
, pp. 102-110
-
-
Fu, J.W.C.1
Patel, J.H.2
Janssens, B.L.3
-
11
-
-
84976736522
-
gprof: A call graph execution profiler
-
ACM Press, New York, NY
-
GRAHAM, S. L., KESSLER, P. B., AND McKUSICK, M. K. 1982. gprof: A call graph execution profiler. In Proceedings of 1982 SIGPLAN Symposium on Compiler Construction. ACM Press, New York, NY, 120-126.
-
(1982)
Proceedings of 1982 SIGPLAN Symposium on Compiler Construction
, pp. 120-126
-
-
Graham, S.L.1
Kessler, P.B.2
McKusick, M.K.3
-
12
-
-
0030380793
-
Maximizing multiprocessor performance with the SUIF compiler
-
HALL, M., ANDEESON, J., AMARASINGHE, S., MURPHY, B., LIAO, S.-W., BUGNION, E., AND LAM, M. 1996. Maximizing multiprocessor performance with the SUIF compiler. IEEE Comp. 29, 12 (Dec.), 84-96.
-
(1996)
IEEE Comp.
, vol.29
, Issue.12 DEC.
, pp. 84-96
-
-
Hall, M.1
Andeeson, J.2
Amarasinghe, S.3
Murphy, B.4
Liao, S.-W.5
Bugnion, E.6
Lam, M.7
-
13
-
-
0030677583
-
Prefetching using Markov predictors
-
Denver, CO. ACM Press, New York, NY
-
JOSEPH, D. AND GRUNWALD, D. 1997. Prefetching using Markov predictors. In Proceedings of the 24th International Symposium on Computer Architecture (Denver, CO). ACM Press, New York, NY, 252-263.
-
(1997)
Proceedings of the 24th International Symposium on Computer Architecture
, pp. 252-263
-
-
Joseph, D.1
Grunwald, D.2
-
14
-
-
0025429331
-
Improving direct-mapped cache performance by the addition of a small fully-associative cache and prefetch buffers
-
Seattle, WA. ACM Press, New York, NY
-
JOUPPI, N. P. 1990. Improving direct-mapped cache performance by the addition of a small fully-associative cache and prefetch buffers. In Proceedings of the 17th Annual International Symposium on Computer Architecture (Seattle, WA). ACM Press, New York, NY, 364-373.
-
(1990)
Proceedings of the 17th Annual International Symposium on Computer Architecture
, pp. 364-373
-
-
Jouppi, N.P.1
-
15
-
-
0034581346
-
A prefetching technique for irregular accesses to linked data structures
-
Toulouse, France. ACM Press, New York, NY
-
KARLSSON, M., DAHLGREN, F., AND STENSTROM, P. 2000. A prefetching technique for irregular accesses to linked data structures. In Proceedings of the 6th International Conference on High Performance Computer Architecture (Toulouse, France). ACM Press, New York, NY.
-
(2000)
Proceedings of the 6th International Conference on High Performance Computer Architecture
-
-
Karlsson, M.1
Dahlgren, F.2
Stenstrom, P.3
-
16
-
-
0036949290
-
Design and evaluation of compiler algorithms for pre-execution
-
San Jose, CA. ACM Press, New York, NY
-
KIM, D. AND YEUNG, D. 2002. Design and evaluation of compiler algorithms for pre-execution. In Proceedings of the 10th International Conference on Architectural Support for Programming Languages and Operating Systems (San Jose, CA). ACM Press, New York, NY, 159-170.
-
(2002)
Proceedings of the 10th International Conference on Architectural Support for Programming Languages and Operating Systems
, pp. 159-170
-
-
Kim, D.1
Yeung, D.2
-
17
-
-
0026153646
-
An architecture for software-controlled data prefetching
-
Toronto, Ont. Canada. ACM Press, New York, NY
-
KLAIBER, A. C. AND LEVY, H. M. 1991. An architecture for software-controlled data prefetching. In Proceedings of the 18th International Symposium on Computer Architecture (Toronto, Ont. Canada). ACM Press, New York, NY, 43-53.
-
(1991)
Proceedings of the 18th International Symposium on Computer Architecture
, pp. 43-53
-
-
Klaiber, A.C.1
Levy, H.M.2
-
18
-
-
0035176199
-
Multi-chain prefetching: Effective exploitation of inter-chain memory parallelism for pointer-chasing codes
-
Barcelona, Spain. IEEE Computer Society Press, Los Alamitos, CA
-
KOHOUT, N., CHOI, S., KIM, D., AND YEUNG, D. 2001. Multi-chain prefetching: Effective exploitation of inter-chain memory parallelism for pointer-chasing codes. In Proceedings of the 10th Annual International Conference on Parallel Architectures and Compilation Techniques (Barcelona, Spain). IEEE Computer Society Press, Los Alamitos, CA, 268-279.
-
(2001)
Proceedings of the 10th Annual International Conference on Parallel Architectures and Compilation Techniques
, pp. 268-279
-
-
Kohout, N.1
Choi, S.2
Kim, D.3
Yeung, D.4
-
19
-
-
0019892368
-
Lockup-free instruction fetch/prefetch cache organization
-
Barcelona, Spain. ACM Press, New York, NY
-
KROFT, D. 1981. Lockup-free instruction fetch/prefetch cache organization. In Proceedings of 8th International Symposium on Computer Architecture (Barcelona, Spain). ACM Press, New York, NY, 81-87.
-
(1981)
Proceedings of 8th International Symposium on Computer Architecture
, pp. 81-87
-
-
Kroft, D.1
-
20
-
-
0036036248
-
Post-pass binary adaptation for software-based speculative precomputation
-
Berlin, Germany. ACM Press, New York, NY
-
LIAO, S. S. W., WANG, P. H., WANG, H., HOFLEHNER, G., LAVERY, D., AND SHEN, J. P. 2002. Post-pass binary adaptation for software-based speculative precomputation. In Proceedings of the ACM SIGPLAN Conference on Programming Language Design and Implementation (Berlin, Germany). ACM Press, New York, NY.
-
(2002)
Proceedings of the ACM SIGPLAN Conference on Programming Language Design and Implementation
-
-
Liao, S.S.W.1
Wang, P.H.2
Wang, H.3
Hoflehner, G.4
Lavery, D.5
Shen, J.P.6
-
21
-
-
0034839064
-
Tolerating memory latency through software-controlled pre-execution in simultaneous multithreading processors
-
Goteborg, Sweden. ACM Press, New York, NY
-
LUK, C.-K. 2001. Tolerating memory latency through software-controlled pre-execution in simultaneous multithreading processors. In Proceedings of the 28th Annual International Symposium on Computer Architecture (Goteborg, Sweden). ACM Press, New York, NY.
-
(2001)
Proceedings of the 28th Annual International Symposium on Computer Architecture
-
-
Luk, C.-K.1
-
22
-
-
0030259355
-
Compiler-based prefetching for recursive data structures
-
Cambridge, MA. ACM Press, New York, NY
-
LUK, C.-K. AND MOWRY, T. C. 1996. Compiler-based prefetching for recursive data structures. In Proceedings of the Seventh International Conference on Architectural Support for Programming Languages and Operating Systems (Cambridge, MA). ACM Press, New York, NY, 222-233.
-
(1996)
Proceedings of the Seventh International Conference on Architectural Support for Programming Languages and Operating Systems
, pp. 222-233
-
-
Luk, C.-K.1
Mowry, T.C.2
-
24
-
-
0029723172
-
Examination of a memory access classification scheme for pointer-intensive and numeric programs
-
Philadelphia, PA. ACM Press, New York, NY
-
MEHROTRA, S. AND HARRISON, L. 1996. Examination of a memory access classification scheme for pointer-intensive and numeric programs. In Proceedings of the 10th ACM International Conference on Supercomputing (Philadelphia, PA). ACM Press, New York, NY.
-
(1996)
Proceedings of the 10th ACM International Conference on Supercomputing
-
-
Mehrotra, S.1
Harrison, L.2
-
25
-
-
0034819242
-
Slice-processors: An implementation of operation-based prediction
-
Sorrento, Italy. ACM Press, New York, NY
-
MOSHOVOS, A., PNEVMATIKATOS, D. N., AND BANIASADI, A. 2001. Slice-processors: An implementation of operation-based prediction. In Proceedings of the International Conference on Supercomputing (Sorrento, Italy). ACM Press, New York, NY.
-
(2001)
Proceedings of the International Conference on Supercomputing
-
-
Moshovos, A.1
Pnevmatikatos, D.N.2
Baniasadi, A.3
-
26
-
-
0031988272
-
Tolerating latency in multiprocessors through compiler-inserted prefetching
-
MOWRY, T. 1998. Tolerating latency in multiprocessors through compiler-inserted prefetching. ACM Trans. Comput. Syst. 16, 1 (Feb.), 55-92.
-
(1998)
ACM Trans. Comput. Syst.
, vol.16
, Issue.1 FEB.
, pp. 55-92
-
-
Mowry, T.1
-
27
-
-
0002031606
-
Tolerating latency through software-controlled prefetching in shared-memory multiprocessors
-
MOWRY, T. AND GUPTA, A. 1991. Tolerating latency through software-controlled prefetching in shared-memory multiprocessors. J. Parallel Distrib. Comput. 12, 2 (June), 87-106.
-
(1991)
J. Parallel Distrib. Comput.
, vol.12
, Issue.2 JUNE
, pp. 87-106
-
-
Mowry, T.1
Gupta, A.2
-
28
-
-
0033356548
-
Code transformations to improve memory parallelism
-
Haifa, Israel. IEEE Computer Society Press, Los Alamitos, CA
-
PAI, V. S. AND ADVE, S. 1999. Code transformations to improve memory parallelism. In Proceedings of the International Symposium on Microarchitecture (Haifa, Israel). IEEE Computer Society Press, Los Alamitos, CA.
-
(1999)
Proceedings of the International Symposium on Microarchitecture
-
-
Pai, V.S.1
Adve, S.2
-
29
-
-
0028294834
-
Evaluating stream buffers as a secondary cache replacement
-
Chicago, IL. ACM Press, New York, NY
-
PALACHARLA, S. AND KESSLER, R. E. 1994. Evaluating stream buffers as a secondary cache replacement. In Proceedings of the 21st Annual International Symposium on Computer Architecture (Chicago, IL). ACM Press, New York, NY, 24-33.
-
(1994)
Proceedings of the 21st Annual International Symposium on Computer Architecture
, pp. 24-33
-
-
Palacharla, S.1
Kessler, R.E.2
-
30
-
-
0029273301
-
Supporting dynamic data structures on distributed memory machines
-
ROGERS, A., CARLISLE, M., REPPY, J., AND HENDREN, L. 1995. Supporting dynamic data structures on distributed memory machines. ACM Trans. Programm. Lang. Syst. 17, 2 (March), 233-263.
-
(1995)
ACM Trans. Programm. Lang. Syst.
, vol.17
, Issue.2 MARCH
, pp. 233-263
-
-
Rogers, A.1
Carlisle, M.2
Reppy, J.3
Hendren, L.4
-
31
-
-
0031600692
-
Dependence based prefetching for linked data structures
-
San Jose, CA. ACM Press, New York, NY
-
ROTH, A., MOSHOVOS, A., AND SOHI, G. S. 1998. Dependence based prefetching for linked data structures. In Proceedings of the Eigth International Conference on Architectural Support for Programming Languages and Operating Systems (San Jose, CA). ACM Press, New York, NY.
-
(1998)
Proceedings of the Eigth International Conference on Architectural Support for Programming Languages and Operating Systems
-
-
Roth, A.1
Moshovos, A.2
Sohi, G.S.3
-
32
-
-
0032662988
-
Effective jump-pointer prefetching for linked data structures
-
Atlanta, GA. ACM Press, New York, NY
-
ROTH, A. AND SOHI, G. S. 1999. Effective jump-pointer prefetching for linked data structures. In Proceedings of the 26th International Symposium on Computer Architecture (Atlanta, GA). ACM Press, New York, NY.
-
(1999)
Proceedings of the 26th International Symposium on Computer Architecture
-
-
Roth, A.1
Sohi, G.S.2
-
33
-
-
0001820176
-
Speculative data-driven multithreading
-
Nuevo Leone, Mexico. ACM Press, New York, NY
-
ROTH, A. AND SOHI, G. S. 2001. Speculative data-driven multithreading. In Proceedings of the 7th International Conference on High Performance Computer Architecture (Nuevo Leone, Mexico). ACM Press, New York, NY, 191-202.
-
(2001)
Proceedings of the 7th International Conference on High Performance Computer Architecture
, pp. 191-202
-
-
Roth, A.1
Sohi, G.S.2
-
34
-
-
0034462352
-
Predictor-directed stream buffers
-
Monterey, CA. IEEE Computer Society Press, Los Alamitos, CA
-
SHERWOOD, T., SAIR, S., AND CALDER, B. 2000. Predictor-directed stream buffers. In Proceedings of the 33rd International Symposium on Microarchitecture (Monterey, CA). IEEE Computer Society Press, Los Alamitos, CA.
-
(2000)
Proceedings of the 33rd International Symposium on Microarchitecture
-
-
Sherwood, T.1
Sair, S.2
Calder, B.3
-
35
-
-
2542451617
-
Speculative pointer prefetching of induction pointers
-
Genoa, Italy
-
STOUTCHININ, A., AMAHAL, J. N., GAO, G. R., DEHNERT, J. C., JAIN, S., AND DOUILLET, A. 2001. Speculative pointer prefetching of induction pointers. In Compiler Construction 2001, European Joint Conferences on Theory and Practice of Software (Genoa, Italy).
-
(2001)
Compiler Construction 2001, European Joint Conferences on Theory and Practice of Software
-
-
Stoutchinin, A.1
Amahal, J.N.2
Gao, G.R.3
Dehnert, J.C.4
Jain, S.5
Douillet, A.6
-
37
-
-
0029666641
-
Exploiting choice: Instruction fetch and issue on an implementable simultaneous multithreading processor
-
Philadelphia, PA. ACM Press, New York, NY
-
TULLSEN, D. M., EGGERS, S. J., EMEH, J. S., LEVY, H. M., Lo, J. L., AND STAMM, R. L. 1996. Exploiting choice: Instruction fetch and issue on an implementable simultaneous multithreading processor. In Proceedings of the 1996 International Symposium on Computer Architecture (Philadelphia, PA). ACM Press, New York, NY.
-
(1996)
Proceedings of the 1996 International Symposium on Computer Architecture
-
-
Tullsen, D.M.1
Eggers, S.J.2
Emeh, J.S.3
Levy, H.M.4
Lo, J.L.5
Stamm, R.L.6
-
38
-
-
0033705677
-
Push vs. pull: Data movement for linked data structures
-
Santa Fe, NM. ACM Press, New York, NY
-
YANG, C.-L. AND LEBECK, A. R. 2000. Push vs. pull: Data movement for linked data structures. In Proceedings of the International Conference on Supercomputing (Santa Fe, NM). ACM Press, New York, NY.
-
(2000)
Proceedings of the International Conference on Supercomputing
-
-
Yang, C.-L.1
Lebeck, A.R.2
-
39
-
-
0034856097
-
Execution-based prediction using speculative slices
-
Goteborg, Sweden. ACM Press, New York, NY
-
ZILLES, C. AND SOHI, G. 2001. Execution-based prediction using speculative slices. In Proceedings of the 28th Annual International Symposium on Computer Architecture (Goteborg, Sweden). ACM Press, New York, NY.
-
(2001)
Proceedings of the 28th Annual International Symposium on Computer Architecture
-
-
Zilles, C.1
Sohi, G.2
|