-
1
-
-
0028055525
-
Predictability of load/store instruction latencies
-
Austin, Tex.. ACM, New York
-
ABRAHAM, S. G., SUGUMAR, R. A., RAU, B. R., AND GUPTA, R. 1993. Predictability of load/store instruction latencies. In Proceedings of the 26th Annual International Symposium on Microarchitecture (Austin, Tex.). ACM, New York, 139-152.
-
(1993)
Proceedings of the 26th Annual International Symposium on Microarchitecture
, pp. 139-152
-
-
Abraham, S.G.1
Sugumar, R.A.2
Rau, B.R.3
Gupta, R.4
-
2
-
-
0003927082
-
Continuous profiling: Where have all the cycles gone?
-
Digital. July
-
ANDERSON, J. M., BERC, L. M., DEAN, J., GHEMAWAT, S., HENZINGER, M. R., LEUNG, S.-T. A., SITES, R. L., VANDEVOORDE, M. T., WALDSPURGER, C. A., AND WEIHL, W. E. 1997. Continuous profiling: Where have all the cycles gone? SRC Technical Note 1997-016a, Digital. July.
-
(1997)
SRC Technical Note 1997-016a
-
-
Anderson, J.M.1
Berc, L.M.2
Dean, J.3
Ghemawat, S.4
Henzinger, M.R.5
Leung, S.-T.A.6
Sites, R.L.7
Vandevoorde, M.T.8
Waldspurger, C.A.9
Weihl, W.E.10
-
3
-
-
0034844454
-
Data prefetching by dependence graph precomputation
-
(Goteborg, Sweden). ACM, New York
-
ANNAVARAM, M., PATEL, J. M., AND DAVIDSON, E. S. 2001. Data prefetching by dependence graph precomputation. In Proceedings of the 28th Annual International Symposium on Computer Architecture (Goteborg, Sweden). ACM, New York, 52-61.
-
(2001)
Proceedings of the 28th Annual International Symposium on Computer Architecture
, pp. 52-61
-
-
Annavaram, M.1
Patel, J.M.2
Davidson, E.S.3
-
5
-
-
0003465202
-
-
CS TR 1342, University of Wisconsin-Madison, Madison, Wisc., June
-
BURGER, D. AND AUSTIN, T. M. 1997. The SimpleScalar Tool Set, Version 2.0. CS TR 1342, University of Wisconsin-Madison, Madison, Wisc., June.
-
(1997)
The SimpleScalar Tool Set, Version 2.0
-
-
Burger, D.1
Austin, T.M.2
-
6
-
-
0037567883
-
Automatic I/O hint generation through speculative execution
-
(New Orleans, La.). ACM, New York
-
CHANG, P. AND GIBSON, G. A. 1999. Automatic I/O hint generation through speculative execution. In Proceedings of the 3rd Symposium on Operating Systems Design and Implementation (New Orleans, La.). ACM, New York, 1-14.
-
(1999)
Proceedings of the 3rd Symposium on Operating Systems Design and Implementation
, pp. 1-14
-
-
Chang, P.1
Gibson, G.A.2
-
7
-
-
0032662989
-
Simultaneous subordinate microthreading (SSMT)
-
(Atlanta, Ga.). ACM, New York
-
CHAPPELL, R. S., KIM, S. P., REINHARDT, S. K., AND PATT, Y. N. 1999. Simultaneous subordinate microthreading (SSMT). In Proceedings of the 26th International Symposium on Computer Architecture (Atlanta, Ga.). ACM, New York, 186-195.
-
(1999)
Proceedings of the 26th International Symposium on Computer Architecture
, pp. 186-195
-
-
Chappell, R.S.1
Kim, S.P.2
Reinhardt, S.K.3
Patt, Y.N.4
-
8
-
-
0036294826
-
Difficult-path branch prediction using subordinate microthreads
-
(Anchorage, Ak.). ACM, New York
-
CHAPPELL, R. S., TSENG, P., YOAZ, A., AND PATT, Y. N. 2002. Difficult-path branch prediction using subordinate microthreads. In Proceedings of the 29th Annual International Symposium on Computer Architecture (Anchorage, Ak.). ACM, New York, 307-317.
-
(2002)
Proceedings of the 29th Annual International Symposium on Computer Architecture
, pp. 307-317
-
-
Chappell, R.S.1
Tseng, P.2
Yoaz, A.3
Patt, Y.N.4
-
9
-
-
0029308368
-
Effective hardware-based data prefetching for high-performance processors
-
May
-
CHEN, T.-F. AND BAER, J.-L. 1995. Effective hardware-based data prefetching for high-performance processors. Trans. Comput. 44, 5 (May), 609-623.
-
(1995)
Trans. Comput.
, vol.44
, Issue.5
, pp. 609-623
-
-
Chen, T.-F.1
Baer, J.-L.2
-
11
-
-
0035691709
-
Dynamic speculative precomputation
-
(Austin, Tex.). ACM, New York
-
COLLINS, J. D., TULLSEN, D. M., WANG, H., AND SHEN, J. P. 2001. Dynamic speculative precomputation. In Proceedings of the 34th International Symposium on Microarchitecture (Austin, Tex.). ACM, New York, 306-317.
-
(2001)
Proceedings of the 34th International Symposium on Microarchitecture
, pp. 306-317
-
-
Collins, J.D.1
Tullsen, D.M.2
Wang, H.3
Shen, J.P.4
-
12
-
-
0034839033
-
Speculative precomputation: Long-range prefetching of delinquent loads
-
(Goteborg, Sweden). ACM, New York
-
COLLINS, J. D., WANG, H., TULLSEN, D. M., HUGHES, C., LEE, Y.-F., LAVERY, D., AND SHEN, J. P. 2001b. Speculative precomputation: Long-range prefetching of delinquent loads. In Proceedings of the 28th Annual International Symposium on Computer Architecture (Goteborg, Sweden). ACM, New York, 14-25.
-
(2001)
Proceedings of the 28th Annual International Symposium on Computer Architecture
, pp. 14-25
-
-
Collins, J.D.1
Wang, H.2
Tullsen, D.M.3
Hughes, C.4
Lee, Y.-F.5
Lavery, D.6
Shen, J.P.7
-
13
-
-
0022893044
-
Doacross: Beyond vectorization for multiprocessors
-
(University Park, PA). IEEE Computer Society Press, Los Alamitos, Calif.
-
CYTRON, R. 1986. Doacross: Beyond vectorization for multiprocessors. In Proceedings of the 1986 International Conference on Parallel Processing. (University Park, PA). IEEE Computer Society Press, Los Alamitos, Calif., 836-844.
-
(1986)
Proceedings of the 1986 International Conference on Parallel Processing
, pp. 836-844
-
-
Cytron, R.1
-
14
-
-
0004174428
-
Assisted execution
-
Department of EE-Systems, University of Southern California. October
-
DUBOIS, M. AND SONG, Y. H. 1998. Assisted execution. CENG Technical Report 98-25, Department of EE-Systems, University of Southern California. October.
-
(1998)
CENG Technical Report
, vol.98
, Issue.25
-
-
Dubois, M.1
Song, Y.H.2
-
15
-
-
0030662863
-
Improving data cache performance by pre-executing instructions under a cache miss
-
(Vienna, Austria). ACM, New York
-
DUNDAS, J. AND MUDGE, T. 1997. Improving data cache performance by pre-executing instructions under a cache miss. In Proceedings of the 1997 ACM International Conference on Supercomputing (Vienna, Austria). ACM, New York, 68-75.
-
(1997)
Proceedings of the 1997 ACM International Conference on Supercomputing
, pp. 68-75
-
-
Dundas, J.1
Mudge, T.2
-
16
-
-
0032308864
-
Dataflow analysis of branch mispredictions and its application to early resolution of branch outcomes
-
(Dallas, Tex.). ACM, New York
-
FARCY, A., TEMAM, O., ESPASA, R., AND JUAN, T. 1998. Dataflow analysis of branch mispredictions and its application to early resolution of branch outcomes. In Proceedings of the 31st International Symposium on Microarchitecture (Dallas, Tex.). ACM, New York, 59-68.
-
(1998)
Proceedings of the 31st International Symposium on Microarchitecture
, pp. 59-68
-
-
Farcy, A.1
Temam, O.2
Espasa, R.3
Juan, T.4
-
17
-
-
0023385308
-
The program dependence graph and its use in optimization
-
July
-
FERRANTE, J., OTTENSTEIN, K., AND WARREN, J. 1987. The program dependence graph and its use in optimization. ACM Trans. Prog. Lang. 9, 3 (July), 319-349.
-
(1987)
ACM Trans. Prog. Lang.
, vol.9
, Issue.3
, pp. 319-349
-
-
Ferrante, J.1
Ottenstein, K.2
Warren, J.3
-
18
-
-
0036949290
-
Design and evaluation of compiler algorithms for pre-execution
-
(San Jose, Calif.). ACM, New York
-
KIM, D. AND YEUNG, D. 2002. Design and evaluation of compiler algorithms for pre-execution. In Proceedings of the 10th International Conference on Architectural Support for Programming Languages and Operating Systems (San Jose, Calif.). ACM, New York, 159-170.
-
(2002)
Proceedings of the 10th International Conference on Architectural Support for Programming Languages and Operating Systems
, pp. 159-170
-
-
Kim, D.1
Yeung, D.2
-
19
-
-
0036036248
-
Post-pass binary adaptation for software-based speculative precomputation
-
(Berlin, Germany). ACM, New York
-
LIAO, S. S. W., WANG, P. H., WANG, H., HOFLEHNER, G., LAVERY, D., AND SHEN, J. P. 2002. Post-pass binary adaptation for software-based speculative precomputation. In Proceedings of the ACM SIGPLAN Conference on Programming Language Design and Implementation (Berlin, Germany). ACM, New York, 117-128.
-
(2002)
Proceedings of the ACM SIGPLAN Conference on Programming Language Design and Implementation
, pp. 117-128
-
-
Liao, S.S.W.1
Wang, P.H.2
Wang, H.3
Hoflehner, G.4
Lavery, D.5
Shen, J.P.6
-
20
-
-
0034839064
-
Tolerating Memory Latency through software-controlled pre-execution in simultaneous multithreading processors
-
(Goteborg, Sweden). ACM, New York
-
LUK, C.-K. 2001. Tolerating Memory Latency through software-controlled pre-execution in simultaneous multithreading processors. In Proceedings of the 28th Annual International Symposium on Computer Architecture (Goteborg, Sweden). ACM, New York, 40-51.
-
(2001)
Proceedings of the 28th Annual International Symposium on Computer Architecture
, pp. 40-51
-
-
Luk, C.-K.1
-
21
-
-
2542477725
-
Using the unravel program slicing tool to evaluate high integrity software
-
May (San Francisco, Calif.)
-
LYLE, J. R. AND WALLACE, D. R. May 1997. Using the unravel program slicing tool to evaluate high integrity software. In Proceedings of 10th International Software Quality Week (San Francisco, Calif.).
-
(1997)
Proceedings of 10th International Software Quality Week
-
-
Lyle, J.R.1
Wallace, D.R.2
-
22
-
-
0004237928
-
Unravel: A CASE tool to assist evaluation of high integrity software
-
National Institute of Standards and Technology. August
-
LYLE, J. R., WALLACE, D. R., GRAHAM, J. R., GALLAGHER, K. B., POOLE, J. P., AND BINKLEY, D. W. 1995. Unravel: A CASE tool to assist evaluation of high integrity software. NISTIR 5691, National Institute of Standards and Technology. August.
-
(1995)
NISTIR
, vol.5691
-
-
Lyle, J.R.1
Wallace, D.R.2
Graham, J.R.3
Gallagher, K.B.4
Poole, J.P.5
Binkley, D.W.6
-
23
-
-
84878636152
-
A study of a simultaneous multithreaded processor implementation
-
(Toulouse, France). Springer-Verlag, New York
-
MADON, D., SANCHEZ, E., AND MONNIER, S. 1999. A study of a simultaneous multithreaded processor implementation. In Proceedings of EuroPar '99. (Toulouse, France). Springer-Verlag, New York, 716-726.
-
(1999)
Proceedings of EuroPar '99
, pp. 716-726
-
-
Madon, D.1
Sanchez, E.2
Monnier, S.3
-
24
-
-
0034819242
-
Slice-processors: An implementation of operation-based prediction
-
(Sorrento, Italy). ACM, New York
-
MOSHOVOS, A., PNEVMATIKATOS, D. N., AND BANIASADI, A. 2001. Slice-processors: An implementation of operation-based prediction. In Proceedings of the International Conference on Supercomputing (Sorrento, Italy). ACM, New York, 321-334.
-
(2001)
Proceedings of the International Conference on Supercomputing
, pp. 321-334
-
-
Moshovos, A.1
Pnevmatikatos, D.N.2
Baniasadi, A.3
-
25
-
-
0031988272
-
Tolerating latency in multiprocessors through compiler-inserted prefetching
-
Feb.
-
MOWRY, T. 1998, Tolerating latency in multiprocessors through compiler-inserted prefetching. Trans. Comput. Syst. 16, 1 (Feb.), 55-92.
-
(1998)
Trans. Comput. Syst.
, vol.16
, Issue.1
, pp. 55-92
-
-
Mowry, T.1
-
26
-
-
0019055294
-
High-speed multiprocessors and compilation techniques
-
Sept.
-
PADUA, D. A., KUCK, D. J., AND LAWRIE, D. H. 1980. High-speed multiprocessors and compilation techniques, IEEE Trans. Comput. C-29, 9 (Sept.), 763-776.
-
(1980)
IEEE Trans. Comput.
, vol.C-29
, Issue.9
, pp. 763-776
-
-
Padua, D.A.1
Kuck, D.J.2
Lawrie, D.H.3
-
27
-
-
0022874874
-
Advanced compiler optimizations for supercomputers
-
Dec.
-
PADUA, D. A. AND WOLFE, M. J. 1986. Advanced compiler optimizations for supercomputers. Communi. ACM 29, 12 (Dec.), 1184-1201.
-
(1986)
Communi. ACM
, vol.29
, Issue.12
, pp. 1184-1201
-
-
Padua, D.A.1
Wolfe, M.J.2
-
28
-
-
0033356548
-
Code transformations to improve memory parallelism
-
(Haifa, Israel). ACM, New York
-
PAI, V. S. AND ADVE, S. 1999. Code transformations to improve memory parallelism. In. Proceedings of the International Symposium on Microarchitecture (Haifa, Israel). ACM, New York, 147-155.
-
(1999)
Proceedings of the International Symposium on Microarchitecture
, pp. 147-155
-
-
Pai, V.S.1
Adve, S.2
-
29
-
-
0029273301
-
Supporting dynamic data structures on distributed memory machines
-
Mar.
-
ROGERS, A., CARLISLE, M., REPPY, J., AND HENDREN, L. 1995. Supporting dynamic data structures on distributed memory machines. ACM Trans. Prog. Lang. Syst. 17, 2 (Mar.).
-
(1995)
ACM Trans. Prog. Lang. Syst.
, vol.17
, Issue.2
-
-
Rogers, A.1
Carlisle, M.2
Reppy, J.3
Hendren, L.4
-
30
-
-
0031600692
-
Dependence based prefetching for linked data structures
-
(San Jose, Calif.). ACM, New York
-
ROTH, A., MOSHOVOS, A., AND SOHI, G. S. 1998. Dependence based prefetching for linked data structures. In Proceedings of the 8th International Conference on Architectural Support for Programming Languages and Operating Systems (San Jose, Calif.). ACM, New York, 115-126.
-
(1998)
Proceedings of the 8th International Conference on Architectural Support for Programming Languages and Operating Systems
, pp. 115-126
-
-
Roth, A.1
Moshovos, A.2
Sohi, G.S.3
-
31
-
-
0032669611
-
Improving virtual function call target prediction via dependence-based pre-computation
-
(Rhodes, Greece). ACM, New York
-
ROTH, A., MOSHOVOS, A., AND SOHI, G. S. 1999. Improving virtual function call target prediction via dependence-based pre-computation. In Proceedings of the 13th Annual International Conference on Supercomputing (Rhodes, Greece). ACM, New York, 356-364.
-
(1999)
Proceedings of the 13th Annual International Conference on Supercomputing
, pp. 356-364
-
-
Roth, A.1
Moshovos, A.2
Sohi, G.S.3
-
32
-
-
0001820176
-
Speculative data-driven multithreading
-
(Monterrey, Mexico). IEEE Computer Society Press, Los Alamitos, Calif.
-
ROTH, A. AND SOHI, G. S. 2001. Speculative data-driven multithreading. In Proceedings of the 7th International Conference on High Performance Computer Architecture (Monterrey, Mexico). IEEE Computer Society Press, Los Alamitos, Calif., 191-202.
-
(2001)
Proceedings of the 7th International Conference on High Performance Computer Architecture
, pp. 191-202
-
-
Roth, A.1
Sohi, G.S.2
-
33
-
-
84948958124
-
A quantitative framework for automated pre-execution thread selection
-
(Istanbul, Turkey). ACM, New York
-
ROTH, A. AND SOHI, G. S. 2002. A quantitative framework for automated pre-execution thread selection. In Proceedings of the 35th Annual International Symposium on Microarchitecture (Istanbul, Turkey). ACM, New York, 430-441.
-
(2002)
Proceedings of the 35th Annual International Symposium on Microarchitecture
, pp. 430-441
-
-
Roth, A.1
Sohi, G.S.2
-
34
-
-
0034443570
-
Symbiotic Jobscheduling for a simutaneous multithreading processor
-
(Cambridge, Mass.). ACM, New York
-
SNAVELY, A. AND TULLSEN, D. M. 2000. Symbiotic Jobscheduling for a simutaneous multithreading processor. In Proceedings of the 9th International Conference on Architectural Support for Programming Languages and Operating Systems (Cambridge, Mass.). ACM, New York, 234-244.
-
(2000)
Proceedings of the 9th International Conference on Architectural Support for Programming Languages and Operating Systems
, pp. 234-244
-
-
Snavely, A.1
Tullsen, D.M.2
-
35
-
-
4444220816
-
-
SPEC. 2000. SPEC CPU2000 V1.2 (http://www.specbench.org/osg/cpu2000/).
-
(2000)
SPEC CPU2000 V1.2
-
-
-
36
-
-
0034441012
-
Slipstream processors: Improving both performance and fault tolerance
-
(Cambridge, Mass.). ACM, New York
-
SUNDARAMOORTHY, K., PURSER, Z., AND ROTENBERG, E. 2000. Slipstream processors: Improving both performance and fault tolerance. In Proceedings of the 9th International Conference on Architectural Support for Programming Languages and Operating Systems (Cambridge, Mass.). ACM, New York, 191-202.
-
(2000)
Proceedings of the 9th International Conference on Architectural Support for Programming Languages and Operating Systems
, pp. 191-202
-
-
Sundaramoorthy, K.1
Purser, Z.2
Rotenberg, E.3
-
37
-
-
0029666641
-
Exploiting choice: Instruction fetch and issue on an implementable simultaneous multithreading processor
-
(Philadelphia, Pa.). ACM, New York
-
TULLSEN, D. M., EGGERS, S. J., EMER, J. S., LEVY, H. M., LO, J. L., AND STAMM, R. L. 1996. Exploiting choice: Instruction fetch and issue on an implementable simultaneous multithreading processor. In Proceedings of the 1996 International Symposium on Computer Architecture (Philadelphia, Pa.). ACM, New York, 191-202.
-
(1996)
Proceedings of the 1996 International Symposium on Computer Architecture
, pp. 191-202
-
-
Tullsen, D.M.1
Eggers, S.J.2
Emer, J.S.3
Levy, H.M.4
Lo, J.L.5
Stamm, R.L.6
-
38
-
-
0032786014
-
Supporting fine-grained synchronization on a simultaneous multithreading processor
-
(Orlando, Fla.). IEEE Computer Society Press, Los Alamitos, Calif.
-
TULLSEN, D. M., LO, J. L., EGGERS, S. J., AND LEVY, H. M. 1999. Supporting fine-grained synchronization on a simultaneous multithreading processor. In Proceedings of the 5th International Symposium on High-Performance Computer Architecture. (Orlando, Fla.). IEEE Computer Society Press, Los Alamitos, Calif., 54-58.
-
(1999)
Proceedings of the 5th International Symposium on High-Performance Computer Architecture
, pp. 54-58
-
-
Tullsen, D.M.1
Lo, J.L.2
Eggers, S.J.3
Levy, H.M.4
-
39
-
-
84949755841
-
Memory latency-tolerance approaches for itanium processors: Out-of-order execution vs. speculative precomputation
-
(Boston, Mass.). ACM, New York
-
WANG, P. H., WANG, H., COLLINS, J. D., GROCHOWSKI, E., KLING, R. M., AND SHEN, J. P. 2002. Memory latency-tolerance approaches for itanium processors: Out-of-order execution vs. speculative precomputation. In Proceedings of the 8th International Symposium on High-Performance Computer Architecture (Boston, Mass.). ACM, New York, 187-196.
-
(2002)
Proceedings of the 8th International Symposium on High-Performance Computer Architecture
, pp. 187-196
-
-
Wang, P.H.1
Wang, H.2
Collins, J.D.3
Grochowski, E.4
Kling, R.M.5
Shen, J.P.6
-
40
-
-
0021457061
-
Program slicing
-
July
-
WEISER, M. 1984. Program slicing. IEEE Trans. Softw. Eng. SE-10, 4 (July).
-
(1984)
IEEE Trans. Softw. Eng.
, vol.SE-10
, Issue.4
-
-
Weiser, M.1
-
41
-
-
0033707298
-
Understanding the backward slices of performance degrading instructions
-
(Vancouver, Canada). ACM, New York
-
ZILLES, C. B. AND SOHI, G. S. 2000. Understanding the backward slices of performance degrading instructions. In Proceedings of the 27th Annual International Symposium on Computer Architecture (Vancouver, Canada). ACM, New York, 172-181.
-
(2000)
Proceedings of the 27th Annual International Symposium on Computer Architecture
, pp. 172-181
-
-
Zilles, C.B.1
Sohi, G.S.2
-
42
-
-
0034856097
-
Execution-based prediction using speculative slices
-
(Goteborg, Sweden). ACM, New York
-
ZILLES, C. B. AND SOHI, G. 2001. Execution-based prediction using speculative slices. In Proceedings of the 28th Annual International Symposium on Computer Architecture (Goteborg, Sweden). ACM, New York, 2-13.
-
(2001)
Proceedings of the 28th Annual International Symposium on Computer Architecture
, pp. 2-13
-
-
Zilles, C.B.1
Sohi, G.2
-
43
-
-
84948955651
-
Master/slave speculative parallelization
-
(Istanbul, Turkey). ACM, New York
-
ZILLES, C. B. AND SOHI, G. 2002. Master/slave speculative parallelization. In Proceedings of the 35th International Symposium on Microarchitecture (Istanbul, Turkey). ACM, New York, 85-96.
-
(2002)
Proceedings of the 35th International Symposium on Microarchitecture
, pp. 85-96
-
-
Zilles, C.B.1
Sohi, G.2
|