-
1
-
-
0030645124
-
Exploiting hardware performance counters with flow and context sensitive profiling
-
June
-
Ammons, G., Ball, T., and Larus, J.R. 1997. Exploiting hardware performance counters with flow and context sensitive profiling. ACM Sigplan Notices 32, June. 85-96.
-
(1997)
ACM Sigplan Notices
, vol.32
, pp. 85-96
-
-
Ammons, G.1
Ball, T.2
Larus, J.R.3
-
2
-
-
0031540532
-
Continuous profiling: Where have all the cycles gone
-
th ACM Symposium of Operating Systems Principles. 1-14.
-
(1997)
th ACM Symposium of Operating Systems Principles
, pp. 1-14
-
-
Anderson, L.1
Berc, M.2
Dean, J.3
Ghemawat, M.R.4
Henzinger, S.5
Leung, S.6
Sites, L.7
Vandervoorde, M.T.8
Waldspurger, C.A.9
Weihl, W.E.10
-
3
-
-
0029703704
-
Fast, effective dynamic compilation
-
Auslander, J., Philipose, M., Chambers, C., Eggers, S.J., and Bershad, B.N. 1996. Fast, effective dynamic compilation. In Proceedings of the SIGPLAN'96 Conference on Programming Language Design and Implementation (PLDI'96).
-
(1996)
Proceedings of the SIGPLAN'96 Conference on Programming Language Design and Implementation (PLDI'96)
-
-
Auslander, J.1
Philipose, M.2
Chambers, C.3
Eggers, S.J.4
Bershad, B.N.5
-
4
-
-
0031624040
-
Scalable cross-module optimization
-
Montreal
-
Ayers, A., De Jong, S., Peyton, J., and Schooler, R. 1998. Scalable cross-module optimization. In Proceedings of the SIGPLAN'98 Conference on Programming Language Design and Implementation (PLDI'98), Montreal. 301-312.
-
(1998)
Proceedings of the SIGPLAN'98 Conference on Programming Language Design and Implementation (PLDI'98)
, pp. 301-312
-
-
Ayers, A.1
De Jong, S.2
Peyton, J.3
Schooler, R.4
-
8
-
-
0032290843
-
MPS: Miss-path scheduling for multiple-issue processors
-
Dec.
-
Banerjia, S., Sathaye, S.W., Menezes, K.N., and Conte, T.M. 1998. MPS: miss-path scheduling for multiple-issue processors. IEEE Trans. Computers 47, 12 (Dec.).
-
(1998)
IEEE Trans. Computers
, vol.47
, pp. 12
-
-
Banerjia, S.1
Sathaye, S.W.2
Menezes, K.N.3
Conte, T.M.4
-
10
-
-
0031635766
-
Complete removal of redundant expressions
-
Montreal
-
Bodik, R., Gupta, R., and Soffa, M.L. 1998. Complete removal of redundant expressions. In Proceedings of the SIGPLAN'98 Conference on Programming Language Design and Implementation (PLDI'98), Montreal. 1-14.
-
(1998)
Proceedings of the SIGPLAN'98 Conference on Programming Language Design and Implementation (PLDI'98)
, pp. 1-14
-
-
Bodik, R.1
Gupta, R.2
Soffa, M.L.3
-
11
-
-
5444234938
-
-
System and method of using annotations to optimize dynamically translated code in the presence of signals. U.S. Patent 5,838,978. Nov 1998
-
Buzbee, W.B. 1998. System and method of using annotations to optimize dynamically translated code in the presence of signals. U.S. Patent 5,838,978. Nov 1998.
-
(1998)
-
-
Buzbee, W.B.1
-
12
-
-
0032025103
-
FX!32: A profile-directed binary translator
-
March/April 1998
-
Chernoff, A., Herdeg, M., Hookway, R., Reeve, C., Rubin, N., Tye, T., Yadavalli, B., and Yates, J. 1998. FX!32: a profile-directed binary translator. IEEE Micro, Vol 18, No. 2, March/April 1998.
-
(1998)
IEEE Micro
, vol.18
, Issue.2
-
-
Chernoff, A.1
Herdeg, M.2
Hookway, R.3
Reeve, C.4
Rubin, N.5
Tye, T.6
Yadavalli, B.7
Yates, J.8
-
13
-
-
0003978993
-
Shade: A fast instruction set simulator for execution profiling
-
Dept. Comp. Science and Engineering, Univ. Washington
-
Cmelik, R.F. and Keppel, D. 1993. Shade: a fast instruction set simulator for execution profiling. Technical Report UWCSE-93-06-06, Dept. Comp. Science and Engineering, Univ. Washington.
-
(1993)
Technical Report
, vol.UWCSE-93-06-06
-
-
Cmelik, R.F.1
Keppel, D.2
-
15
-
-
0031363362
-
Optimizing Alpha executables on windows NT with spike
-
Cohn, R., Goodwin, D.W., and Lowney, P.G. 1997. Optimizing Alpha executables on Windows NT with Spike. Digital Technical Journal 9, 4.
-
(1997)
Digital Technical Journal
, vol.9
, pp. 4
-
-
Cohn, R.1
Goodwin, D.W.2
Lowney, P.G.3
-
18
-
-
0030130138
-
Hardware-based profiling: An effective technique for profile-driven optimization
-
Conte, T.M., Patel, B.A., Menezes, K.N., and Cox, J.S. 1996. Hardware-based profiling: An effective technique for profile-driven optimization. International Journal of Parallel Programming 24, April. 187-206.
-
(1996)
International Journal of Parallel Programming
, vol.24
, Issue.APRIL
, pp. 187-206
-
-
Conte, T.M.1
Patel, B.A.2
Menezes, K.N.3
Cox, J.S.4
-
20
-
-
0027574855
-
A methodology for procedure cloning
-
Cooper, K., Hall M.H., and Kennedy, K. 1993. A methodology for procedure cloning. Computer Languages 19, 2 (April). 105-117.
-
(1993)
Computer Languages
, vol.19
, Issue.2 APRIL
, pp. 105-117
-
-
Cooper, K.1
Hall, M.H.2
Kennedy, K.3
-
21
-
-
0002232440
-
Compiling java just in time
-
May/Jun 1997
-
Cramer, T., Friedman, R., Miller, T., Seberger, D., Wilson, R., and Wolczko, M. 1997. Compiling Java Just In Time. IEEE Micro, May/Jun 1997.
-
(1997)
IEEE Micro
-
-
Cramer, T.1
Friedman, R.2
Miller, T.3
Seberger, D.4
Wilson, R.5
Wolczko, M.6
-
28
-
-
0005421783
-
The Java HotSpot virtual machine architecture
-
Griswold, D. 1998. The Java HotSpot virtual machine architecture. Sun Microsystems, Mar. 1998. Available from http://java.sun.com/products/hotspot/ whitepaper.html.
-
(1998)
Sun Microsystems, Mar. 1998
-
-
Griswold, D.1
-
29
-
-
0002625387
-
Intel and HP make EPIC disclosure
-
Oct.
-
Gwennap, L. 1997. Intel and HP make EPIC disclosure. Microprocessor Report 11, 14 (Oct.).
-
(1997)
Microprocessor Report
, vol.11
, pp. 14
-
-
Gwennap, L.1
-
30
-
-
2342513342
-
PA-7200 enables inexpensive MP systems
-
Mar.
-
Gwennap, L. 1994. PA-7200 enables inexpensive MP systems. Microprocessor Report 8, 3 (Mar.).
-
(1994)
Microprocessor Report
, vol.8
, pp. 3
-
-
Gwennap, L.1
-
34
-
-
0003988888
-
-
PhD thesis, Computer Science Dept., Stanford University, available as Technical Report STAN-CS-TR-94-1520. Also available as a Sun Microsystems Lab technical report
-
Holzle, U. 1994. Adaptive optimization for SELF: reconciling high performance with exploratory programming, PhD thesis, Computer Science Dept., Stanford University, available as Technical Report STAN-CS-TR-94-1520. Also available as a Sun Microsystems Lab technical report.
-
(1994)
Adaptive Optimization for SELF: Reconciling High Performance with Exploratory Programming
-
-
Holzle, U.1
-
35
-
-
0030675993
-
Digital FX!32: Combining emulation and binary translation
-
1997
-
Hookway, R.J, and Herdeg, M.A. 1997. Digital FX!32: combining emulation and binary translation. Digital Technical journal, Vol 9, No. 1, 1997, pp 3-12.
-
(1997)
Digital Technical Journal
, vol.9
, Issue.1
, pp. 3-12
-
-
Hookway, R.J.1
Herdeg, M.A.2
-
36
-
-
5444245113
-
-
Private communication
-
Hunt, Doug 1999. Private communication.
-
(1999)
-
-
Hunt, D.1
-
37
-
-
5444234324
-
-
Private communication
-
Hsu, W. 1999. Private communication.
-
(1999)
-
-
Hsu, W.1
-
38
-
-
0027595384
-
The superblock: An effective structure for VLIW and superscalar compilation
-
Hwu, W.W., Mahlke, S.A., Chen, W.Y., Chang, P. P., Warter, N.J., Bringmann, R.A., Ouellette, R.Q., Hank, R.E., Kiyohara, T., Haab, G.E., Holm, J.G., and Lavery, D.M. 1993. The superblock: an effective structure for VLIW and superscalar compilation. The Journal of Supercomputing 7, (Jan.). 229-248.
-
(1993)
The Journal of Supercomputing
, vol.7
, Issue.JAN
, pp. 229-248
-
-
Hwu, W.W.1
Mahlke, S.A.2
Chen, W.Y.3
Chang, P.P.4
Warter, N.J.5
Bringmann, R.A.6
Ouellette, R.Q.7
Hank, R.E.8
Kiyohara, T.9
Haab, G.E.10
Holm, J.G.11
Lavery, D.M.12
-
41
-
-
5444235820
-
-
Memory controller for a microprocessor for detecting a failure of speculation on the physical nature of a component being addressed. U.S. Patent 5,832,205, Nov. 1998
-
Kelly, E.K., Cmelik, R.F., and Wing, M.J. 1998. Memory controller for a microprocessor for detecting a failure of speculation on the physical nature of a component being addressed. U.S. Patent 5,832,205, Nov. 1998.
-
(1998)
-
-
Kelly, E.K.1
Cmelik, R.F.2
Wing, M.J.3
-
42
-
-
0028460367
-
Optimal code motion: Theory and practice
-
Knoop, J., Ruething, O., and Steffen, B. 1994. Optimal code motion: theory and practice. ACM Transactions on Programming Languages and Systems 16, 4. 1117-1155.
-
(1994)
ACM Transactions on Programming Languages and Systems
, vol.16
, Issue.4
, pp. 1117-1155
-
-
Knoop, J.1
Ruething, O.2
Steffen, B.3
-
44
-
-
0006423345
-
The HP PA-8000 RISC CPU: A high performance out-of-order processor
-
Palo Alto, CA.
-
Kumar, A. 1996. The HP PA-8000 RISC CPU: a high performance out-of-order processor. In Proceedings Hot Chips VIII, Palo Alto, CA.
-
(1996)
Proceedings Hot Chips VIII
-
-
Kumar, A.1
-
45
-
-
0003641991
-
Dynamo: A staged compiler architecture for dynamic program optimization
-
Dept. Computer Science, Indiana University
-
Leone, M. and Dybvig, R.K. 1997. Dynamo: a staged compiler architecture for dynamic program optimization. Technical Report #490, Dept. Computer Science, Indiana University.
-
(1997)
Technical Report #490
, vol.490
-
-
Leone, M.1
Dybvig, R.K.2
-
48
-
-
0018434045
-
Global optimization by suppression of partial redundancies
-
Morel, E. and Renvoise, C. 1979. Global optimization by suppression of partial redundancies. CACM 22, 2. 96-103.
-
(1979)
CACM
, vol.22
, Issue.2
, pp. 96-103
-
-
Morel, E.1
Renvoise, C.2
-
50
-
-
0026918390
-
Improving the accuracy of dynamic branch prediction using branch correlation
-
Pan, S., So, K., and Rahmeh, J. 1992. Improving the accuracy of dynamic branch prediction using branch correlation. In Proceedings of ASPLOS-V. 76-84.
-
(1992)
Proceedings of ASPLOS-V
, pp. 76-84
-
-
Pan, S.1
So, K.2
Rahmeh, J.3
-
51
-
-
0030126384
-
Tuning the Pentium Pro microarchitecture
-
Papworth, D. 1996. Tuning the Pentium Pro microarchitecture. IEEE Micro, (Apr.). 8-15.
-
(1996)
IEEE Micro
, Issue.APR
, pp. 8-15
-
-
Papworth, D.1
-
52
-
-
5444225705
-
-
Dynamic flow instruction cache memory organized around trace segments independent of virtual address line. U.S. patent 5,381,533
-
Peleg, A. and Weiser, U. 1994. Dynamic flow instruction cache memory organized around trace segments independent of virtual address line. U.S. patent 5,381,533.
-
(1994)
-
-
Peleg, A.1
Weiser, U.2
-
53
-
-
0030717771
-
Tcc: A system for fast flexible, and high-level dynamic code generation
-
Poletta, M., Engler, D.R., and Kaashoek, M.F. 1997. tcc: a system for fast flexible, and high-level dynamic code generation. In Proceedings of the SIGPLAN '97 Conference on Programming Language Design and Implementation. 109-121.
-
(1997)
Proceedings of the SIGPLAN '97 Conference on Programming Language Design and Implementation
, pp. 109-121
-
-
Poletta, M.1
Engler, D.R.2
Kaashoek, M.F.3
-
55
-
-
0027594708
-
Multi-way versus one-way constraints in user interfaces: Experiences with the DeltaBlue algorithm
-
Sannella, M., Maloney, J., Freeman-Benson, B., and Borning, A. 1993. Multi-way versus one-way constraints in user interfaces: experiences with the DeltaBlue algorithm. Software - Practice and Experience 23, 5 (May). 529-566.
-
(1993)
Software - Practice and Experience
, vol.23
, Issue.5 MAY
, pp. 529-566
-
-
Sannella, M.1
Maloney, J.2
Freeman-Benson, B.3
Borning, A.4
-
56
-
-
0005002277
-
Binary Translation
-
Special Issue
-
Sites, R.L., Chernoff, A., Kirk, M.B., Marks, M.P., and Robinson, S.G. Binary Translation. Digital Technical Journal, Vol 4, No. 4, Special Issue, 1992.
-
(1992)
Digital Technical Journal
, vol.4
, Issue.4
-
-
Sites, R.L.1
Chernoff, A.2
Kirk, M.B.3
Marks, M.P.4
Robinson, S.G.5
-
57
-
-
0028516384
-
The PowerPC 604 microproccesor
-
Song, S.P. Denman, M., and Chang, J. 1995. The PowerPC 604 microproccesor", IEEE Micro, (Oct.). 8-17.
-
(1995)
IEEE Micro
, Issue.OCT
, pp. 8-17
-
-
Song, S.P.1
Denman, M.2
Chang, J.3
-
58
-
-
5444266191
-
-
Information on SPEC programs are available from the SPEC Web site
-
SPEC, Information on SPEC programs are available from the SPEC Web site at http://www.spec.org.
-
-
-
-
59
-
-
5444246742
-
Emulating the x86 and DOS/Windows in RISC environments
-
San Jose, CA
-
Stears, P. 1994. Emulating the x86 and DOS/Windows in RISC environments. In Proceedings Microprocessor Forum, San Jose, CA.
-
(1994)
Proceedings Microprocessor Forum
-
-
Stears, P.1
-
61
-
-
0343216869
-
Systems for late code modification
-
Digital Equipment Corp. Western Research Laboratory, 250 University Ave, Polo Alto, CA 94301
-
Wall, D.W. 1992. Systems for late code modification. Research Report 92/3, Digital Equipment Corp. Western Research Laboratory, 250 University Ave, Polo Alto, CA 94301.
-
(1992)
Research Report
, vol.92
, Issue.3
-
-
Wall, D.W.1
-
63
-
-
0030129806
-
MIPS R10000 superscalar microprocessor
-
Apr. 1996
-
Yeager, K.C. 1996. MIPS R10000 superscalar microprocessor. In IEEE Micro, Apr. 1996.
-
(1996)
IEEE Micro
-
-
Yeager, K.C.1
-
64
-
-
84976769139
-
Improving the accuracy of static branch prediction using branch correlation
-
Young, C. and Smith, M.D. 1994. Improving the accuracy of static branch prediction using branch correlation. In Proceedings of ASPLOS-VI. 232-241.
-
(1994)
Proceedings of ASPLOS-VI
, pp. 232-241
-
-
Young, C.1
Smith, M.D.2
|