-
1
-
-
0031540532
-
Continuous profiling: Where have all the cycles gone?
-
Anderson, J. M., Berc, L. M., Dean, J., Ghemawat, S., Henzinger, M. R., Leung, S. A., Sites, R. L., Vandevoorde, M. T., Waldspurger, C. A., and Weihl, W. E. 1997. Continuous profiling: Where have all the cycles gone? ACM Trans. Comput. Syst.
-
(1997)
ACM Trans. Comput. Syst.
-
-
Anderson, J.M.1
Berc, L.M.2
Dean, J.3
Ghemawat, S.4
Henzinger, M.R.5
Leung, S.A.6
Sites, R.L.7
Vandevoorde, M.T.8
Waldspurger, C.A.9
Weihl, W.E.10
-
3
-
-
84949754375
-
Loose loops sink chips
-
Borch, E., Tune, E., Manne, B., and Emer, J. 2002. Loose loops sink chips. In 8th International Symposium on High-Performance Computer Architecture.
-
(2002)
8th International Symposium on High-Performance Computer Architecture
-
-
Borch, E.1
Tune, E.2
Manne, B.3
Emer, J.4
-
5
-
-
0003465202
-
The SimpleScalar Tool Set, version 2.0
-
CS-TR-1997-1342, University of Wisconsin, Madison
-
Burger, D. C. and Austin, T. M. 1997. The SimpleScalar Tool Set, version 2.0. Tech. Rep. CS-TR-1997-1342, University of Wisconsin, Madison.
-
(1997)
Tech. Rep.
-
-
Burger, D.C.1
Austin, T.M.2
-
8
-
-
0034839033
-
Speculative precomputation: Long-range prefetching of delinquent loads
-
Collins, J. D., Wang, H., Tullsen, D. M., Hughes, C., Lee, Y., Lavery, D., and Shen, J. P. 2001. Speculative precomputation: Long-range prefetching of delinquent loads. In 28th International Symposium on Computer Architecture.
-
(2001)
28th International Symposium on Computer Architecture
-
-
Collins, J.D.1
Wang, H.2
Tullsen, D.M.3
Hughes, C.4
Lee, Y.5
Lavery, D.6
Shen, J.P.7
-
9
-
-
0031340339
-
ProfileMe: Hardware support for instruction-level profiling on out-of-order processors
-
Dean, J., Hicks, J. E., Waldspurger, C. A., Weihl, W. E., and Chrysos, G. 1997. ProfileMe: Hardware support for instruction-level profiling on out-of-order processors. In 30th International Symposium on Microarchitecture.
-
(1997)
30th International Symposium on Microarchitecture
-
-
Dean, J.1
Hicks, J.E.2
Waldspurger, C.A.3
Weihl, W.E.4
Chrysos, G.5
-
10
-
-
0035694429
-
Performance characterization of a hardware mechanism for dynamic optimization
-
Fahs, B., Bose, S., Crum, M., Slechta, B., Spadini, F., Tung, T., Patel, S. J., and Lumetta, S. S. 2001. Performance characterization of a hardware mechanism for dynamic optimization. In 34th International Symposium on Microarchitecture.
-
(2001)
34th International Symposium on Microarchitecture
-
-
Fahs, B.1
Bose, S.2
Crum, M.3
Slechta, B.4
Spadini, F.5
Tung, T.6
Patel, S.J.7
Lumetta, S.S.8
-
13
-
-
84944400526
-
The non-critical buffer: Using load latency tolerance to improve data cache efficiency
-
Fisk, B. R. and Bahar, R. I. 1999. The non-critical buffer: Using load latency tolerance to improve data cache efficiency.
-
(1999)
-
-
Fisk, B.R.1
Bahar, R.I.2
-
14
-
-
0029653518
-
Whole-genome random sequencing and assembly of Haemophilus influenzae
-
Fleischmann, R. D. et al. 1995. Whole-genome random sequencing and assembly of Haemophilus influenzae. Science 269, 496-512.
-
(1995)
Science
, vol.269
, pp. 496-512
-
-
Fleischmann, R.D.1
-
17
-
-
0036287089
-
The optimal logic depth per pipeline stage is 6 to 8 FO4 inverter delays
-
Hrishikesh, M. S., Jouppi, N. P., Farkas, K. I., Burger, D., Keckler, S. W., and Shivakumar, P. 2002. The optimal logic depth per pipeline stage is 6 to 8 FO4 inverter delays. In 29th International Symposium on Computer Architecture.
-
(2002)
29th International Symposium on Computer Architecture
-
-
Hrishikesh, M.S.1
Jouppi, N.P.2
Farkas, K.I.3
Burger, D.4
Keckler, S.W.5
Shivakumar, P.6
-
18
-
-
84944394394
-
-
Available at http://developer.intel.com/design/pentium4/manuals/
-
INTEL. 2003. Intel Pentium 4 Processor Manual. Available at http://developer.intel.com/design/pentium4/manuals/.
-
(2003)
Intel Pentium 4 Processor Manual
-
-
-
25
-
-
84908877398
-
Non-vital loads
-
Rakvic, R., Black, B., Limaye, D., and Shen, J. P. 2002. Non-vital loads. In 8th International Symposium on High-Performance Computer Architecture.
-
(2002)
8th International Symposium on High-Performance Computer Architecture
-
-
Rakvic, R.1
Black, B.2
Limaye, D.3
Shen, J.P.4
-
26
-
-
84944382929
-
-
Ranganathan, P., Gharachorloo, K., Adve, S. V., and Barroso, L. A. 1998. Performance of database workloads on shared-memory systems with out-of-order processors.
-
(1998)
Performance of database workloads on shared-memory systems with out-of-order processors
-
-
Ranganathan, P.1
Gharachorloo, K.2
Adve, S.V.3
Barroso, L.A.4
-
27
-
-
84883540577
-
The impact of architectural trends on operating system performance
-
Rosenblum, M., Bugnion, E., Herrod, S. A., Witchel, E., and Gupta, A. 1995. The impact of architectural trends on operating system performance. In 15th Symposium on Operating Systems Principles.
-
(1995)
15th Symposium on Operating Systems Principles
-
-
Rosenblum, M.1
Bugnion, E.2
Herrod, S.A.3
Witchel, E.4
Gupta, A.5
-
28
-
-
0039177624
-
Speculative Data-Driven Sequencing for Imperative Programs
-
CS-TR-2000-1411, University of Wisconsin, Madison
-
Roth, A. and Sohi, G. 2000. Speculative Data-Driven Sequencing for Imperative Programs. Tech. Rep. CS-TR-2000-1411, University of Wisconsin, Madison.
-
(2000)
Tech. Rep.
-
-
Roth, A.1
Sohi, G.2
-
30
-
-
0345272496
-
Energy-efficient processor design using multiple clock domains with dynamic voltage and frequency scaling
-
Semeraro, G., Magklis, G., Balasubramonian, R., Albonesi, D., Dwarkadas, S., and Scott, M. 2002. Energy-efficient processor design using multiple clock domains with dynamic voltage and frequency scaling. In 8th International Symposium on High-Performance Computer Architecture.
-
(2002)
8th International Symposium on High-Performance Computer Architecture
-
-
Semeraro, G.1
Magklis, G.2
Balasubramonian, R.3
Albonesi, D.4
Dwarkadas, S.5
Scott, M.6
-
34
-
-
0036652569
-
Pentium 4 performance-monitoring features
-
July
-
Sprunt, B. 2002. Pentium 4 performance-monitoring features. IEEE Micro, July.
-
(2002)
IEEE Micro
-
-
Sprunt, B.1
-
35
-
-
0034844456
-
Locality vs. criticality
-
Srinivasan, S. T., ching Ju, R. D., Lebeck, A. R., and Wilkerson, C. 2001. Locality vs. criticality. In 28th International Symposium on Computer Architecture.
-
(2001)
28th International Symposium on Computer Architecture
-
-
Srinivasan, S.T.1
ching Ju, R.D.2
Lebeck, A.R.3
Wilkerson, C.4
-
37
-
-
0033703889
-
A scalable approach to thread-level speculation
-
Steffan, J. G., Colohan, C. B., Zhai, A., and Mowry, T. C. 2000. A scalable approach to thread-level speculation. In 27th International Symposium on Computer Architecture.
-
(2000)
27th International Symposium on Computer Architecture
-
-
Steffan, J.G.1
Colohan, C.B.2
Zhai, A.3
Mowry, T.C.4
-
38
-
-
0034817930
-
Dynamic prediction of critical path instructions
-
Tune, E., Liang, D., Tullsen, D. M., and Calder, B. 2001. Dynamic prediction of critical path instructions. In 7th International Symposium on High-Performance Computer Architecture.
-
(2001)
7th International Symposium on High-Performance Computer Architecture
-
-
Tune, E.1
Liang, D.2
Tullsen, D.M.3
Calder, B.4
-
41
-
-
84944805749
-
Performance analysis using the MIPS R10000 performance counters
-
Zagha, M., Larson, B., Turner, S., and Itzkowitz, M. 1996. Performance analysis using the MIPS R10000 performance counters. In Supercomputing '96.
-
(1996)
Supercomputing '96
-
-
Zagha, M.1
Larson, B.2
Turner, S.3
Itzkowitz, M.4