-
1
-
-
34548016218
-
-
PhD thesis, Department of Electrical and Computer Engineering, University of Toronto
-
T. M. Aamodt. Modeling and Optimization of Speculative Threads. PhD thesis, Department of Electrical and Computer Engineering, University of Toronto, 2006.
-
(2006)
Modeling and Optimization of Speculative Threads
-
-
Aamodt, T.M.1
-
2
-
-
33646170275
-
A Framework for Modeling and Optimization of Prescient Instruction Prefetch
-
T. M. Aamodt, P. Marcuello, P. Chow, A. González, P. Hammarlund, H. Wang, and J. P. Shen. A Framework for Modeling and Optimization of Prescient Instruction Prefetch. In ACM SIGMETRICS Int'l Conf. on Measurement and Modeling of Computer Systems, pages 13-24, 2003.
-
(2003)
ACM SIGMETRICS Int'l Conf. on Measurement and Modeling of Computer Systems
, pp. 13-24
-
-
Aamodt, T.M.1
Marcuello, P.2
Chow, P.3
González, A.4
Hammarlund, P.5
Wang, H.6
Shen, J.P.7
-
6
-
-
34548021357
-
-
D. Burger and T. M. Austin. The SimpleScalar Tool Set, Version 2.0, 1997
-
D. Burger and T. M. Austin. The SimpleScalar Tool Set, Version 2.0. http://www.simplescalar.com, 1997.
-
-
-
-
8
-
-
0026368758
-
Using Profile Information to Assist Classic Code Optimizations
-
P. P. Chang, S. A. Mahlke, and W. Hwu. Using Profile Information to Assist Classic Code Optimizations. Software: Practice and Experience, 21(12):1301-1321, 1991.
-
(1991)
Software: Practice and Experience
, vol.21
, Issue.12
, pp. 1301-1321
-
-
Chang, P.P.1
Mahlke, S.A.2
Hwu, W.3
-
9
-
-
0036294826
-
Difficult-Path Branch Prediction Using Subordinate Microthreads
-
R. S. Chappell, F. Tseng, A. Yoaz, and Y. N. Patt. Difficult-Path Branch Prediction Using Subordinate Microthreads. In 29th Int'l Symp. on Computer Architecture, pages 307-317, 2002.
-
(2002)
29th Int'l Symp. on Computer Architecture
, pp. 307-317
-
-
Chappell, R.S.1
Tseng, F.2
Yoaz, A.3
Patt, Y.N.4
-
10
-
-
17244373536
-
Interprocedural probabilistic pointer analysis
-
P.-S. Chen, Y.-S. Hwang, R. D.-C. Ju, and J. K. Lee. Interprocedural probabilistic pointer analysis. IEEE Trans. Parallel Distrib. Syst., 15(10):893-907, 2004.
-
(2004)
IEEE Trans. Parallel Distrib. Syst
, vol.15
, Issue.10
, pp. 893-907
-
-
Chen, P.-S.1
Hwang, Y.-S.2
Ju, R.D.-C.3
Lee, J.K.4
-
11
-
-
0034839033
-
Speculative Precomputation: Long-Range Prefetching of Delinquent Loads
-
J. D. Collins, H. Wang, D. M. Tullsen, C. Hughes, Y.-F. Lee, D. Lavery, and J. P. Shen. Speculative Precomputation: Long-Range Prefetching of Delinquent Loads. In 28th Int'l Symp. on Computer Architecture, pages 14-25, 2001.
-
(2001)
28th Int'l Symp. on Computer Architecture
, pp. 14-25
-
-
Collins, J.D.1
Wang, H.2
Tullsen, D.M.3
Hughes, C.4
Lee, Y.-F.5
Lavery, D.6
Shen, J.P.7
-
12
-
-
0004174428
-
Assisted execution
-
98-25, Department of EE-Systems, University of Southern California, October
-
M. Dubois and Y. Song. Assisted execution. Technical Report CENG 98-25, Department of EE-Systems, University of Southern California, October 1998.
-
(1998)
Technical Report CENG
-
-
Dubois, M.1
Song, Y.2
-
13
-
-
0019596071
-
Trace Scheduling: A Technique for Global Microcode Compaction
-
J. A. Fisher. Trace Scheduling: A Technique for Global Microcode Compaction. IEEE Trans. Computers, 30(7):478-490, 1981.
-
(1981)
IEEE Trans. Computers
, vol.30
, Issue.7
, pp. 478-490
-
-
Fisher, J.A.1
-
15
-
-
0030380793
-
Maximizing Multiprocessor Performance with the SUIF Compiler
-
December
-
M. W. Hall, J. M. Anderson, S. P. Amarasinghe, B. R. Murphy, S.-W. Liao, E. Bugnion, and M. S. Lam. Maximizing Multiprocessor Performance with the SUIF Compiler. IEEE Computer, December 1996.
-
(1996)
IEEE Computer
-
-
Hall, M.W.1
Anderson, J.M.2
Amarasinghe, S.P.3
Murphy, B.R.4
Liao, S.-W.5
Bugnion, E.6
Lam, M.S.7
-
17
-
-
3042569221
-
Physical Experimentation with Prefetching Helper Threads on Intel's Hyper-Threaded Processors
-
D. Kim, S.-W. Liao, P. H. Wang, J. del Cuvillo, X. Tian, X. Zou, H. Wang, D. Yeung, M. Girkar, and J. P. Shen. Physical Experimentation with Prefetching Helper Threads on Intel's Hyper-Threaded Processors. In 2nd Intl. Symp. on Code Generation and Optimization (CGO 2004), pages 27-38, 2004.
-
(2004)
2nd Intl. Symp. on Code Generation and Optimization (CGO 2004)
, pp. 27-38
-
-
Kim, D.1
Liao, S.-W.2
Wang, P.H.3
del Cuvillo, J.4
Tian, X.5
Zou, X.6
Wang, H.7
Yeung, D.8
Girkar, M.9
Shen, J.P.10
-
18
-
-
0036949290
-
Design and Evaluation of Compiler Algorithms for Pre-Execution
-
D. Kim and D. Yeung. Design and Evaluation of Compiler Algorithms for Pre-Execution. In ASPLOS-X, pages 159-170, 2002.
-
(2002)
ASPLOS-X
, pp. 159-170
-
-
Kim, D.1
Yeung, D.2
-
20
-
-
0036036248
-
Post-Pass Binary Adaptation for Software-Based Speculative Precomputation
-
S. S. Liao, P. H. Wang, H. Wang, G. Hoflehner, D. Lavery, and J. P. Shen. Post-Pass Binary Adaptation for Software-Based Speculative Precomputation. In Conf. on Programming Language Design and Implementation, pages 117-128, 2002.
-
(2002)
Conf. on Programming Language Design and Implementation
, pp. 117-128
-
-
Liao, S.S.1
Wang, P.H.2
Wang, H.3
Hoflehner, G.4
Lavery, D.5
Shen, J.P.6
-
24
-
-
0031357519
-
Predicting Data Cache Misses in Non-Numeric Applications Through Correlation Profiling
-
T. C. Mowry and C.-K. Luk. Predicting Data Cache Misses in Non-Numeric Applications Through Correlation Profiling. In 30th Int'l Symp. on Microarchitecture, pages 314-320, 1997.
-
(1997)
30th Int'l Symp. on Microarchitecture
, pp. 314-320
-
-
Mowry, T.C.1
Luk, C.-K.2
-
26
-
-
27544460107
-
Energy-effectiveness of pre-execution and energy-aware p-thread selection
-
V. Petric and A. Roth. Energy-effectiveness of pre-execution and energy-aware p-thread selection. In 32nd Int'l Symp. on Computer Architecture, pages 322-333, 2005.
-
(2005)
32nd Int'l Symp. on Computer Architecture
, pp. 322-333
-
-
Petric, V.1
Roth, A.2
-
29
-
-
84948958124
-
A Quantitative Framework for Automated Pre-Execution Thread Selection
-
A. Roth and G. S. Sohi. A Quantitative Framework for Automated Pre-Execution Thread Selection. In Int'l Symp. on Microarchitecture, pages 430-441, 2002.
-
(2002)
Int'l Symp. on Microarchitecture
, pp. 430-441
-
-
Roth, A.1
Sohi, G.S.2
-
33
-
-
34548008526
-
-
Standard Performance Evaluation Corporation. SPEC 2000 CPU benchmarks, http://www.spec.org/.
-
Standard Performance Evaluation Corporation. SPEC 2000 CPU benchmarks, http://www.spec.org/.
-
-
-
-
34
-
-
0019588127
-
A Unified Approach to Path Problems
-
R. E. Tarjan. A Unified Approach to Path Problems. Journal of the ACM, 28(3):577-593, 1981.
-
(1981)
Journal of the ACM
, vol.28
, Issue.3
, pp. 577-593
-
-
Tarjan, R.E.1
-
35
-
-
0019587817
-
Fast Algorithms for Solving Path Problems
-
R. E. Tarjan. Fast Algorithms for Solving Path Problems. Journal of the ACM, 28(3):594-614, 1981.
-
(1981)
Journal of the ACM
, vol.28
, Issue.3
, pp. 594-614
-
-
Tarjan, R.E.1
-
36
-
-
19944428009
-
Helper threads via virtual multithreading on an experimental Itanium 2 processor-based platform
-
P. H. Wang, J. D. Collins, H. Wang, D. Kim, B. Greene, K.-M. Chan, A. B. Yunus, T. Sych, S. F. Moore, and J. P. Shen. Helper threads via virtual multithreading on an experimental Itanium 2 processor-based platform. In ASPLOS, pages 144-155, 2004.
-
(2004)
ASPLOS
, pp. 144-155
-
-
Wang, P.H.1
Collins, J.D.2
Wang, H.3
Kim, D.4
Greene, B.5
Chan, K.-M.6
Yunus, A.B.7
Sych, T.8
Moore, S.F.9
Shen, J.P.10
-
38
-
-
0033707298
-
Understanding the backward slices of performance degrading instructions
-
C. B. Zilles and G. S. Sohi. Understanding the backward slices of performance degrading instructions. In 27th Int'l Symp. on Computer Architecture, pages 172-181, 2000.
-
(2000)
27th Int'l Symp. on Computer Architecture
, pp. 172-181
-
-
Zilles, C.B.1
Sohi, G.S.2