-
1
-
-
0031237789
-
Simultaneous multithreading: A platform for next-generation processors
-
September/October
-
S. J. Eggers, J. S. Emer, H. M. Levy, J. L. Lo, R. L. Stamm, and D. M. Tullsen, "Simultaneous multithreading: A platform for next-generation processors," IEEE Micro, vol. 17, pp. 12-18, September/October 1997.
-
(1997)
IEEE Micro
, vol.17
, pp. 12-18
-
-
Eggers, S.J.1
Emer, J.S.2
Levy, H.M.3
Lo, J.L.4
Stamm, R.L.5
Tullsen, D.M.6
-
2
-
-
0033348795
-
A chip-multiprocessor architecture with speculative multithreading
-
V. Krishnan and J. Torrellas, "A chip-multiprocessor architecture with speculative multithreading," IEEE Transactions on Computers, vol. 48, no. 9, pp. 866-880, 1999.
-
(1999)
IEEE Transactions on Computers
, vol.48
, Issue.9
, pp. 866-880
-
-
Krishnan, V.1
Torrellas, J.2
-
4
-
-
0032786014
-
Supporting fi ne-grained synchronization on a simultaneous multithreading processor
-
D. M. Tullsen, J. L. Lo, S. J. Eggers, and H. M. Levy, "Supporting fi ne-grained synchronization on a simultaneous multithreading processor," in International Symposium on Architectural Support for Programming Languages and Operating Systems, pp. 54-58, 2000.
-
(2000)
International Symposium on Architectural Support for Programming Languages and Operating Systems
, pp. 54-58
-
-
Tullsen, D.M.1
Lo, J.L.2
Eggers, S.J.3
Levy, H.M.4
-
5
-
-
77950300305
-
ILP versus TLP on SMT
-
November
-
N. Mitchell, L. Carter, J. Ferrante, and D. Tullsen, "ILP versus TLP on SMT," Supercomputing, November 1999.
-
(1999)
Supercomputing
-
-
Mitchell, N.1
Carter, L.2
Ferrante, J.3
Tullsen, D.4
-
7
-
-
0029480935
-
Compiler technology for future microprocessors
-
December
-
W. W. Hwu, R. E. Hank, D. M. Gallagher, S. A. Mahlke, D. M. Lavery, G. E. Haab, J. C. Gyllenhaal, and D. I. August, "Compiler technology for future microprocessors," Proceedings of the IEEE, vol. 83, pp. 1625-1995, December 1995.
-
(1995)
Proceedings of the IEEE
, vol.83
, pp. 1625-1995
-
-
Hwu, W.W.1
Hank, R.E.2
Gallagher, D.M.3
Mahlke, S.A.4
Lavery, D.M.5
Haab, G.E.6
Gyllenhaal, J.C.7
August, D.I.8
-
8
-
-
0029666641
-
Exploiting choice: Instruction fetch and issue on an implementable simultaneous multithreading processor
-
D. Tullsen, S. Eggers, J. Emer, H. Levy, J. Lo, and R. Stamm, "Exploiting choice: Instruction fetch and issue on an implementable simultaneous multithreading processor," in Proceedings of the 23rd annual International Symposium on Computer Architecture, pp. 191-202, 1996.
-
(1996)
Proceedings of the 23rd Annual International Symposium on Computer Architecture
, pp. 191-202
-
-
Tullsen, D.1
Eggers, S.2
Emer, J.3
Levy, H.4
Lo, J.5
Stamm, R.6
-
9
-
-
84949769332
-
A new memory monitoring scheme for memory-aware scheduling and partitioning
-
G. E. Suh, S. Devadas, and L. Rudolph, "A new memory monitoring scheme for memory-aware scheduling and partitioning," in Proceedings of the Eigth International Symposium on High Performance Computer Architecture (HPCA), pp. 117-128, 2002.
-
(2002)
Proceedings of the Eigth International Symposium on High Performance Computer Architecture (HPCA)
, pp. 117-128
-
-
Suh, G.E.1
Devadas, S.2
Rudolph, L.3
-
10
-
-
10444238444
-
Fair cache sharing and partitioning in a chip multiprocessor architecture
-
IEEE Computer Society
-
S. Kim, D. Chandra, and Y. Solihin, "Fair cache sharing and partitioning in a chip multiprocessor architecture," in Proceedings of the International Conference on Parallel Architectures and Compilation Techniques, 2004 (PACT '04), pp. 111-121, IEEE Computer Society, 2004.
-
(2004)
Proceedings of the International Conference on Parallel Architectures and Compilation Techniques, 2004 (PACT '04)
, pp. 111-121
-
-
Kim, S.1
Chandra, D.2
Solihin, Y.3
-
11
-
-
4143087192
-
Predictable performance in smt processors
-
ACM Press
-
F. J. Cazorla, P. M. Knijnenburg, R. Sakellariou, E. Fernendez, A. Ramirez, and M. Valero, "Predictable performance in smt processors," in CF'04: Proceedings of the First Conference on Computing Frontiers, pp. 433-443, ACM Press, 2004.
-
(2004)
CF'04: Proceedings of the First Conference on Computing Frontiers
, pp. 433-443
-
-
Cazorla, F.J.1
Knijnenburg, P.M.2
Sakellariou, R.3
Fernendez, E.4
Ramirez, A.5
Valero, M.6
-
12
-
-
21244457025
-
Special issue on intel hyperthreading in pentium-4 processors
-
January
-
I. Corporation, "Special issue on intel hyperthreading in pentium-4 processors," Intel Technology Journal, vol. 1, January 2002.
-
(2002)
Intel Technology Journal
, vol.1
-
-
Corporation, I.1
-
13
-
-
4143116894
-
Contention on 2nd level cache may limit the effectiveness of simultaneous multithreading
-
IRISA
-
S. Hily and A. Seznec, "Contention on 2nd level cache may limit the effectiveness of simultaneous multithreading," Tech. Rep. PI-1086, IRISA, 1997.
-
(1997)
Tech. Rep.
, vol.PI-1086
-
-
Hily, S.1
Seznec, A.2
-
15
-
-
0013229812
-
Thread-sensitive scheduling for smt processors
-
Department of Computer Science & Engineering University of Washington, Seattle, Washington
-
S. Parekh, S. Eggers, and H. Levy, "Thread-sensitive scheduling for smt processors," tech. rep., Department of Computer Science & Engineering University of Washington, Seattle, Washington, 2000.
-
(2000)
Tech. Rep.
-
-
Parekh, S.1
Eggers, S.2
Levy, H.3
-
16
-
-
0025028257
-
The Tera computer system
-
R. Alverson, D. Callahan, D. Cummings, B. Koblenz, A. Porterfi eld, and B. Smith, "The Tera computer system," in Proceedings of the 1990 International Conference on Supercomputing, pp. 1-6, 1990.
-
(1990)
Proceedings of the 1990 International Conference on Supercomputing
, pp. 1-6
-
-
Alverson, R.1
Callahan, D.2
Cummings, D.3
Koblenz, B.4
Porterfield, A.5
Smith, B.6
-
17
-
-
0025431380
-
APRIL: A processor architecture for multiprocessing
-
Seattle, WA
-
A. Agarwal, B. Lim, D. Kranz, and J. Kubiatowicz, "APRIL: A processor architecture for multiprocessing," in Proceedings of the 17th Annual International Symposium on Computer Architecture, (Seattle, WA), pp. 104-114, 1990.
-
(1990)
Proceedings of the 17th Annual International Symposium on Computer Architecture
, pp. 104-114
-
-
Agarwal, A.1
Lim, B.2
Kranz, D.3
Kubiatowicz, J.4
-
22
-
-
10444257607
-
Predictable fi ne-grained cache behavior for enhanced simultaneous multithreading (SMT) scheduling
-
J. Kihm, A. Janiszewski, and D. Connors, "Predictable fi ne-grained cache behavior for enhanced simultaneous multithreading (SMT) scheduling," in Proceedings of International Conference on Computing, Communications and Control Technologies, 2004.
-
(2004)
Proceedings of International Conference on Computing, Communications and Control Technologies
-
-
Kihm, J.1
Janiszewski, A.2
Connors, D.3
-
23
-
-
84877021547
-
Multi-processor performance on the tera mta
-
IEEE Computer Society
-
A. Snavely, L. Carter, J. Boisseau, A. Majumdar, K. S. Gatlin, N. Mitchell, J. Feo, and B. Koblenz, "Multi-processor performance on the tera mta," in SC '98: Proceedings of the Proceedings of the IEEE/ACM SC98 Conference, p. 4, IEEE Computer Society, 1998.
-
(1998)
SC '98: Proceedings of the Proceedings of the IEEE/ACM SC98 Conference
, pp. 4
-
-
Snavely, A.1
Carter, L.2
Boisseau, J.3
Majumdar, A.4
Gatlin, K.S.5
Mitchell, N.6
Feo, J.7
Koblenz, B.8
-
24
-
-
0026157612
-
IMPACT: An architectural framework for multiple-instruction-issue processors
-
May
-
P. P. Chang, S. A. Mahlke, W. Y. Chen, N. J. Warter, and W. W. Hwu, "IMPACT: An architectural framework for multiple-instruction-issue processors," in Proceedings of the 18th International Symposium on Computer Architecture, pp. 266-275, May 1991.
-
(1991)
Proceedings of the 18th International Symposium on Computer Architecture
, pp. 266-275
-
-
Chang, P.P.1
Mahlke, S.A.2
Chen, W.Y.3
Warter, N.J.4
Hwu, W.W.5
-
25
-
-
10444263677
-
Architectural support for enhanced SMT job scheduling
-
IEEE Computer Society
-
A. Settle, J. Kihm, A. Janiszewski, and D. A. Connors, "Architectural support for enhanced SMT job scheduling," in Proceedings of the International Conference on Parallel Architectures and Compilation Techniques, IEEE Computer Society, 2004.
-
(2004)
Proceedings of the International Conference on Parallel Architectures and Compilation Techniques
-
-
Settle, A.1
Kihm, J.2
Janiszewski, A.3
Connors, D.A.4
-
29
-
-
0034312472
-
A multithreaded powerpc processor for commercial servers
-
November
-
J. M. Borkenhagen, R. J. Eickemeyer, R. N. Kalla, and S. R. Kunkel, "A multithreaded powerpc processor for commercial servers," IBM Journal of Research and Development, vol. 44, pp. 885-898, November 2000.
-
(2000)
IBM Journal of Research and Development
, vol.44
, pp. 885-898
-
-
Borkenhagen, J.M.1
Eickemeyer, R.J.2
Kalla, R.N.3
Kunkel, S.R.4
-
31
-
-
0032662989
-
Simultaneous subordinate microthreading (SSMT)
-
May
-
R. Chappell, J. Stark, S. Kim, S. Reinhardt, and Y. Patt, "Simultaneous subordinate microthreading (SSMT)," in Proceedings of the 26th Annual International Symposium on Computer Architecture, pp. 186-195, May 1999.
-
(1999)
Proceedings of the 26th Annual International Symposium on Computer Architecture
, pp. 186-195
-
-
Chappell, R.1
Stark, J.2
Kim, S.3
Reinhardt, S.4
Patt, Y.5
-
32
-
-
85088550971
-
Multiscalar processors
-
G. S. Sohi, S. E. Breach, and T. N. Vijaykumar, "Multiscalar processors," in 25 Years ISCA: Retrospectives and Reprints, pp. 521-532, 1998.
-
(1998)
25 Years ISCA: Retrospectives and Reprints
, pp. 521-532
-
-
Sohi, G.S.1
Breach, S.E.2
Vijaykumar, T.N.3
-
33
-
-
22644451045
-
Exploiting speculative thread-level parallelism on a smt processor
-
P. Marcuello and A. Gonzalez, "Exploiting speculative thread-level parallelism on a smt processor.," in HPCN Europe, pp. 754-763, 1999.
-
(1999)
HPCN Europe
, pp. 754-763
-
-
Marcuello, P.1
Gonzalez, A.2
-
35
-
-
10444224244
-
Prescient instruction prefetch
-
November
-
T. Aamodt, P. Marcuello, P. Chow, P. Hammarlund, and H. Wang, "Prescient instruction prefetch," in Proc. of the 6th Workshop on Multithreaded Execution, Architecture and Compilation, pp. 2-10, November 2002.
-
(2002)
Proc. of the 6th Workshop on Multithreaded Execution, Architecture and Compilation
, pp. 2-10
-
-
Aamodt, T.1
Marcuello, P.2
Chow, P.3
Hammarlund, P.4
Wang, H.5
-
36
-
-
0034839033
-
Speculative precomputation: Long-range prefetching of delinquent loads
-
July
-
J. D. Collins, H. Wang, D. M. Tullsen, C. J. Hughes, Y. fong Lee, D. Lavery, and J. P. Shen, "Speculative precomputation: Long-range prefetching of delinquent loads," in Proceedings of the 28th International Symposium on Computer Architecture, July 2001.
-
(2001)
Proceedings of the 28th International Symposium on Computer Architecture
-
-
Collins, J.D.1
Wang, H.2
Tullsen, D.M.3
Hughes, C.J.4
Lee, Y.F.5
Lavery, D.6
Shen, J.P.7
-
38
-
-
19944432981
-
Helper threads via virtual multithreading on an experimental itanium 2 processor-based platform
-
ACM Press
-
P. H. Wang, J. D. Collins, H. Wang, D. Kim, B. Greene, K.-M. Chan, A. B. Yunus, T. Sych, S. F. Moore, and J. P. Shen, "Helper threads via virtual multithreading on an experimental itanium 2 processor-based platform," in ASPLOS-XI: Proceedings of the 11th international conference on Architectural support for programming languages and operating systems, pp. 144-155, ACM Press, 2004.
-
(2004)
ASPLOS-XI: Proceedings of the 11th International Conference on Architectural Support for Programming Languages and Operating Systems
, pp. 144-155
-
-
Wang, P.H.1
Collins, J.D.2
Wang, H.3
Kim, D.4
Greene, B.5
Chan, K.-M.6
Yunus, A.B.7
Sych, T.8
Moore, S.F.9
Shen, J.P.10
-
42
-
-
84981164301
-
Boosting SMT performance by speculation control
-
K. Luo, M. Franklin, S. S. Mukherjee, and A. Seznec, "Boosting SMT performance by speculation control," in Proceedings of the International Parallel and Distributed Processing Symposium, pp. 2-9, 2001.
-
(2001)
Proceedings of the International Parallel and Distributed Processing Symposium
, pp. 2-9
-
-
Luo, K.1
Franklin, M.2
Mukherjee, S.S.3
Seznec, A.4
-
43
-
-
0031594020
-
An analysis of database workload performance on simultaneous multithreaded processors
-
J. L. Lo, L. A. Barroso, S. J. Eggers, K. Gharachorloo, H. M. Levy, and S. S. Parekh, "An analysis of database workload performance on simultaneous multithreaded processors," in Proceedings of the 25th International Symposium on Computer Achitecture (ISCA), pp. 39-50, 1998.
-
(1998)
Proceedings of the 25th International Symposium on Computer Achitecture (ISCA)
, pp. 39-50
-
-
Lo, J.L.1
Barroso, L.A.2
Eggers, S.J.3
Gharachorloo, K.4
Levy, H.M.5
Parekh, S.S.6
-
44
-
-
21244474546
-
Predicting inter-thread cache contention on a chip multi-processor architecture
-
Feb
-
D. Chandra, F. Quo, S. Kim, and Y. Solihin, "Predicting inter-thread cache contention on a chip multi-processor architecture," in Proceedings of the 11th International Symposium on High Performance Computer Architecture (HPCA), pp. 340-351, Feb 2005.
-
(2005)
Proceedings of the 11th International Symposium on High Performance Computer Architecture (HPCA)
, pp. 340-351
-
-
Chandra, D.1
Quo, F.2
Kim, S.3
Solihin, Y.4
-
45
-
-
0031364101
-
Tuning compiler optimizations for simultaneous multithreading
-
December
-
J. L. Lo, S. J. Eggers, H. M. Levy, S. S. Parekh, and D. M. Tullsen, "Tuning compiler optimizations for simultaneous multithreading," in Proceedings of the 30th International Symposium on Microarchitecture, pp. 114-124, December 1997.
-
(1997)
Proceedings of the 30th International Symposium on Microarchitecture
, pp. 114-124
-
-
Lo, J.L.1
Eggers, S.J.2
Levy, H.M.3
Parekh, S.S.4
Tullsen, D.M.5
-
47
-
-
79955715200
-
The working set model for program behavior
-
May
-
P. J. Denning, "The working set model for program behavior," Communications of the ACM, vol. 11, pp. 323-333, May 1968.
-
(1968)
Communications of the ACM
, vol.11
, pp. 323-333
-
-
Denning, P.J.1
-
50
-
-
84934274832
-
Using hardware counters to automatically improve memory performance
-
M. M. Tikir and J. K. Hollingsworth, "Using hardware counters to automatically improve memory performance," in Proceedings of 2004 High Performance Computing, Networking, and Storage Conference (SC), p. 46, 2004.
-
(2004)
Proceedings of 2004 High Performance Computing, Networking, and Storage Conference (SC)
, pp. 46
-
-
Tikir, M.M.1
Hollingsworth, J.K.2
-
51
-
-
0032644674
-
Reducing cache misses using hardware and software page placement
-
ACM Press
-
T. Sherwood, B. Calder, and J. Emer, "Reducing cache misses using hardware and software page placement," in ICS '99: Proceedings of the 13th International Conference on Supercomputing, pp. 155-164, ACM Press, 1999.
-
(1999)
ICS '99: Proceedings of the 13th International Conference on Supercomputing
, pp. 155-164
-
-
Sherwood, T.1
Calder, B.2
Emer, J.3
|