-
1
-
-
0033717865
-
Clock rate versus IPC: The end of the road for conventional microarchitectures
-
AGARWAL, W., HRISHIKESH, M., KECKLER, S., AND BURGER, D. 2000. Clock rate versus IPC: the end of the road for conventional microarchitectures. In Proceedings of ISCA 27.
-
(2000)
Proceedings of ISCA
, vol.27
-
-
Agarwal, W.1
Hrishikesh, M.2
Keckler, S.3
Burger, D.4
-
2
-
-
0004072686
-
-
Addison Wesley
-
AHO, A. V., SETHI, R., AND ULLMAN, J. D. 1986. Compilers Principles, Techniques and Tools. Addison Wesley.
-
(1986)
Compilers Principles, Techniques and Tools
-
-
Aho, A.V.1
Sethi, R.2
Ullman, J.D.3
-
3
-
-
0012527549
-
A dynamic multithreading processor
-
AKKARY, H. AND DHISCOLL, M. 1998. A dynamic multithreading processor. In MICRO-31.
-
(1998)
MICRO-31
-
-
Akkary, H.1
Dhiscoll, M.2
-
4
-
-
33745207042
-
Data memory alternatives for multiscalar processors
-
Computer Sciences Department, University of Wisconsin-Madison
-
BREACH, S. E., VIJAYKUMAR, T. N., GOPAL, S., SMITH, J. E., AND SOHI, G. S. 1996. Data memory alternatives for multiscalar processors. Tech. Rep. CS-TR-1997-1344, Computer Sciences Department, University of Wisconsin-Madison.
-
(1996)
Tech. Rep.
, vol.CS-TR-1997-1344
-
-
Breach, S.E.1
Vijaykumar, T.N.2
Gopal, S.3
Smith, J.E.4
Sohi, G.S.5
-
7
-
-
0033689702
-
Architectural support for scalable speculative parallelization in shared-memory multiprocessors
-
CINTRA, M., MARTÍNEZ, J. F., AND TORRELLAS, J. 2000. Architectural support for scalable speculative parallelization in shared-memory multiprocessors. In Proceedings of ISCA 27.
-
(2000)
Proceedings of ISCA
, vol.27
-
-
Cintra, M.1
Martínez, J.F.2
Torrellas, J.3
-
8
-
-
0012529668
-
Learning cross-thread violations in speculative parallelization for multiprocessors
-
CINTRA, M. AND TORRELLAS, J. 2002. Learning cross-thread violations in speculative parallelization for multiprocessors. In Proceedings of the 8th HPCA.
-
(2002)
Proceedings of the 8th HPCA
-
-
Cintra, M.1
Torrellas, J.2
-
10
-
-
0028345337
-
A study of single-chip processor/cache organizations for large number of transistors
-
FARRENS, M., TYSON, G., AND PLESZKUN, A. 1994. A study of single-chip processor/cache organizations for large number of transistors. In Proceedings of ISCA 21. pp. 338-347.
-
(1994)
Proceedings of ISCA
, vol.21
, pp. 338-347
-
-
Farrens, M.1
Tyson, G.2
Pleszkun, A.3
-
11
-
-
33745183027
-
Suds: Primitive mechanisms for memory dependence speculation
-
January
-
FRANK, M., MORITZ, C., GREENWALD, B., AMARASINGHE, S., AND AGARWAL, A. 1999. Suds: Primitive mechanisms for memory dependence speculation. Tech. Rep. MIT/LCS Technical Memo LCS-TM-591. January.
-
(1999)
Tech. Rep. MIT/LCS Technical Memo
, vol.LCS-TM-591
-
-
Frank, M.1
Moritz, C.2
Greenwald, B.3
Amarasinghe, S.4
Agarwal, A.5
-
12
-
-
0007997616
-
ARB: A hardware mechanism for dynamic reordering of memory references
-
FRANKLIN, M. AND SOHI, G. S. 1996. ARB: A hardware mechanism for dynamic reordering of memory references. IEEE Trans. Comput. 45, 5 (May).
-
(1996)
IEEE Trans. Comput.
, vol.45
, Issue.5 MAY
-
-
Franklin, M.1
Sohi, G.S.2
-
13
-
-
43949089615
-
Tradeoffs in buffering memory state for thread-level speculation in multiprocessors
-
GARZARAN, M. J., PRVULOVIC, M., LLABERIA, J. M., VINALS, V., RAUCHWERGER, L., AND TORRELLAS, J. 2003. Tradeoffs in buffering memory state for thread-level speculation in multiprocessors. In Proceedings of the Ninth International Symposium on High-Performance Computer Architecture (HPCA).
-
(2003)
Proceedings of the Ninth International Symposium on High-performance Computer Architecture (HPCA)
-
-
Garzaran, M.J.1
Prvulovic, M.2
Llaberia, J.M.3
Vinals, V.4
Rauchwerger, L.5
Torrellas, J.6
-
14
-
-
16144367233
-
Lazy threads: Implementing a fast parallel call
-
GOLDSTEIN, S. C., SCHAUSER, K. E., AND CULLER, D. E. 1996. Lazy threads: Implementing a fast parallel call. J. Para. Distrib. Comput. 37, 1 (Aug.), 5-20.
-
(1996)
J. Para. Distrib. Comput.
, vol.37
, Issue.1 AUG
, pp. 5-20
-
-
Goldstein, S.C.1
Schauser, K.E.2
Culler, D.E.3
-
15
-
-
0031599590
-
Speculative versioning cache
-
GOPAL, S., VIJAYKUMAR, T., SMITH, J., AND SOHI, G. 1998. Speculative versioning cache. In Proceedings of the Fourth International Symposium on High-Performance Computer Architecture.
-
(1998)
Proceedings of the Fourth International Symposium on High-performance Computer Architecture
-
-
Gopal, S.1
Vijaykumar, T.2
Smith, J.3
Sohi, G.4
-
16
-
-
78650416004
-
Techniques for speculative run-time parallelization of loops
-
GUPTA, M. AND NIM, R. 1998. Techniques for speculative run-time parallelization of loops. In Proceedings of Supercomputing 1998.
-
(1998)
Proceedings of Supercomputing 1998
-
-
Gupta, M.1
Nim, R.2
-
22
-
-
0030685588
-
The SGI origin: A ccNUMA highly scalable server
-
LAUDON, J. AND LENOSKI, D. 1997. The SGI Origin: A ccNUMA highly scalable server. In Proceedings of the 24th ISCA. 241-251.
-
(1997)
Proceedings of the 24th ISCA
, pp. 241-251
-
-
Laudon, J.1
Lenoski, D.2
-
25
-
-
0030717767
-
Dynamic speculation and synchronization of data dependences
-
MOSHOVOS, A. I., BREACH, S. E., VIJAYKUMAR, T., AND SOHI, G. S. 1997. Dynamic speculation and synchronization of data dependences. In Proceedings of the 24th ISCA.
-
(1997)
Proceedings of the 24th ISCA
-
-
Moshovos, A.I.1
Breach, S.E.2
Vijaykumar, T.3
Sohi, G.S.4
-
26
-
-
0002432406
-
The case for a single-chip multiprocessor
-
OLUKOTUN, K., NAYFEH, B. A., HAMMOND, L., WILSON, K., AND CHANG, K. 1996. The Case for a Single-Chip Multiprocessor. In Proceedings of ASPLOS-VII.
-
(1996)
Proceedings of ASPLOS-VII
-
-
Olukotun, K.1
Nayfeh, B.A.2
Hammond, L.3
Wilson, K.4
Chang, K.5
-
27
-
-
0034818863
-
Multiplex: Unifying conventional and speculative thread-level parallelism on a chip multiprocessor
-
OOI, C. L., KIM, S. W., PARK, I., EIGENMANN, R., FALSAFI, B., AND VIJAYKUMAR, T. N. 2001. Multiplex: Unifying conventional and speculative thread-level parallelism on a chip multiprocessor. In Proceedings of the International Conference on Supercomputing:
-
(2001)
Proceedings of the International Conference on Supercomputing
-
-
Ooi, C.L.1
Kim, S.W.2
Park, I.3
Eigenmann, R.4
Falsafi, B.5
Vijaykumar, T.N.6
-
29
-
-
0003926726
-
Quantifying the complexity of superscalar processors
-
University of Wisconsin-Madison
-
PALACHARLA, S., JOUPPI, N. P., AND SMITH, J. E. 1996. Quantifying the complexity of superscalar processors. Tech. Rep. CS-TR-1996-1328, University of Wisconsin-Madison.
-
(1996)
Tech. Rep.
, vol.CS-TR-1996-1328
-
-
Palacharla, S.1
Jouppi, N.P.2
Smith, J.E.3
-
32
-
-
0034852757
-
Removing architectural bottlenecks to the scalability of speculative parallelizatoin
-
PRVULOVIC, M., GAEZARAN, M., RAUCHWEHGEH, L., AND TOHRELLAS, J. 2001. Removing architectural bottlenecks to the scalability of speculative parallelizatoin. In proceedings of the 28th Annual International Symposium on Computer Architecture.
-
(2001)
Proceedings of the 28th Annual International Symposium on Computer Architecture
-
-
Prvulovic, M.1
Gaezaran, M.2
Rauchwehgeh, L.3
Tohrellas, J.4
-
33
-
-
84976823223
-
The LRPD test: Speculative run-time parallelization of loops with privatization and reduction parallelization
-
RAUCHWEHGER, L. AND PADUA, D. 1995. The LRPD Test: Speculative run-time parallelization of loops with privatization and reduction parallelization. In Proceedings of PLDI'95. 218-232.
-
(1995)
Proceedings of PLDI'95
, pp. 218-232
-
-
Rauchwehger, L.1
Padua, D.2
-
34
-
-
0009484280
-
Trace processors
-
ROTENBERG, E., JACOBSON, Q., SAZEIDES, Y., AND SMITH, J. 1997. Trace processors. In Proceedings of Micro 30.
-
(1997)
Proceedings of Micro
, vol.30
-
-
Rotenberg, E.1
Jacobson, Q.2
Sazeides, Y.3
Smith, J.4
-
37
-
-
0029178210
-
Multiscalar processors
-
SOHI, G. S., BREACH, S., AND VIJAYKUMAR, T. N. 1995. Multiscalar processors. In Proceedings of ISCA 22. 414-425.
-
(1995)
Proceedings of ISCA
, vol.22
, pp. 414-425
-
-
Sohi, G.S.1
Breach, S.2
Vijaykumar, T.N.3
-
38
-
-
84858891974
-
The SPEC benchmark suite
-
Standard Performance Evaluation Corporation
-
SPEC. 2000. The SPEC Benchmark Suite. Tech. rep., Standard Performance Evaluation Corporation. http://www.spechbench.org.
-
(2000)
Tech. Rep.
-
-
-
39
-
-
4544233320
-
Hardware support for thread-level speculation
-
Ph.D. thesis
-
STEFFAN, J. G. 2003. Hardware Support for Thread-Level Speculation. Ph.D. thesis, Carnegie Mellon University. Tech. Rep. CMU-CS-03-122.
-
(2003)
Carnegie Mellon University. Tech. Rep.
, vol.CMU-CS-03-122
-
-
Steffan, J.G.1
-
40
-
-
0003564841
-
Architectural support for thread-level data speculation
-
School of Computer Science, Carnegie Mellon University. November
-
STEFFAN, J. G., COLOHAN, C. B., AND MOWRY, T. C. 1997. Architectural Support for Thread-Level Data Speculation. Tech. Rep. CMU-CS-97-188, School of Computer Science, Carnegie Mellon University. November.
-
(1997)
Tech. Rep.
, vol.CMU-CS-97-188
-
-
Steffan, J.G.1
Colohan, C.B.2
Mowry, T.C.3
-
41
-
-
84949785579
-
Improving value communication for thread-level speculation
-
STEFFAN, J. G., COLOHAN, C. B., ZHAI, A., AND MOWRY, T. C. 2002. Improving value communication for thread-level speculation. In Proceedings of the 8th HPCA.
-
(2002)
Proceedings of the 8th HPCA
-
-
Steffan, J.G.1
Colohan, C.B.2
Zhai, A.3
Mowry, T.C.4
-
42
-
-
0033703889
-
A scalable approach to thread-level speculation
-
STEFFAN, J. G., COLOHAN, C. B., ZHAIA, A., AND MOWRY, T. C. 2000. A scalable approach to thread-level speculation. In Proceedings of ISCA 27.
-
(2000)
Proceedings of ISCA
, vol.27
-
-
Steffan, J.G.1
Colohan, C.B.2
Zhaia, A.3
Mowry, T.C.4
-
43
-
-
84976789033
-
-
Springer-Verlag, Berlin, Germany
-
TJIANG, S., WOLF, M., LAM, M., PIEPER, K., AND HENNESSY, J. 1992. Languages and Compilers for Parallel Computing. Springer-Verlag, Berlin, Germany, 137-151.
-
(1992)
Languages and Compilers for Parallel Computing
, pp. 137-151
-
-
Tjiang, S.1
Wolf, M.2
Lam, M.3
Pieper, K.4
Hennessy, J.5
-
44
-
-
0012619993
-
MAJC: Microprocessor architecture for Java computing
-
TREMBLAY, M. 1999. MAJC: Microprocessor Architecture for Java Computing. HotChips '99.
-
(1999)
HotChips '99
-
-
Tremblay, M.1
-
45
-
-
0029200683
-
Simultaneous multithreading: Maximizing on-chip parallelism
-
TULLSEN, D. M., EGGERS, S. J., AND LEVY, H. M. 1995. Simultaneous multithreading: Maximizing on-chip parallelism. In Proceedings of ISCA 22. 392-403.
-
(1995)
Proceedings of ISCA
, vol.22
, pp. 392-403
-
-
Tullsen, D.M.1
Eggers, S.J.2
Levy, H.M.3
-
47
-
-
0012583882
-
-
Ph.D. thesis, Computer Sciences Department, University of Wisconsin-Madison
-
VIJAYKUMAR, T. 1998. Compiling for the multiscalar architecture. Ph.D. thesis, Computer Sciences Department, University of Wisconsin-Madison.
-
(1998)
Compiling for the Multiscalar Architecture
-
-
Vijaykumar, T.1
-
48
-
-
0030129806
-
The MIPS R10000 superscalar microprocessor
-
YEAGER, K. 1996. The MIPS R10000 superscalar microprocessor. IEEE Micro.
-
(1996)
IEEE Micro
-
-
Yeager, K.1
-
49
-
-
0036957989
-
Compiler optimization of scalar value communication between speculative threads
-
ZHAIA, A., COLOHAN, C. B., STEFFAN, J. G., AND MOWRY, T. C. 2002. Compiler optimization of scalar value communication between speculative threads. In Proceedings of ASPLOS-X.
-
(2002)
Proceedings of ASPLOS-X
-
-
Zhaia, A.1
Colohan, C.B.2
Steffan, J.G.3
Mowry, T.C.4
-
50
-
-
3042567406
-
Compiler optimization of memory-resident value communication between speculative threads
-
ZHAIA, A., COLOHAN, C.B., STEFFAN, J. G., AND MOWRY, T. C. 2004. Compiler optimization of memory-resident value communication between speculative threads. In Proceedings of the International Symposium on Code Generation and Optimization.
-
(2004)
Proceedings of the International Symposium on Code Generation and Optimization
-
-
Zhaia, A.1
Colohan, C.B.2
Steffan, J.G.3
Mowry, T.C.4
|