-
3
-
-
34547414505
-
On the performance potential of different types of speculative thread-level parallelism
-
Cairns, Australia
-
A. Kejariwal, X. Tian, W. Li, M. Girkar, S. Kozhukhov, H. Saito, U. Banerjee, A. Nicolau, A. V. Veidenbaum, and C. D. Polychronopoulos. On the performance potential of different types of speculative thread-level parallelism. In Proceedings of the 20th ACM International Conference on Supercomputing, pages 24-35, Cairns, Australia, 2006.
-
(2006)
Proceedings of the 20th ACM International Conference on Supercomputing
, pp. 24-35
-
-
Kejariwal, A.1
Tian, X.2
Li, W.3
Girkar, M.4
Kozhukhov, S.5
Saito, H.6
Banerjee, U.7
Nicolau, A.8
Veidenbaum, A.V.9
Polychronopoulos, C.D.10
-
4
-
-
85067328792
-
-
SPEC CPU2006. http://www.spec.org/cpu2006.
-
(2006)
-
-
CPU, S.P.E.C.1
-
5
-
-
85060036181
-
Validity of the single processor approach to achieving large scale computing capabilities
-
G. M. Amdahl. Validity of the single processor approach to achieving large scale computing capabilities. In AFIPS Conference Proceedings, pages 483-485, 1967.
-
(1967)
AFIPS Conference Proceedings
, pp. 483-485
-
-
Amdahl, G.M.1
-
6
-
-
85067324496
-
-
Open Research Compiler for Itanium™ Processor Family
-
Open Research Compiler for Itanium™ Processor Family, http://ipf-orc.sourceforge.net/.
-
-
-
-
9
-
-
84976742360
-
A unified semantic approach for the vectorization and parallelization of generalized reductions
-
Crete, Greece, June
-
P. Jouvelot and B. Dehbonei. A unified semantic approach for the vectorization and parallelization of generalized reductions. In Proceedings of the 3rd ACM International Conference on Supercomputing, pages 186-194, Crete, Greece, June 1989.
-
(1989)
Proceedings of the 3rd ACM International Conference on Supercomputing
, pp. 186-194
-
-
Jouvelot, P.1
Dehbonei, B.2
-
10
-
-
35048873677
-
-
College Station, TX, October
-
D. J. Quinlan, M. Schordan, Q. Yi, and B. R. de Supinski. Semantic-driven parallelization of loops operating on user-defined containers, pages 524-538, College Station, TX, October 2003.
-
(2003)
Semantic-driven parallelization of loops operating on user-defined containers
, pp. 524-538
-
-
Quinlan, D.J.1
Schordan, M.2
Yi, Q.3
de Supinski, B.R.4
-
11
-
-
0027541302
-
Automatic program parallelization
-
February
-
U. Banerjee, R. Eigenmann, A. Nicolau, and D. Padua. Automatic program parallelization. Proceedings of the IEEE, 81(2):211-243, February 1993.
-
(1993)
Proceedings of the IEEE
, vol.81
, Issue.2
, pp. 211-243
-
-
Banerjee, U.1
Eigenmann, R.2
Nicolau, A.3
Padua, D.4
-
12
-
-
77950300305
-
ILP versus TLP on SMT
-
Portland, OR
-
N. Mitchell, L. Carter, J. Ferrante, and D. Tullsen. ILP versus TLP on SMT. In Proceedings of the 1999 ACM/IEEE Conference on Supercomputing, page 37, Portland, OR, 1999.
-
(1999)
Proceedings of the 1999 ACM/IEEE Conference on Supercomputing
, pp. 37
-
-
Mitchell, N.1
Carter, L.2
Ferrante, J.3
Tullsen, D.4
-
13
-
-
0003015894
-
Some scheduling techniques and an easily schedulable horizontal architecture for high performance scientific computing
-
Chatham, MA, December
-
B. R. Rau and C. D. Glaeser. Some scheduling techniques and an easily schedulable horizontal architecture for high performance scientific computing. In Proceedings of the 14th Annual Workshop on Microprogramming, pages 183-198, Chatham, MA, December 1981.
-
(1981)
Proceedings of the 14th annual workshop on Microprogramming
, pp. 183-198
-
-
Rau, B.R.1
Glaeser, C.D.2
-
16
-
-
3142719600
-
Perfect pipelining: A new loop parallelization technique
-
87-873, Dept. of Computer Science, Cornell University
-
A. Aiken and A. Nicolau. Perfect pipelining: A new loop parallelization technique. Technical Report 87-873, Dept. of Computer Science, Cornell University, 1987.
-
(1987)
Technical Report
-
-
Aiken, A.1
Nicolau, A.2
-
17
-
-
0033361788
-
In search of speculative thread-level parallelism
-
Newport Beach, CA, October
-
J. T. Oplinger, D. L. Heine, and M. S. Lam. In search of speculative thread-level parallelism. In Proceedings of the International Conference on Parallel Architectures and Compilation Techniques, pages 303-313, Newport Beach, CA, October 1999.
-
(1999)
Proceedings of the International Conference on Parallel Architectures and Compilation Techniques
, pp. 303-313
-
-
Oplinger, J.T.1
Heine, D.L.2
Lam, M.S.3
-
19
-
-
32844465384
-
Tasking with out-of-order spawn in TLS chip multiprocessors: Microarchitecture and compilation
-
Cambridge, MA
-
J. Renau, J. Tuck, W. Liu, L. Ceze, K. Strauss, and J. Torrellas. Tasking with out-of-order spawn in TLS chip multiprocessors: Microarchitecture and compilation. In Proceedings of the 19th ACM International Conference on Supercomputing, pages 179-188, Cambridge, MA, 2005.
-
(2005)
Proceedings of the 19th ACM International Conference on Supercomputing
, pp. 179-188
-
-
Renau, J.1
Tuck, J.2
Liu, W.3
Ceze, L.4
Strauss, K.5
Torrellas, J.6
-
20
-
-
0024664199
-
Run-time disambiguation: Coping with statically unpredictable dependencies
-
A. Nicolau. Run-time disambiguation: coping with statically unpredictable dependencies. IEEE Transactions on Computers, 38(5):633-678, 1989.
-
(1989)
IEEE Transactions on Computers
, vol.38
, Issue.5
, pp. 633-678
-
-
Nicolau, A.1
-
23
-
-
0038039851
-
A compiler framework for speculative analysis and optimizations
-
San Diego, CA
-
J. Lin, T. Chen, W.-C. Hsu, P.-C. Yew, R. D.-C. Ju, T.-F. Ngai, and S. Chan. A compiler framework for speculative analysis and optimizations. In Proceedings of the SIGPLAN '03 Conference on Programming Language Design and Implementation, pages 289-299, San Diego, CA, 2003.
-
(2003)
Proceedings of the SIGPLAN '03 Conference on Programming Language Design and Implementation
, pp. 289-299
-
-
Lin, J.1
Chen, T.2
Hsu, W.-C.3
Yew, P.-C.4
Ju, R.D.-C.5
Ngai, T.-F.6
Chan, S.7
-
24
-
-
33646849245
-
A general compiler framework for speculative optimizations using data speculative code motion
-
San Jose, CA
-
X. Dai, A. Zhai, W.-C. Hsu, and P.-C. Yew. A general compiler framework for speculative optimizations using data speculative code motion. In Proceedings of the International Symposium on Code Generation and Optimization, pages 280-290, San Jose, CA, 2005.
-
(2005)
Proceedings of the International Symposium on Code Generation and Optimization
, pp. 280-290
-
-
Dai, X.1
Zhai, A.2
Hsu, W.-C.3
Yew, P.-C.4
-
29
-
-
0022150790
-
Allocating independent subtasks on parallel processors
-
C. P. Kruskal and A. Weiss. Allocating independent subtasks on parallel processors. IEEE Transactions on Software Engineering, 11(10):1001-1016, 1985.
-
(1985)
IEEE Transactions on Software Engineering
, vol.11
, Issue.10
, pp. 1001-1016
-
-
Kruskal, C.P.1
Weiss, A.2
-
30
-
-
0004230378
-
-
Kluwer Academic Publishers, Boston, MA
-
U. Banerjee. Dependence Analysis. Kluwer Academic Publishers, Boston, MA, 1997.
-
(1997)
Dependence Analysis
-
-
Banerjee, U.1
-
31
-
-
0028016652
-
Redundant synchronization elimination for DOACROSS loops
-
Cancun, Mexico
-
D.-K. Chen and P.-C. Yew. Redundant synchronization elimination for DOACROSS loops. In Proceedings of the Eighth International Parallel Processing Symposium, pages 477-481, Cancun, Mexico, 1994.
-
(1994)
Proceedings of the Eighth International Parallel Processing Symposium
, pp. 477-481
-
-
Chen, D.-K.1
Yew, P.-C.2
-
34
-
-
1142268815
-
Recycling waste: Exploiting wrong-path execution to improve branch prediction
-
San Francisco, CA
-
H. Akkary, S. T. Srinivasan, and K. Lai. Recycling waste: exploiting wrong-path execution to improve branch prediction. In Proceedings of the 17th ACM International Conference on Supercomputing, pages 12-21, San Francisco, CA, 2003.
-
(2003)
Proceedings of the 17th ACM International Conference on Supercomputing
, pp. 12-21
-
-
Akkary, H.1
Srinivasan, S.T.2
Lai, K.3