-
1
-
-
0028513945
-
Automatic detection of parallelism: A grand challenge for high-performance computing
-
W. Blume, R. Eigenmann, J. Hoeflinger, D. Padua, P. Petersen, L. Rauchwerger, P. Tu, Automatic detection of parallelism: a grand challenge for high-performance computing, IEEE Parallel Distributed Technol. 2 (3) (1994) 37-47.
-
(1994)
IEEE Parallel Distributed Technol.
, vol.2
, Issue.3
, pp. 37-47
-
-
Blume, W.1
Eigenmann, R.2
Hoeflinger, J.3
Padua, D.4
Petersen, P.5
Rauchwerger, L.6
Tu, P.7
-
2
-
-
0022874874
-
Advanced compiler optimizations for supercomputers
-
D.A. Padua, M.J. Wolfe, Advanced compiler optimizations for supercomputers, Commun. ACM 29 (1986) 1184-1201.
-
(1986)
Commun. ACM
, vol.29
, pp. 1184-1201
-
-
Padua, D.A.1
Wolfe, M.J.2
-
4
-
-
51249162877
-
Compiler technology for machine-independent programming
-
K. Kennedy, Compiler technology for machine-independent programming, Int. J. Paral. Prog. 22 (1) (1994) 79-98.
-
(1994)
Int. J. Paral. Prog.
, vol.22
, Issue.1
, pp. 79-98
-
-
Kennedy, K.1
-
5
-
-
0004656909
-
Performance analysis of parallelizing compilers on the perfect Benchmarks™ programs
-
W. Blume, R. Eigenmann, Performance analysis of parallelizing compilers on the perfect Benchmarks™ programs, IEEE Trans. Parallel Distributed Syst. 3 (6) (1992) 643-656.
-
(1992)
IEEE Trans. Parallel Distributed Syst.
, vol.3
, Issue.6
, pp. 643-656
-
-
Blume, W.1
Eigenmann, R.2
-
6
-
-
0028405844
-
Massively parallel methods for engineering and science problems
-
W.J. Camp, S.J. Plimpton, B.A. Hendrickson, R.W. Leland, Massively parallel methods for engineering and science problems, Commun. ACM 37 (4) (1994) 31-41.
-
(1994)
Commun. ACM
, vol.37
, Issue.4
, pp. 31-41
-
-
Camp, W.J.1
Plimpton, S.J.2
Hendrickson, B.A.3
Leland, R.W.4
-
7
-
-
85013593108
-
Experience in the automatic parallelization of four perfect-benchmark programs
-
Proceedings of the Fourth Workshop on Languages and Compilers for Parallel Computing, Santa Clara, CA, August
-
R. Eigenmann, J. Hoeflinger, Z. Li, D. Padua, Experience in the automatic parallelization of four perfect-benchmark programs, Lecture Notes in Computer Science 589, in: Proceedings of the Fourth Workshop on Languages and Compilers for Parallel Computing, Santa Clara, CA, August 1991, pp. 65-83.
-
(1991)
Lecture Notes in Computer Science
, vol.589
, pp. 65-83
-
-
Eigenmann, R.1
Hoeflinger, J.2
Li, Z.3
Padua, D.4
-
9
-
-
85027612984
-
Dependence graphs and compiler optimizations
-
January
-
D.J. Kuck, R.H. Kuhn, D.A. Padua, B. Leasure, M. Wolfe, Dependence graphs and compiler optimizations, in: Proceedings of the 8th ACM Symposium on Principles of Programming Languages, January 1981, pp. 207-218.
-
(1981)
Proceedings of the 8th ACM Symposium on Principles of Programming Languages
, pp. 207-218
-
-
Kuck, D.J.1
Kuhn, R.H.2
Padua, D.A.3
Leasure, B.4
Wolfe, M.5
-
11
-
-
0346757637
-
Automatic generation of nested, fork-join parallelism
-
M. Burke, R. Cytron, J. Ferrante, W. Hsieh, Automatic generation of nested, fork-join parallelism, J. Supercomput. (1989) 71-88.
-
(1989)
J. Supercomput.
, pp. 71-88
-
-
Burke, M.1
Cytron, R.2
Ferrante, J.3
Hsieh, W.4
-
15
-
-
0347387849
-
Automatic array privatization
-
Portland, OR, August
-
P. Tu, D. Padua, Automatic array privatization, in: Proceedings 6th Annual Workshop on Languages and Compilers for Parallel Computing, Portland, OR, August 1993.
-
(1993)
Proceedings 6th Annual Workshop on Languages and Compilers for Parallel Computing
-
-
Tu, P.1
Padua, D.2
-
19
-
-
0023538229
-
Compiler algorithms for synchronization
-
S. Midkiff, D. Padua, Compiler algorithms for synchronization, IEEE Trans. Comput. C 36 (12) (1987) 1485-1495.
-
(1987)
IEEE Trans. Comput. C
, vol.36
, Issue.12
, pp. 1485-1495
-
-
Midkiff, S.1
Padua, D.2
-
20
-
-
0023362714
-
A scheme to enforce data dependence on large multiprocessor systems
-
C. Zhu, P.C. Yew, A scheme to enforce data dependence on large multiprocessor systems, IEEE Trans. Soft. Eng. 13 (6) (1987) 726-739.
-
(1987)
IEEE Trans. Soft. Eng.
, vol.13
, Issue.6
, pp. 726-739
-
-
Zhu, C.1
Yew, P.C.2
-
21
-
-
0029488237
-
A scalable method for run-time loop parallelization
-
L. Rauchwerger, N. Amato, D. Padua, A scalable method for run-time loop parallelization, IJPP 26 (6) (1995) 537-576.
-
(1995)
IJPP
, vol.26
, Issue.6
, pp. 537-576
-
-
Rauchwerger, L.1
Amato, N.2
Padua, D.3
-
22
-
-
0023577374
-
Multiple version loops
-
St. Charles, IL
-
M. Byler, J. Davies, C. Huson, B. Leasure, M. Wolfe, Multiple version loops, in: Proc. of 1987 Int'l. Conf. on Parallel Processing, St. Charles, IL, 1987.
-
(1987)
Proc. of 1987 Int'l. Conf. on Parallel Processing
-
-
Byler, M.1
Davies, J.2
Huson, C.3
Leasure, B.4
Wolfe, M.5
-
23
-
-
0028744946
-
An efficient algorithm for the run-time parallelization of doacross loops
-
Nov.
-
D.K. Chen, P.C. Yew, J. Torrellas, An efficient algorithm for the run-time parallelization of doacross loops, in: Proceedings of Supercomputing 1994, Nov. 1994, pp. 518-527.
-
(1994)
Proceedings of Supercomputing 1994
, pp. 518-527
-
-
Chen, D.K.1
Yew, P.C.2
Torrellas, J.3
-
25
-
-
0027829921
-
Improving the performance of runtime parallelization
-
May
-
S. Leung, J. Zahorjan, Improving the performance of runtime parallelization, in: 4th PPOPP, May 1993, pp. 83-91.
-
(1993)
4th PPOPP
, pp. 83-91
-
-
Leung, S.1
Zahorjan, J.2
-
26
-
-
0024054628
-
Compiler optimizations for enhancing parallelism and their impact on architecture design
-
C. Polychronopoulos, Compiler optimizations for enhancing parallelism and their impact on architecture design, IEEE Trans. Comput. C 37 (8) (1988) 991-1004.
-
(1988)
IEEE Trans. Comput. C
, vol.37
, Issue.8
, pp. 991-1004
-
-
Polychronopoulos, C.1
-
27
-
-
0042256184
-
The preprocessed doacross loop
-
H.D. Schwetman (Ed.), CRC Press
-
J. Saltz, R. Mirchandaney, The preprocessed doacross loop, in: H.D. Schwetman (Ed.), Proceedings of the 1991 International Conference on Parallel Processing, Vol. II - Software, CRC Press, 1991, pp. 174-178.
-
(1991)
Proceedings of the 1991 International Conference on Parallel Processing, Vol. II - Software
, vol.2
, pp. 174-178
-
-
Saltz, J.1
Mirchandaney, R.2
-
30
-
-
0342958310
-
The LRPD test: Speculative run-time parallelization of loops with privatization and reduction parallelization
-
Univ. of Illinois at Urbana-Champaign, Cntr. for Supercomputing Res. and Dev., November
-
L. Rauchwerger, D. Padua, The LRPD test: speculative run-time parallelization of loops with privatization and reduction parallelization, Technical Report 1390, Univ. of Illinois at Urbana-Champaign, Cntr. for Supercomputing Res. and Dev., November 1994.
-
(1994)
Technical Report
, vol.1390
-
-
Rauchwerger, L.1
Padua, D.2
-
32
-
-
84976742595
-
The doconsider loop
-
June
-
J. Saltz, R. Mirchandaney, K. Crowley, The doconsider loop, in: Proceedings of the 1989 International Conference on Supercomputing, June 1989, pp. 29-40.
-
(1989)
Proceedings of the 1989 International Conference on Supercomputing
, pp. 29-40
-
-
Saltz, J.1
Mirchandaney, R.2
Crowley, K.3
-
33
-
-
0343393674
-
Runtime compilation methods for multicomputers
-
H.D. Schwetman (Ed.), CRC Press
-
J. Wu, J. Saltz, S. Hiranandani, H. Berryman, Runtime compilation methods for multicomputers, in: H.D. Schwetman (Ed.), Proceedings of the 1991 International Conference on Parallel Processing, Vol. II - Software, CRC Press, 1991, pp. 26-30.
-
(1991)
Proceedings of the 1991 International Conference on Parallel Processing, Vol. II - Software
, vol.2
, pp. 26-30
-
-
Wu, J.1
Saltz, J.2
Hiranandani, S.3
Berryman, H.4
-
34
-
-
0029202238
-
Run-time methods for parallelizing partially parallel loops
-
Barcelona, Spain, July
-
L. Rauchwerger, N. Amato, D. Padua, Run-time methods for parallelizing partially parallel loops, in: Proceedings of the 1995 International Conference on Supercomputing, Barcelona, Spain, July 1995, pp. 137-146.
-
(1995)
Proceedings of the 1995 International Conference on Supercomputing
, pp. 137-146
-
-
Rauchwerger, L.1
Amato, N.2
Padua, D.3
-
35
-
-
0024664199
-
Run-time disambiguation: Coping with statically unpredictable dependencies
-
A. Nicolau, Run-time disambiguation: coping with statically unpredictable dependencies, IEEE Trans. Comput. 38 (5) (1989) 663-678.
-
(1989)
IEEE Trans. Comput.
, vol.38
, Issue.5
, pp. 663-678
-
-
Nicolau, A.1
-
36
-
-
0028313937
-
Speculative disambiguation: A compilation technique for dynamic memory disambiguation
-
Chicago, IL, April
-
A.S. Huang, G. Slavenburg, J.P. Shen, Speculative disambiguation: A compilation technique for dynamic memory disambiguation, in: Proceedings of the 21st Annual International Symposium on Computer Architecture, Chicago, IL, April 1994, pp. 200-210.
-
(1994)
Proceedings of the 21st Annual International Symposium on Computer Architecture
, pp. 200-210
-
-
Huang, A.S.1
Slavenburg, G.2
Shen, J.P.3
-
37
-
-
84976845370
-
Dynamic memory disambiguation using the memory conflict buffer
-
Chicago, IL, April
-
D.M. Gallagher, W.Y. Chen, S.A. Malke, J.G. Gyllenhaal, W.W. Hwu, Dynamic memory disambiguation using the memory conflict buffer, in: Proc. 21st Ann. Int'l. Symp. Computer Architecture, Chicago, IL, April 1994, pp. 183-195.
-
(1994)
Proc. 21st Ann. Int'l. Symp. Computer Architecture
, pp. 183-195
-
-
Gallagher, D.M.1
Chen, W.Y.2
Malke, S.A.3
Gyllenhaal, J.G.4
Hwu, W.W.5
-
38
-
-
84976823223
-
The LRPD test: Speculative run-time parallelization of loops with privatization and reduction parallelization
-
La Jolla, CA, June
-
L. Rauchwerger, D.A. Padua, The LRPD test: speculative run-time parallelization of loops with privatization and reduction parallelization, in: Proceedings of the SIGPLAN 1995 Conference on Programming Language Design and Implementation, La Jolla, CA, June 1995, pp. 218-232.
-
(1995)
Proceedings of the SIGPLAN 1995 Conference on Programming Language Design and Implementation
, pp. 218-232
-
-
Rauchwerger, L.1
Padua, D.A.2
-
41
-
-
0003477925
-
The PERFECT club benchmarks: Effective performance evaluation of supercomputers
-
Center for Supercomputing Research and Development, University of Illinois, Urbana, IL, May
-
M. Berry, D. Chen, P. Koss, D. Kuck, S. Lo, Y. Pang, R. Roloff, A. Sameh, E. Clementi, S. Chin, D. Schneider, G. Fox, P. Messina, D. Walker, C. Hsiung, J. Schwarzmeier, K. Lue, S. Orzag, F. Seidl, O. Johnson, G. Swanson, R. Goodrum, J. Martin, The PERFECT club benchmarks: Effective performance evaluation of supercomputers, Technical Report CSRD-827, Center for Supercomputing Research and Development, University of Illinois, Urbana, IL, May 1989.
-
(1989)
Technical Report CSRD-827
-
-
Berry, M.1
Chen, D.2
Koss, P.3
Kuck, D.4
Lo, S.5
Pang, Y.6
Roloff, R.7
Sameh, A.8
Clementi, E.9
Chin, S.10
Schneider, D.11
Fox, G.12
Messina, P.13
Walker, D.14
Hsiung, C.15
Schwarzmeier, J.16
Lue, K.17
Orzag, S.18
Seidl, F.19
Johnson, O.20
Swanson, G.21
Goodrum, R.22
Martin, J.23
more..
-
42
-
-
0023596512
-
Debugging fortran on a shared-memory machine
-
St. Charles, IL
-
T. Allen, D.A. Padua, Debugging fortran on a shared-memory machine, in: Proceedings of the 1987 International Conference on Parallel Processing, St. Charles, IL, 1987, pp. 721-727.
-
(1987)
Proceedings of the 1987 International Conference on Parallel Processing
, pp. 721-727
-
-
Allen, T.1
Padua, D.A.2
-
43
-
-
0026745444
-
Detecting nondeterminacy in parallel programs
-
January
-
P.A. Emrath, S. Ghosh, D.A. Padua, Detecting nondeterminacy in parallel programs, IEEE Soft., January 1992,
-
(1992)
IEEE Soft.
-
-
Emrath, P.A.1
Ghosh, S.2
Padua, D.A.3
-
47
-
-
0026274708
-
On-the-fly detection of data races for programs with nested fork-join parallelism
-
Albuquerque, NM, Nov.
-
J. Mellor-Crummey, On-the-fly detection of data races for programs with nested fork-join parallelism, in: Proceedings of Supercomputing 1991, Albuquerque, NM, Nov. 1991, pp. 24-33.
-
(1991)
Proceedings of Supercomputing 1991
, pp. 24-33
-
-
Mellor-Crummey, J.1
-
48
-
-
84976797669
-
Compile-time support for efficient data race detection in shared-memory parallel programs
-
San Diego, CA, May
-
J. Mellor-Crummey, Compile-time support for efficient data race detection in shared-memory parallel programs, in: Proc. of the ACM/ONR Workshop on Parallel and Distributed Debugging, San Diego, CA, May 1993, pp. 129-139.
-
(1993)
Proc. of the ACM/ONR Workshop on Parallel and Distributed Debugging
, pp. 129-139
-
-
Mellor-Crummey, J.1
|