-
1
-
-
0031623811
-
Using integer sets for data-parallel analysis and optimization
-
Montreal, Canada
-
V. Adve, J. Mellor-Crummey, Using integer sets for data-parallel analysis and optimization, in: ACM SIGPLAN Conference on Programming Language Design and Implementation, Montreal, Canada, 1998, pp. 186-198.
-
(1998)
ACM SIGPLAN Conference on Programming Language Design and Implementation
, pp. 186-198
-
-
Adve, V.1
Mellor-Crummey, J.2
-
2
-
-
0029373981
-
Automatic partitioning of parallel loops and data arrays for distributed shared-memory multiprocessors
-
A. Agarwal, D. Kranz, V. Natarajan, Automatic partitioning of parallel loops and data arrays for distributed shared-memory multiprocessors, IEEE Transactions on Parallel and Distributed Systems 6 (9) (1995) 943-962.
-
(1995)
IEEE Transactions on Parallel and Distributed Systems
, vol.6
, Issue.9
, pp. 943-962
-
-
Agarwal, A.1
Kranz, D.2
Natarajan, V.3
-
3
-
-
0032114494
-
Interprocedural partial redundancy elimination with application to distributed memory compilation
-
G. Agrawal, Interprocedural partial redundancy elimination with application to distributed memory compilation, IEEE Transactions on Parallel and Distributed Systems 9 (7) (1998) 609-625.
-
(1998)
IEEE Transactions on Parallel and Distributed Systems
, vol.9
, Issue.7
, pp. 609-625
-
-
Agrawal, G.1
-
4
-
-
0004072686
-
-
Addison-Wesley, Reading, MA
-
A. Aho, R. Sethi, J. Ullman, Compilers: Principles, Techniques and Tools, Addison-Wesley, Reading, MA, 1988.
-
(1988)
Compilers: Principles, Techniques and Tools
-
-
Aho, A.1
Sethi, R.2
Ullman, J.3
-
7
-
-
0342363542
-
Vector register allocation
-
Rice University, Houston, TX, December
-
R. Allen, K. Kennedy, Vector register allocation, Technical Report TR86-45, Rice University, Houston, TX, December 1986.
-
(1986)
Technical Report TR86-45
-
-
Allen, R.1
Kennedy, K.2
-
8
-
-
0012777446
-
-
Ph.D. Thesis, McGill University, Montreal, Quebec
-
E.R. Altman, Optimal software pipelining with functional unit and register constraints, Ph.D. Thesis, McGill University, Montreal, Quebec, 1995.
-
(1995)
Optimal Software Pipelining with Functional Unit and Register Constraints
-
-
Altman, E.R.1
-
10
-
-
0027802136
-
Communication optimization and code generation for distributed memory machines
-
Albuquerque, New Mexico, June
-
S.P. Amarasinghe, M.S. Lam, Communication optimization and code generation for distributed memory machines, in: Proceedings ACM SIGPLAN'93 Conference on Programming Language Design and Implementation, Albuquerque, New Mexico, June 1993.
-
(1993)
Proceedings ACM SIGPLAN'93 Conference on Programming Language Design and Implementation
-
-
Amarasinghe, S.P.1
Lam, M.S.2
-
11
-
-
84976766536
-
Scanning polyhedra with do loops
-
Williamsburg, VA, April
-
C. Ancourt, F. Irigoin, Scanning polyhedra with do loops, in: Proceedings of the Third ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, Williamsburg, VA, April 1991.
-
(1991)
Proceedings of the Third ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming
-
-
Ancourt, C.1
Irigoin, F.2
-
12
-
-
0031104380
-
A linear algebra framework for static HPF code distribution
-
C. Ancourt, F. Coelho, F. Irigoin, R. Keryell, A linear algebra framework for static HPF code distribution, Scientific Programming 6 (1) (1997) 3-28.
-
(1997)
Scientific Programming
, vol.6
, Issue.1
, pp. 3-28
-
-
Ancourt, C.1
Coelho, F.2
Irigoin, F.3
Keryell, R.4
-
14
-
-
0029710317
-
Using register-transfer paths in code generation for heterogeneous memory-register architectures
-
G. Araujo, S. Malik, M. Lee, Using register-transfer paths in code generation for heterogeneous memory-register architectures, in: Proceedings of the 33rd ACM/IEEE Design Automation Conference, 1996, pp. 591-596.
-
(1996)
Proceedings of the 33rd ACM/IEEE Design Automation Conference
, pp. 591-596
-
-
Araujo, G.1
Malik, S.2
Lee, M.3
-
15
-
-
85008031722
-
Instruction set design and optimization for address computation in DSP architectures
-
G. Araujo, A. Sudarsanam, S. Malik, Instruction set design and optimization for address computation in DSP architectures, in: Proceedings of the Ninth International Symposium on System Synthesis, 1997, pp. 31-37.
-
(1997)
Proceedings of the Ninth International Symposium on System Synthesis
, pp. 31-37
-
-
Araujo, G.1
Sudarsanam, A.2
Malik, S.3
-
16
-
-
0003487052
-
-
Addison-Wesley, Reading, MA
-
K. Arnold, J. Gosling, The Java Programming Language, Addison-Wesley, Reading, MA, 1996.
-
(1996)
The Java Programming Language
-
-
Arnold, K.1
Gosling, J.2
-
17
-
-
0031359056
-
A framework for balancing control flow and predication
-
Research Triangle Park, North Carolina, December
-
D.I. August, W.W. Hwu, S.A. Mahlke, A framework for balancing control flow and predication, in: Proceedings of the 30th Annual International Symposium on Microarchitecture, Research Triangle Park, North Carolina, December 1997, pp. 92-103.
-
(1997)
Proceedings of the 30th Annual International Symposium on Microarchitecture
, pp. 92-103
-
-
August, D.I.1
Hwu, W.W.2
Mahlke, S.A.3
-
20
-
-
0029394470
-
The PARADIGM compiler for distributed-memory multicomputers
-
P. Banerjee, J.A. Chandy, M. Gupta, E.W. Hodges IV, J.G. Holm, A. Lain, D.J. Palermo, S. Ramaswamy, E. Su, The PARADIGM compiler for distributed-memory multicomputers, IEEE Computer 28 (10) (1995) 37-47.
-
(1995)
IEEE Computer
, vol.28
, Issue.10
, pp. 37-47
-
-
Banerjee, P.1
Chandy, J.A.2
Gupta, M.3
Hodges E.W. IV4
Holm, J.G.5
Lain, A.6
Palermo, D.J.7
Ramaswamy, S.8
Su, E.9
-
21
-
-
0026817662
-
Optimizing stack frame accesses for processors with restricted addressing modes
-
D. Bartley, Optimizing stack frame accesses for processors with restricted addressing modes, Software Practice and Experience 22 (2) (1992) 101-110.
-
(1992)
Software Practice and Experience
, vol.22
, Issue.2
, pp. 101-110
-
-
Bartley, D.1
-
22
-
-
84947776744
-
Solving alignment using elementary linear algebra
-
Proceedings of the Seventh Workshop on Languages and Compilers for Parallel Computing, Ithaca, NY, Springer, Berlin
-
D. Bau, I. Kodukula, V. Kotlyar, K. Pingali, P. Stodghill, Solving alignment using elementary linear algebra, in: Proceedings of the Seventh Workshop on Languages and Compilers for Parallel Computing, Lecture Notes in Computer Science, vol. 892, Ithaca, NY, 1994, Springer, Berlin, 1995, pp. 46-60.
-
(1994)
Lecture Notes in Computer Science
, vol.892
, pp. 46-60
-
-
Bau, D.1
Kodukula, I.2
Kotlyar, V.3
Pingali, K.4
Stodghill, P.5
-
23
-
-
0027001568
-
Vienna Fortran 90
-
Williamsburg, VA, April
-
S. Benkner, B. Chapman, H. Zima, Vienna Fortran 90, in: Proceedings of the 1992 Scalable High Performance Computing Conference, Williamsburg, VA, April 1992.
-
(1992)
Proceedings of the 1992 Scalable High Performance Computing Conference
-
-
Benkner, S.1
Chapman, B.2
Zima, H.3
-
25
-
-
0028594328
-
Resource spackling: A framework for integrating register allocation in local and global schedulers
-
D.A. Berson, R. Gupta, M.L. Soffa, Resource spackling: a framework for integrating register allocation in local and global schedulers, in: Proceedings of IFIP WG 10.3 Working Conference on Parallel Architectures and Compilation Techniques, 1994, pp. 135-146.
-
(1994)
Proceedings of IFIP WG 10.3 Working Conference on Parallel Architectures and Compilation Techniques
, pp. 135-146
-
-
Berson, D.A.1
Gupta, R.2
Soffa, M.L.3
-
26
-
-
0028583166
-
Automatic data layout using 0-1 integer programming
-
Montréal, Canada, August
-
R. Bixby, K. Kennedy, U. Kremer, Automatic data layout using 0-1 integer programming, in: Proceedings of the 1994 International Conference on Parallel Architectures and Compilation Techniques, Montréal, Canada, August 1994, pp. 111-122.
-
(1994)
Proceedings of the 1994 International Conference on Parallel Architectures and Compilation Techniques
, pp. 111-122
-
-
Bixby, R.1
Kennedy, K.2
Kremer, U.3
-
27
-
-
0031679132
-
Escape analysis: Correctness, proof, implementation and experimental results
-
San Diego, CA, January
-
B. Blanchet, Escape analysis: correctness, proof, implementation and experimental results, in: Proceedings of the 25th Annual ACM Symposium on Principles of Programming Languages, San Diego, CA, January 1998, pp. 25-37.
-
(1998)
Proceedings of the 25th Annual ACM Symposium on Principles of Programming Languages
, pp. 25-37
-
-
Blanchet, B.1
-
29
-
-
0348137596
-
Complete removal of redundant expressions
-
Montreal, Canada, June
-
R. Bodik, R. Gupta, M.L. Soffa, Complete removal of redundant expressions, in: Proceedings of the ACM SIGPLAN Conference on Programming Language Design and Implementation, Montreal, Canada, June 1998, pp. 1-14.
-
(1998)
Proceedings of the ACM SIGPLAN Conference on Programming Language Design and Implementation
, pp. 1-14
-
-
Bodik, R.1
Gupta, R.2
Soffa, M.L.3
-
30
-
-
0030645017
-
Partial dead code elimination using slicing transformations
-
Las Vegas, Nevada, June
-
R. Bodik, R. Gupta, Partial dead code elimination using slicing transformations, in: Proceedings of the ACM SIGPLAN Conference on Programming Language Design and Implementation, Las Vegas, Nevada, June 1997, pp. 159-170.
-
(1997)
Proceedings of the ACM SIGPLAN Conference on Programming Language Design and Implementation
, pp. 159-170
-
-
Bodik, R.1
Gupta, R.2
-
33
-
-
17144418556
-
Global communication analysis and optimization
-
Philadelphia, PA, May
-
S. Chakrabarti, M. Gupta, J.-D. Choi, Global communication analysis and optimization, in: Proceedings ACM SIGPLAN Conference on Programming Language Design and Implementation, Philadelphia, PA, May 1996.
-
(1996)
Proceedings ACM SIGPLAN Conference on Programming Language Design and Implementation
-
-
Chakrabarti, S.1
Gupta, M.2
Choi, J.-D.3
-
34
-
-
0011611816
-
CC++: A declarative concurrent object-oriented programming notation
-
MIT Press, Cambridge, MA
-
K.M. Chandy, C. Kesselman, CC++: a declarative concurrent object-oriented programming notation, in: Research Directions in Concurrent Object Oriented Programming, MIT Press, Cambridge, MA, 1993.
-
(1993)
Research Directions in Concurrent Object Oriented Programming
-
-
Chandy, K.M.1
Kesselman, C.2
-
35
-
-
0343668689
-
Automatic support for data distribution on distributed memory multiprocessor systems
-
Proceedings of the Sixth Workshop on Languages and Compilers for Parallel Computing, Portland, OR, Aug. Springer, Berlin
-
B. Chapman, T. Fahringer, H. Zima, Automatic support for data distribution on distributed memory multiprocessor systems, in: Proceedings of the Sixth Workshop on Languages and Compilers for Parallel Computing, Lecture Notes in Computer Science, vol. 768, Portland, OR, Aug. 1993, Springer, Berlin, 1994, pp. 184-199.
-
(1993)
Lecture Notes in Computer Science
, vol.768
, pp. 184-199
-
-
Chapman, B.1
Fahringer, T.2
Zima, H.3
-
36
-
-
0343668692
-
The alignment-distribution graph
-
Languages and Compilers for Parallel Computing, Sixth International Workshop, Springer, Berlin
-
S. Chatterjee, J. Gilbert, R. Schreiber, The alignment-distribution graph, in: Languages and Compilers for Parallel Computing, Sixth International Workshop, Lecture Notes in Computer Science, vol. 768, Springer, Berlin, 1993.
-
(1993)
Lecture Notes in Computer Science
, vol.768
-
-
Chatterjee, S.1
Gilbert, J.2
Schreiber, R.3
-
37
-
-
0002742410
-
Generating local addresses and communication sets for data parallel programs
-
S. Chatterjee, J. Gilbert, F. Long, R. Schreiber, S. Teng, Generating local addresses and communication sets for data parallel programs, Journal of Parallel and Distributed Computing 26 (1) (1995) 72-84.
-
(1995)
Journal of Parallel and Distributed Computing
, vol.26
, Issue.1
, pp. 72-84
-
-
Chatterjee, S.1
Gilbert, J.2
Long, F.3
Schreiber, R.4
Teng, S.5
-
38
-
-
0028499023
-
Communication-free data allocation techniques for parallelizing compilers on multicomputers
-
T.S. Chen, J.P. Sheu, Communication-free data allocation techniques for parallelizing compilers on multicomputers, IEEE Transactions on Parallel and Distributed Systems 5 (9) (1994) 924-938.
-
(1994)
IEEE Transactions on Parallel and Distributed Systems
, vol.5
, Issue.9
, pp. 924-938
-
-
Chen, T.S.1
Sheu, J.P.2
-
39
-
-
0031594025
-
Memory dependence prediction using store sets
-
Barcelona, Spain, July
-
G. Chrysos, J. Emer, Memory dependence prediction using store sets, in: Proceedings of the ACM/IEEE 25th International Symposium on Computer Architecture, Barcelona, Spain, July 1998, pp. 142-154.
-
(1998)
Proceedings of the ACM/IEEE 25th International Symposium on Computer Architecture
, pp. 142-154
-
-
Chrysos, G.1
Emer, J.2
-
40
-
-
0343668688
-
Evolutionary compilation to long instruction superscalar microarchitectures for exploiting parallelism at all levels
-
T.M. Conte, Evolutionary compilation to long instruction superscalar microarchitectures for exploiting parallelism at all levels, in: ASPLOS Wild and Crazy Idea Session, 1998.
-
(1998)
ASPLOS Wild and Crazy Idea Session
-
-
Conte, T.M.1
-
42
-
-
0022793229
-
The impact of interprocedural analysis and optimization in the Rn programming environment
-
K.D. Cooper, K. Kennedy, L. Torczon, The impact of interprocedural analysis and optimization in the Rn programming environment, ACM Transactions on Programming Languages and Systems 8 (4) (1986) 491-523.
-
(1986)
ACM Transactions on Programming Languages and Systems
, vol.8
, Issue.4
, pp. 491-523
-
-
Cooper, K.D.1
Kennedy, K.2
Torczon, L.3
-
43
-
-
84958956033
-
Non-local instruction scheduling with limited code growth
-
K. Cooper, P. Schielke, Non-local instruction scheduling with limited code growth, in: Proceedings of Languages, Compilers and Tools for Embedded Systems, 1998, pp. 193-207.
-
(1998)
Proceedings of Languages, Compilers and Tools for Embedded Systems
, pp. 193-207
-
-
Cooper, K.1
Schielke, P.2
-
44
-
-
0027710762
-
Parallel programming in Split-C
-
D. Culler, A. Dusseau, S. Goldstein, A. Krishnamurthy, S. Lumetta, T. von Eicken, K. Yelick, Parallel programming in Split-C, in: Proceedings of Supercomputing'93, 1993.
-
(1993)
Proceedings of Supercomputing'93
-
-
Culler, D.1
Dusseau, A.2
Goldstein, S.3
Krishnamurthy, A.4
Lumetta, S.5
Von Eicken, T.6
Yelick, K.7
-
46
-
-
0022920260
-
The KAP/S-1: An advanced source-to-source vectorizer for the S-1 Mark IIa Supercomputer
-
St. Charles, Illinois, August
-
J. Davies, C. Huson, T. Macke, B. Leasure, M. Wolfe, The KAP/S-1: an advanced source-to-source vectorizer for the S-1 Mark IIa Supercomputer, in: Proceedings of the 1986 International Conference on Parallel Processing, St. Charles, Illinois, August 1986, pp. 833-835.
-
(1986)
Proceedings of the 1986 International Conference on Parallel Processing
, pp. 833-835
-
-
Davies, J.1
Huson, C.2
Macke, T.3
Leasure, B.4
Wolfe, M.5
-
48
-
-
0032123777
-
The IA-64 architecture at work
-
C. Dulong, The IA-64 architecture at work, IEEE Computer (1998) 24-32.
-
(1998)
IEEE Computer
, pp. 24-32
-
-
Dulong, C.1
-
49
-
-
0030645966
-
DAISY: Dynamic compilation for 100% architectural compatibility
-
Denver, Colorado
-
K. Ebcioglu, E. Altman, DAISY: dynamic compilation for 100% architectural compatibility, in: Proceedings of the International Symposium on Computer Architecture, Denver, Colorado, 1997, pp. 26-37.
-
(1997)
Proceedings of the International Symposium on Computer Architecture
, pp. 26-37
-
-
Ebcioglu, K.1
Altman, E.2
-
50
-
-
38149020846
-
A report on Sisal language project
-
J.T. Feo, D.C. Cann, R.R. Oldehoeft, A report on Sisal language project, Journal of Parallel and Distributed Computing 10 (4) (1990) 349-366.
-
(1990)
Journal of Parallel and Distributed Computing
, vol.10
, Issue.4
, pp. 349-366
-
-
Feo, J.T.1
Cann, D.C.2
Oldehoeft, R.R.3
-
51
-
-
0023385308
-
The program dependence graph and its use in optimization
-
J. Ferrante, K. Ottenstein, J. Warren, The program dependence graph and its use in optimization, ACM Transactions on Programming Languages and Systems 9 (3) (1987) 319-349.
-
(1987)
ACM Transactions on Programming Languages and Systems
, vol.9
, Issue.3
, pp. 319-349
-
-
Ferrante, J.1
Ottenstein, K.2
Warren, J.3
-
52
-
-
0019596071
-
Trace scheduling: A technique for global microcode compaction
-
J.A. Fisher, Trace scheduling: a technique for global microcode compaction, IEEE Transactions on Computers 30 (7) (1981) 478-490.
-
(1981)
IEEE Transactions on Computers
, vol.30
, Issue.7
, pp. 478-490
-
-
Fisher, J.A.1
-
53
-
-
0031237815
-
Walk-time techniques: Catalyst for architectural change
-
J.A. Fisher, Walk-time techniques: catalyst for architectural change, IEEE Computer 30 (9) (1997) 40-42.
-
(1997)
IEEE Computer
, vol.30
, Issue.9
, pp. 40-42
-
-
Fisher, J.A.1
-
54
-
-
0028461905
-
Avoidance and suppression of compensation code in a trace scheduling compiler
-
S. Freudenberger, T. Gross, P.G. Lowney, Avoidance and suppression of compensation code in a trace scheduling compiler, ACM Transactions on Programming Languages and Systems 16 (4) (1994) 1156-1214.
-
(1994)
ACM Transactions on Programming Languages and Systems
, vol.16
, Issue.4
, pp. 1156-1214
-
-
Freudenberger, S.1
Gross, T.2
Lowney, P.G.3
-
55
-
-
0032312214
-
Putting the fill unit to work: Dynamic optimizations for trace cache microprocessors
-
D.H. Friendly, S.J. Patel, Y.N. Patt, Putting the fill unit to work: dynamic optimizations for trace cache microprocessors, in: Proceedings of the 31st Annual ACM/IEEE Symposium on Microarchitecture, 1998, pp. 173-181.
-
(1998)
Proceedings of the 31st Annual ACM/IEEE Symposium on Microarchitecture
, pp. 173-181
-
-
Friendly, D.H.1
Patel, S.J.2
Patt, Y.N.3
-
56
-
-
0031611718
-
Value speculation scheduling for high performance processors
-
C. Fu, M.D. Jennings, S.Y. Larin, T.M. Conte, Value speculation scheduling for high performance processors, in: Proceedings of the International Conference on Architectural Support for Programming Languages and Operating Systems, 1998, pp. 262-271.
-
(1998)
Proceedings of the International Conference on Architectural Support for Programming Languages and Operating Systems
, pp. 262-271
-
-
Fu, C.1
Jennings, M.D.2
Larin, S.Y.3
Conte, T.M.4
-
57
-
-
0343668683
-
The use of BLAS3 in linear algebra on a parallel processor with a hierarchical memory
-
Center for Supercomputing Res. and Dev., University of Illinois, October
-
K. Gallivan, W. Jalby, U. Meier, The use of BLAS3 in linear algebra on a parallel processor with a hierarchical memory, Technical Report CSRD Rpt. No. 610, Center for Supercomputing Res. and Dev., University of Illinois, October 1986.
-
(1986)
Technical Report CSRD Rpt. No. 610
, vol.610
-
-
Gallivan, K.1
Jalby, W.2
Meier, U.3
-
58
-
-
84957095558
-
A novel approach towards automatic data distribution
-
Houston, TX, April
-
J. Garcia, E. Ayguadé, J. Labarta, A novel approach towards automatic data distribution, in: Proceedings of the Workshop on Automatic Data Layout and Performance Prediction, Houston, TX, April 1995.
-
(1995)
Proceedings of the Workshop on Automatic Data Layout and Performance Prediction
-
-
Garcia, J.1
Ayguadé, E.2
Labarta, J.3
-
59
-
-
0342798380
-
SUPERB: Experiences and future research
-
North-Holland, Amsterdam, The Netherlands
-
M. Gerndt, H. Zima, SUPERB: Experiences and future research, in: Proceedings of the Workshop on Languages, Compilers, and Run-Time Environments for Distributed Memory Machines, North-Holland, Amsterdam, The Netherlands, 1992.
-
(1992)
Proceedings of the Workshop on Languages, Compilers, and Run-Time Environments for Distributed Memory Machines
-
-
Gerndt, M.1
Zima, H.2
-
61
-
-
84976790479
-
Practical dependence testing
-
Toronto, Canada
-
G. Goff, K. Kennedy, C.W. Tseng, Practical dependence testing, in: Proceedings of the SIGPLAN'91 Conference on Programming Language Design and Implementation, Toronto, Canada, 1991.
-
(1991)
Proceedings of the SIGPLAN'91 Conference on Programming Language Design and Implementation
-
-
Goff, G.1
Kennedy, K.2
Tseng, C.W.3
-
62
-
-
84947738468
-
Compilation techniques for optimizing communication in distributed-memory systems
-
St. Charles, IL, August
-
C. Gong, R. Gupta, R. Melhem, Compilation techniques for optimizing communication in distributed-memory systems, in: Proceedings 1993 International Conference on Parallel Processing, St. Charles, IL, August 1993.
-
(1993)
Proceedings 1993 International Conference on Parallel Processing
-
-
Gong, C.1
Gupta, R.2
Melhem, R.3
-
65
-
-
0030075551
-
On compiling array expressions for efficient execution on distributed-memory machines
-
S.K.S. Gupta, S.D. Kaushik, C.-H. Huang, P. Sadayappan, On compiling array expressions for efficient execution on distributed-memory machines, Journal of Parallel and Distributed Computing 32 (2) (1996) 155-172.
-
(1996)
Journal of Parallel and Distributed Computing
, vol.32
, Issue.2
, pp. 155-172
-
-
Gupta, S.K.S.1
Kaushik, S.D.2
Huang, C.-H.3
Sadayappan, P.4
-
66
-
-
0025413768
-
Region scheduling: An approach for detecting and redistributing parallelism
-
R. Gupta, M.L. Soffa, Region scheduling: an approach for detecting and redistributing parallelism, IEEE Transactions on Software Engineering 16 (4) (1990) 421-431.
-
(1990)
IEEE Transactions on Software Engineering
, vol.16
, Issue.4
, pp. 421-431
-
-
Gupta, R.1
Soffa, M.L.2
-
68
-
-
84949185314
-
Register pressure sensitive redundancy elimination
-
Proceedings of the International Conference on Compiler Construction, Springer, Amsterdam, Netherlands
-
R. Gupta, R. Bodik, Register pressure sensitive redundancy elimination, in: Proceedings of the International Conference on Compiler Construction, Lecture Notes in Computer Science, vol. 1575, Springer, Amsterdam, Netherlands, pp. 107-121.
-
Lecture Notes in Computer Science
, vol.1575
, pp. 107-121
-
-
Gupta, R.1
Bodik, R.2
-
69
-
-
0031372826
-
Resource-sensitive profile-directed data flow analysis for code optimization
-
Research Triangle Park, North Carolina
-
R. Gupta, D. Berson, J.Z. Fang, Resource-sensitive profile-directed data flow analysis for code optimization, in: Proceedings of the 30th Annual IEEE/ACM International Symposium on Microarchitecture, Research Triangle Park, North Carolina, 1997, pp. 358-368.
-
(1997)
Proceedings of the 30th Annual IEEE/ACM International Symposium on Microarchitecture
, pp. 358-368
-
-
Gupta, R.1
Berson, D.2
Fang, J.Z.3
-
70
-
-
0031627691
-
Path profile guided partial redundancy elimination using speculation
-
Chicago, Illinois
-
R. Gupta, D. Berson, J.Z. Fang, Path profile guided partial redundancy elimination using speculation, in: Proceedings of the IEEE International Conference on Computer Languages, Chicago, Illinois, 1998, pp. 230-239.
-
(1998)
Proceedings of the IEEE International Conference on Computer Languages
, pp. 230-239
-
-
Gupta, R.1
Berson, D.2
Fang, J.Z.3
-
71
-
-
0031372538
-
Path profile guided partial dead code elimination using predication
-
San Francisco, California
-
R. Gupta, D. Berson, J.Z. Fang, Path profile guided partial dead code elimination using predication, in: Proceedings of the International Conference on Parallel Architectures and Compilation Techniques, San Francisco, California, 1997, pp. 102-115.
-
(1997)
Proceedings of the International Conference on Parallel Architectures and Compilation Techniques
, pp. 102-115
-
-
Gupta, R.1
Berson, D.2
Fang, J.Z.3
-
74
-
-
84976702373
-
Give-n-take - A balanced code placement framework
-
Orlando, Florida, June
-
R.v. Hanxleden, K. Kennedy, Give-n-take - a balanced code placement framework, in: Proceedings of the ACM SIGPLAN '94 Conference on Programming Language Design and Implementation, Orlando, Florida, June 1994.
-
(1994)
Proceedings of the ACM SIGPLAN '94 Conference on Programming Language Design and Implementation
-
-
Hanxleden, R.V.1
Kennedy, K.2
-
75
-
-
0043058118
-
Compiler analysis for irregular problems in Fortran D
-
New Haven, CT, August
-
R.v. Hanxleden, K. Kennedy, C. Koelbel, R. Das, J. Saltz, Compiler analysis for irregular problems in Fortran D, in: Proceedings Fifth Workshop on Languages and Compilers for Parallel Computing, New Haven, CT, August 1992.
-
(1992)
Proceedings Fifth Workshop on Languages and Compilers for Parallel Computing
-
-
Hanxleden, R.V.1
Kennedy, K.2
Koelbel, C.3
Das, R.4
Saltz, J.5
-
76
-
-
0003609682
-
-
The MIT Press, Cambridge, MA
-
P. Hatcher, M. Quinn, Data-Parallel Programming on MIMD Computers, The MIT Press, Cambridge, MA, 1991.
-
(1991)
Data-Parallel Programming on MIMD Computers
-
-
Hatcher, P.1
Quinn, M.2
-
77
-
-
0004302191
-
-
Morgan Kaufmann, Los Altos, CA
-
J.L. Hennessy, D.A. Patterson, Computer Architecture: A Quantitative Approach, Morgan Kaufmann, Los Altos, CA, 1990.
-
(1990)
Computer Architecture: A Quantitative Approach
-
-
Hennessy, J.L.1
Patterson, D.A.2
-
78
-
-
0003565855
-
High Performance Fortran Forum, High Performance Fortran language specification, version 2.0
-
Center for Research on Parallel Computation, Rice University, Houston, TX, January
-
High Performance Fortran Forum, High Performance Fortran language specification, version 2.0. Technical Report CRPC-TR92225, Center for Research on Parallel Computation, Rice University, Houston, TX, January 1997.
-
(1997)
Technical Report CRPC-TR92225
-
-
-
79
-
-
84976706957
-
Interprocedural compilation of Fortran D for MIMD distributed-memory machines
-
Minneapolis, MN
-
M.W. Hall, S. Hiranandani, K. Kennedy, C. Tseng, Interprocedural compilation of Fortran D for MIMD distributed-memory machines, in: Proceedings of Supercomputing'92, Minneapolis, MN, 1992, pp. 522-534.
-
(1992)
Proceedings of Supercomputing'92
, pp. 522-534
-
-
Hall, M.W.1
Hiranandani, S.2
Kennedy, K.3
Tseng, C.4
-
80
-
-
84976813879
-
Compiling Fortran D for MIMD distributed-memory machines
-
S. Hiranandani, K. Kennedy, C.W. Tseng, Compiling Fortran D for MIMD distributed-memory machines, Communications of the ACM 35 (8) (1992) 66-80.
-
(1992)
Communications of the ACM
, vol.35
, Issue.8
, pp. 66-80
-
-
Hiranandani, S.1
Kennedy, K.2
Tseng, C.W.3
-
81
-
-
0027711187
-
Preliminary experiences with the Fortran D compiler
-
Portland, OR, November
-
S. Hiranandani, K. Kennedy, C.-W. Tseng, Preliminary experiences with the Fortran D compiler, in: Proceedings of Supercomputing'93, Portland, OR, November 1993.
-
(1993)
Proceedings of Supercomputing'93
-
-
Hiranandani, S.1
Kennedy, K.2
Tseng, C.-W.3
-
84
-
-
0342798375
-
Compiling parallel loops for high performance computers -partitioning
-
Kluwer Academic Publishers, Boston, MA
-
D.E. Hudak, S.G. Abraham, Compiling parallel loops for high performance computers -partitioning, in: Data Assignment and Remapping, Kluwer Academic Publishers, Boston, MA, 1993.
-
(1993)
Data Assignment and Remapping
-
-
Hudak, D.E.1
Abraham, S.G.2
-
85
-
-
0342363503
-
Technology outlook: Introduction to predicated execution
-
W.-M. Hwu, Technology outlook: introduction to predicated execution, IEEE Computer 31 (1) (1998) 49-50.
-
(1998)
IEEE Computer
, vol.31
, Issue.1
, pp. 49-50
-
-
Hwu, W.-M.1
-
86
-
-
0027595384
-
The superblock: An effective technique for VLIW and superscalar compilation
-
W-M. Hwu, S.A. Mahlke, W.Y. Chen, P.P. Chang, N.J. Warter, R.A. Bringmann, R.G. Ouellette, R.E. Hank, T. Kiyohara, G.E. Haab, J.G. Holm, D.M. Lavery, The superblock: an effective technique for VLIW and superscalar compilation, Journal of Supercomputing A (1993) 229-248.
-
(1993)
Journal of Supercomputing A
, pp. 229-248
-
-
Hwu, W.-M.1
Mahlke, S.A.2
Chen, W.Y.3
Chang, P.P.4
Warter, N.J.5
Bringmann, R.A.6
Ouellette, R.G.7
Hank, R.E.8
Kiyohara, T.9
Haab, G.E.10
Holm, J.G.11
Lavery, D.M.12
-
87
-
-
85031523618
-
-
Document SC23-0526-01
-
IBM, Engineering and Scientific Subroutine Library (ESSL), Guide and Reference, Document SC23-0526-01, 1994.
-
(1994)
Guide and Reference
-
-
-
89
-
-
0008434041
-
Multiple threads template library
-
Real World Computing Partnership, September
-
Y. Ishikawa, Multiple threads template library, Technical Report TR-96-012, Real World Computing Partnership, September 1996.
-
(1996)
Technical Report TR-96-012
-
-
Ishikawa, Y.1
-
91
-
-
85031535123
-
Code motion for generating compact code on embedded DSPs
-
Washington, DC, 4-6 December'98
-
V. Jain, S. Pande, Code motion for generating compact code on embedded DSPs, 1998 Workshop on Compiler and Architecture Support for Embedded Systems, Washington, DC, 4-6 December'98. Available under publications link at http://www.ececs.uc.edu/ compiler.
-
1998 Workshop on Compiler and Architecture Support for Embedded Systems
-
-
Jain, V.1
Pande, S.2
-
92
-
-
0342798373
-
HPC++: Experiments with the parallel standard template library
-
Indiana University, Department of Computer Science, December
-
E. Johnson, D. Gannon, HPC++: Experiments with the parallel standard template library, Technical Report TR-96-51, Indiana University, Department of Computer Science, December 1996.
-
(1996)
Technical Report TR-96-51
-
-
Johnson, E.1
Gannon, D.2
-
94
-
-
0343668673
-
Minimizing data and synchronization costs in one-way communication
-
M. Kandemir, N. Shenoy, P. Banerjee, J. Ramanujam, A. Choudhary, Minimizing data and synchronization costs in one-way communication, in: International Conference on Parallel Processing, 1998, pp. 180-188.
-
(1998)
International Conference on Parallel Processing
, pp. 180-188
-
-
Kandemir, M.1
Shenoy, N.2
Banerjee, P.3
Ramanujam, J.4
Choudhary, A.5
-
96
-
-
0029192689
-
A linear-time algorithm for computing the memory access sequence in data-parallel programs
-
Santa Barbara, CA
-
K. Kennedy, N. Nedeljkovic, A. Sethi, A linear-time algorithm for computing the memory access sequence in data-parallel programs, in: Proceedings of Fifth ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, Santa Barbara, CA, 1995, pp. 102-111.
-
(1995)
Proceedings of Fifth ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming
, pp. 102-111
-
-
Kennedy, K.1
Nedeljkovic, N.2
Sethi, A.3
-
98
-
-
0042805977
-
Resource-based communication placement analysis
-
San Jose, CA, August
-
K. Kennedy, A. Sethi, Resource-based communication placement analysis, in: Proceedings Ninth Workshop on Languages and Compilers for Parallel Computing, San Jose, CA, August 1996.
-
(1996)
Proceedings Ninth Workshop on Languages and Compilers for Parallel Computing
-
-
Kennedy, K.1
Sethi, A.2
-
99
-
-
0027677302
-
Optimization techniques for SIMD Fortran compilers
-
K. Knobe, J. Lukas, M. Weiss, Optimization techniques for SIMD Fortran compilers, Concurrency: Practice and Experience 5 (7) (1993) 527-552.
-
(1993)
Concurrency: Practice and Experience
, vol.5
, Issue.7
, pp. 527-552
-
-
Knobe, K.1
Lukas, J.2
Weiss, M.3
-
100
-
-
0031705047
-
Array SSA form and its use in parallelization
-
San Diego, California, January
-
K. Knobe, V. Sarkar, Array SSA form and its use in parallelization, in: Proceedings of the 25th ACM Symposium on Principles of Programming Languages, San Diego, California, January 1998.
-
(1998)
Proceedings of the 25th ACM Symposium on Principles of Programming Languages
-
-
Knobe, K.1
Sarkar, V.2
-
102
-
-
0026294380
-
Compile-time generation of communication for scientific programs
-
Albuquerque, NM
-
C. Koelbel, Compile-time generation of communication for scientific programs, in: Proceedings of Supercomputing '91, Albuquerque, NM, 1991, pp. 101-110.
-
(1991)
Proceedings of Supercomputing '91
, pp. 101-110
-
-
Koelbel, C.1
-
103
-
-
0026190245
-
The I-Test: An improved dependence test for automatic parallelization and vectorization
-
IEEE Transactions on Parallel and Distributed Systems
-
X. Kong, D. Klappholz, K. Psarris, The I-Test: An improved dependence test for automatic parallelization and vectorization, IEEE Transactions on Parallel and Distributed Systems, Special Issue on Parallel Languages and Compilers 2 (3) (1991).
-
(1991)
Parallel Languages and Compilers
, vol.2
, Issue.3 SPEC. ISSUE
-
-
Kong, X.1
Klappholz, D.2
Psarris, K.3
-
104
-
-
0003657590
-
-
Addison-Wesley, Reading, MA
-
D. Knuth, The Art of Computer Programming, vol. 2, Seminumerical Algorithms, Addison-Wesley, Reading, MA, 1981.
-
(1981)
The Art of Computer Programming, Vol. 2, Seminumerical Algorithms
, vol.2
-
-
Knuth, D.1
-
105
-
-
85033186207
-
-
Document #9603001, Champaign, IL
-
Kuck and Associates, Inc., KAP for IBM Fortran, User's Guide Version 3.3, Document #9603001, Champaign, IL, 1996.
-
(1996)
KAP for IBM Fortran, User's Guide Version 3.3
-
-
-
106
-
-
0016026944
-
The parallel execution of DO loops
-
L. Lamport, The parallel execution of DO loops, Communications of the ACM 17 (2) (1974) 83-93.
-
(1974)
Communications of the ACM
, vol.17
, Issue.2
, pp. 83-93
-
-
Lamport, L.1
-
107
-
-
0003327314
-
Concurrent static single assignment form and constant propagation for explicitly parallel programs
-
Proceedings of the 10th International Workshop on Languages and Compilers for Parallel Computing, Springer, Minneapolis, MN, August
-
J. Lee, S.P. Midkiff, D.A. Padua, Concurrent static single assignment form and constant propagation for explicitly parallel programs, in: Proceedings of the 10th International Workshop on Languages and Compilers for Parallel Computing, Lecture Notes in Computer Science, Springer, Minneapolis, MN, August 1997.
-
(1997)
Lecture Notes in Computer Science
-
-
Lee, J.1
Midkiff, S.P.2
Padua, D.A.3
-
108
-
-
0026187669
-
Compiling communication-efficient programs for massively parallel machines
-
J. Li, M. Chen, Compiling communication-efficient programs for massively parallel machines, IEEE Transactions on Parallel and Distributed Systems 2 (3) (1991) 361-376.
-
(1991)
IEEE Transactions on Parallel and Distributed Systems
, vol.2
, Issue.3
, pp. 361-376
-
-
Li, J.1
Chen, M.2
-
109
-
-
0343232962
-
Efficient interprocedural analysis for program parallelization and restructuring
-
New Haven, CT, July
-
Z. Li, P. Yew, Efficient interprocedural analysis for program parallelization and restructuring, in: Proceedings of the ACM/SIGPLAN Symposium on Parallel Programming, New Haven, CT, July 1998.
-
(1998)
Proceedings of the ACM/SIGPLAN Symposium on Parallel Programming
-
-
Li, Z.1
Yew, P.2
-
114
-
-
0030265013
-
Value locality and load value prediction
-
Cambridge, MA
-
M.H. Lipasti, C.B. Wilkerson, J.P. Shen, Value locality and load value prediction, in: Proceedings of the Seventh International Conference on Architectural Support for Programming Languages and Operating Systems, Cambridge, MA, 1996, pp. 138-149.
-
(1996)
Proceedings of the Seventh International Conference on Architectural Support for Programming Languages and Operating Systems
, pp. 138-149
-
-
Lipasti, M.H.1
Wilkerson, C.B.2
Shen, J.P.3
-
115
-
-
0011598990
-
Addressing in Cray Research's MPP Fortran
-
Vienna, Austria
-
T. MacDonald, D. Pase, A. Meltzer, Addressing in Cray Research's MPP Fortran, in: Proceedings of the Third Workshop on Compilers for Parallel Computers, Vienna, Austria, 1992, pp. 161-172.
-
(1992)
Proceedings of the Third Workshop on Compilers for Parallel Computers
, pp. 161-172
-
-
MacDonald, T.1
Pase, D.2
Meltzer, A.3
-
118
-
-
0030717767
-
Dynamic speculation and synchronization of data dependences
-
A.I. Moshovos, S.E. Breach, T.N. Vijaykumar, G.S. Sohi, Dynamic speculation and synchronization of data dependences, in: Proceedings of the 24th International Symposium on Computer Architecture, 1997.
-
(1997)
Proceedings of the 24th International Symposium on Computer Architecture
-
-
Moshovos, A.I.1
Breach, S.E.2
Vijaykumar, T.N.3
Sohi, G.S.4
-
119
-
-
0030674213
-
Exploiting instruction level parallelism in processors by caching scheduled groups
-
Denver, Colorado
-
R. Nair, M.E. Hopkins, Exploiting instruction level parallelism in processors by caching scheduled groups, in: Proceedings of the International Symposium on Computer Architecture, Denver, Colorado, 1997, pp. 13-25.
-
(1997)
Proceedings of the International Symposium on Computer Architecture
, pp. 13-25
-
-
Nair, R.1
Hopkins, M.E.2
-
120
-
-
0032672879
-
Value prediction in VLIW machines
-
Atlanta, Georgia
-
T. Nakra, R. Gupta, M.L. Soffa, Value prediction in VLIW machines, in: Proceedings of the ACM/IEEE 26th International Symposium on Computer Architecture, Atlanta, Georgia, 1999.
-
(1999)
Proceedings of the ACM/IEEE 26th International Symposium on Computer Architecture
-
-
Nakra, T.1
Gupta, R.2
Soffa, M.L.3
-
121
-
-
0027767071
-
A scheduler-sensitive global register allocator
-
Portland, Oregon
-
C. Norris, L.L. Pollock, A scheduler-sensitive global register allocator, in: Proceedings of Supercomputing '93, Portland, Oregon, 1993, pp. 804-813.
-
(1993)
Proceedings of Supercomputing '93
, pp. 804-813
-
-
Norris, C.1
Pollock, L.L.2
-
122
-
-
28444453374
-
Superscalar execution with direct data forwarding
-
Paris, France
-
S. Onder, R. Gupta, Superscalar execution with direct data forwarding, in: Proceedings of the International Conference on Parallel Architectures and Compilation Techniques, Paris, France, 1998, pp. 130-135.
-
(1998)
Proceedings of the International Conference on Parallel Architectures and Compilation Techniques
, pp. 130-135
-
-
Onder, S.1
Gupta, R.2
-
123
-
-
0003701628
-
-
Stanford University Computer Systems Lab, Technical Report CSL-TR-97-715, February
-
J. Oplinger, D. Heine, S-W. Liao, B.A. Nayfeh, M.S. Lam, K. Olukotun, Software and hardware for exploiting speculative parallelism with a multiprocessor, Stanford University Computer Systems Lab, Technical Report CSL-TR-97-715, February 1997.
-
(1997)
Software and Hardware for Exploiting Speculative Parallelism with a Multiprocessor
-
-
Oplinger, J.1
Heine, D.2
Liao, S.-W.3
Nayfeh, B.A.4
Lam, M.S.5
Olukotun, K.6
-
124
-
-
0040822529
-
-
Ph.D. Thesis, Department of Electrical and Computer Engineering, University of Illinois, Urbana, IL, June
-
D.J. Palermo, Compiler techniques for optimizing communication and data distribution for distributed-memory multicomputers, Ph.D. Thesis, Department of Electrical and Computer Engineering, University of Illinois, Urbana, IL, June 1996.
-
(1996)
Compiler Techniques for Optimizing Communication and Data Distribution for Distributed-memory Multicomputers
-
-
Palermo, D.J.1
-
126
-
-
85031535440
-
A computation + communication load balanced loop partitioning method for distributed memory systems
-
to appear
-
S. Pande, T. Bali, A computation + communication load balanced loop partitioning method for distributed memory systems, Journal of Parallel and Distributed Computing, to appear.
-
Journal of Parallel and Distributed Computing
-
-
Pande, S.1
Bali, T.2
-
128
-
-
0002524997
-
A compile time partitioning method for DOALL loops on distributed memory systems
-
IEEE Computer Society Press, Silver Spring, MD
-
S. Pande, A compile time partitioning method for DOALL loops on distributed memory systems, in: International Conference on Parallel Processing, vol. III, IEEE Computer Society Press, Silver Spring, MD, 1996, pp. 35-44.
-
(1996)
International Conference on Parallel Processing
, vol.3
, pp. 35-44
-
-
Pande, S.1
-
130
-
-
84984058313
-
Dependence flow graphs: An algebraic approach to program dependences
-
K. Pingali, M. Beck, R. Johnson, M. Moudgill, P. Stodghill, Dependence flow graphs: An algebraic approach to program dependences, in: Proceedings of the ACM Symposium on Principles of Programming Languages, 1991.
-
(1991)
Proceedings of the ACM Symposium on Principles of Programming Languages
-
-
Pingali, K.1
Beck, M.2
Johnson, R.3
Moudgill, M.4
Stodghill, P.5
-
131
-
-
0030076621
-
The Banerjee-Wolfe and GCD tests on exact data dependence information
-
K. Psarris, The Banerjee-Wolfe and GCD tests on exact data dependence information, Journal of Parallel and Distributed Computing 32 (2) (1996).
-
(1996)
Journal of Parallel and Distributed Computing
, vol.32
, Issue.2
-
-
Psarris, K.1
-
132
-
-
33745192514
-
On the accuracy of the Banerjee test
-
K. Psarris, D. Klappholz, X. Kong, On the accuracy of the Banerjee test, Journal of Parallel and Distributed Computing, Special Issue on Shared Memory Multiprocessors 12 (2) (1991).
-
(1991)
Shared Memory Multiprocessors
, vol.12
, Issue.2 SPEC. ISSUE
-
-
Psarris, K.1
Klappholz, D.2
Kong, X.3
-
133
-
-
79851482676
-
An empirical study of the I test for exact data dependence
-
St. Charles, IL, August
-
K. Psarris, S. Pande, An empirical study of the I test for exact data dependence, in: Proceedings of the 1994 International Conference on Parallel Processing, St. Charles, IL, August 1994.
-
(1994)
Proceedings of the 1994 International Conference on Parallel Processing
-
-
Psarris, K.1
Pande, S.2
-
135
-
-
84976676720
-
A practical algorithm for exact array dependence analysis
-
W. Pugh, A practical algorithm for exact array dependence analysis, Communications of the ACM 35 (8) (1992).
-
(1992)
Communications of the ACM
, vol.35
, Issue.8
-
-
Pugh, W.1
-
137
-
-
84966585509
-
Optimal task scheduling to minimize inter-tile latencies
-
F. Rastello, A. Rao, S. Pande, Optimal task scheduling to minimize inter-tile latencies, in: International Conference on Parallel Processing, 1998, pp. 172-179.
-
(1998)
International Conference on Parallel Processing
, pp. 172-179
-
-
Rastello, F.1
Rao, A.2
Pande, S.3
-
138
-
-
0003015894
-
Some scheduling techniques and an easily schedulable horizontal architecture for high performance scientific computing
-
Chatham, MA
-
B.R. Rau, C.D. Glaser, Some scheduling techniques and an easily schedulable horizontal architecture for high performance scientific computing, in: Proceedings of the 14th Annual Microprogramming Workshop, Chatham, MA, 1981, pp. 183-198.
-
(1981)
Proceedings of the 14th Annual Microprogramming Workshop
, pp. 183-198
-
-
Rau, B.R.1
Glaser, C.D.2
-
140
-
-
0030247075
-
An implementation framework for HPF distributed arrays on message-passing parallel computer systems
-
C. van Reeuwijk, H. Sips, W. Denissen, E. Paalvast, An implementation framework for HPF distributed arrays on message-passing parallel computer systems, IEEE Transactions on Parallel and Distributed Systems 7 (9) (1996) 897-914.
-
(1996)
IEEE Transactions on Parallel and Distributed Systems
, vol.7
, Issue.9
, pp. 897-914
-
-
Van Reeuwijk, C.1
Sips, H.2
Denissen, W.3
Paalvast, E.4
-
141
-
-
85031529201
-
-
Document Number VA061, Santa Monica, CA
-
Pacific-Sierra Research Corporation, VAST-2 for XL Fortran, User's Guide, Edition 1.2, Document Number VA061, Santa Monica, CA, 1994.
-
(1994)
VAST-2 for XL Fortran, User's Guide, Edition 1.2
-
-
-
143
-
-
17144391554
-
Software pipelining showdown: Optimal vs. heuristic methods in a production compiler
-
Philadelphia, Pennsylvania
-
J. Ruttenberg, G.R. Gao, A. Stoutchinin, W. Lichtenstein, Software pipelining showdown: optimal vs. heuristic methods in a production compiler, in: Proceedings of the ACM SIGPLAN Conference on Programming Language Design and Implementation, Philadelphia, Pennsylvania, 1996, pp. 1-11.
-
(1996)
Proceedings of the ACM SIGPLAN Conference on Programming Language Design and Implementation
, pp. 1-11
-
-
Ruttenberg, J.1
Gao, G.R.2
Stoutchinin, A.3
Lichtenstein, W.4
-
144
-
-
0343668653
-
Optimizing CM Fortran compiler for connection machine computers
-
G. Sabot, Optimizing CM Fortran compiler for connection machine computers, Journal of Parallel and Distributed Computing 23 (1) (1994) 224-238.
-
(1994)
Journal of Parallel and Distributed Computing
, vol.23
, Issue.1
, pp. 224-238
-
-
Sabot, G.1
-
145
-
-
0025416470
-
Run-time scheduling and execution of loops on message passing machines
-
J. Saltz, K. Crowley, R. Mirchandaney, H. Berryman, Run-time scheduling and execution of loops on message passing machines, Journal of Parallel and Distributed Computing 8 (4) (1990).
-
(1990)
Journal of Parallel and Distributed Computing
, vol.8
, Issue.4
-
-
Saltz, J.1
Crowley, K.2
Mirchandaney, R.3
Berryman, H.4
-
146
-
-
0003493010
-
-
Pitman, London and The MIT Press, Cambridge, MA
-
V. Sarkar, Partitioning and scheduling parallel programs for multiprocessors, Pitman, London and The MIT Press, Cambridge, MA, 1989.
-
(1989)
Partitioning and Scheduling Parallel Programs for Multiprocessors
-
-
Sarkar, V.1
-
147
-
-
0026213832
-
Automatic partitioning of a program dependence graph into parallel tasks
-
V. Sarkar, Automatic partitioning of a program dependence graph into parallel tasks, IBM Journal of Research and Development 35 (5/6) (1991).
-
(1991)
IBM Journal of Research and Development
, vol.35
, Issue.5-6
-
-
Sarkar, V.1
-
148
-
-
0026991030
-
A general framework for iteration-reordering loop transformations
-
San Francisco, California
-
V. Sarkar, R. Thekkath, A general framework for iteration-reordering loop transformations, in: Proceedings of the ACM SIGPLAN Conference on Programming Language Design and Implementation, San Francisco, California, 1992, pp. 175-187.
-
(1992)
Proceedings of the ACM SIGPLAN Conference on Programming Language Design and Implementation
, pp. 175-187
-
-
Sarkar, V.1
Thekkath, R.2
-
149
-
-
0031140581
-
Automatic selection of high order transformations in the IBM XL Fortran Compilers
-
V. Sarkar, Automatic selection of high order transformations in the IBM XL Fortran Compilers, IBM Journal of Research and Development 41 (3) (1997).
-
(1997)
IBM Journal of Research and Development
, vol.41
, Issue.3
-
-
Sarkar, V.1
-
153
-
-
84955559042
-
Efficient distribution analysis via graph contraction
-
Proceedings of the Eighth Workshop on Languages and Compilers for Parallel Computing, Columbus, OH, August Springer, Berlin
-
T.J. Sheffler, R. Schreiber, J.R. Gilbert, W. Pugh, Efficient distribution analysis via graph contraction, in: Proceedings of the Eighth Workshop on Languages and Compilers for Parallel Computing, Lecture Notes in Computer Science 1033, Columbus, OH, August 1995. Springer, Berlin, 1996, pp. 377-391.
-
(1995)
Lecture Notes in Computer Science
, vol.1033
, pp. 377-391
-
-
Sheffler, T.J.1
Schreiber, R.2
Gilbert, J.R.3
Pugh, W.4
-
155
-
-
0029228631
-
The communication software and parallel environment of the IBM SP2
-
M. Snir et al., The communication software and parallel environment of the IBM SP2, IBM Systems Journal 34 (2) (1995) 205-221.
-
(1995)
IBM Systems Journal
, vol.34
, Issue.2
, pp. 205-221
-
-
Snir, M.1
-
156
-
-
0343232951
-
-
SPAM Research Group, SPAM compiler user's manual, 1997, http://www.ee.princeton.edu/spam.
-
(1997)
SPAM Compiler User's Manual
-
-
-
157
-
-
0342798352
-
-
Ph.D. Thesis, Department of Computer Science, University of Colorado, Denver, Colorado
-
H. Srinivasan, Optimizing explicitly parallel programs, Ph.D. Thesis, Department of Computer Science, University of Colorado, Denver, Colorado, 1994.
-
(1994)
Optimizing Explicitly Parallel Programs
-
-
Srinivasan, H.1
-
158
-
-
0000412263
-
Generating communication for array statements: Design, implementation and evaluation
-
J. Stichnoth, D. O'Hallaron, T. Gross, Generating communication for array statements: Design, implementation and evaluation, Journal of Parallel and Distributed Computing 21 (1) (1994) 150-159.
-
(1994)
Journal of Parallel and Distributed Computing
, vol.21
, Issue.1
, pp. 150-159
-
-
Stichnoth, J.1
O'Hallaron, D.2
Gross, T.3
-
162
-
-
0000606960
-
Fast address sequence generation for data-parallel programs using integer lattices
-
Proceedings of the Languages and Compilers for Parallel Computing, Springer, Berlin
-
A. Thirumalai, J. Ramanujam, Fast address sequence generation for data-parallel programs using integer lattices, in: Proceedings of the Languages and Compilers for Parallel Computing, Lecture Notes in Computer Science 1033, Springer, Berlin, 1996, pp. 191-208.
-
(1996)
Lecture Notes in Computer Science
, vol.1033
, pp. 191-208
-
-
Thirumalai, A.1
Ramanujam, J.2
-
163
-
-
0030295507
-
Efficient computation of address sequences in data-parallel programs using closed forms for basis vectors
-
A. Thirumalai, J. Ramanujam, Efficient computation of address sequences in data-parallel programs using closed forms for basis vectors, Journal of Parallel and Distributed Computing 38 (2) (1996) 188-203.
-
(1996)
Journal of Parallel and Distributed Computing
, vol.38
, Issue.2
, pp. 188-203
-
-
Thirumalai, A.1
Ramanujam, J.2
-
164
-
-
84886627169
-
Dataflow analysis driven dynamic data partitioning
-
Fourth Workshop on Languages, Compilers, and Run-Time Systems for Scalable Computers, Springer, Pittsburgh, PA, May
-
J. Tims, R. Gupta, M.L. Soffa, Dataflow analysis driven dynamic data partitioning, in: Fourth Workshop on Languages, Compilers, and Run-Time Systems for Scalable Computers, Lecture Notes in Computer Science, vol. 1511, Springer, Pittsburgh, PA, May 1998, pp. 75-90.
-
(1998)
Lecture Notes in Computer Science
, vol.1511
, pp. 75-90
-
-
Tims, J.1
Gupta, R.2
Soffa, M.L.3
-
165
-
-
0026986882
-
Global analysis for partitioning non-strict programs into sequential threads
-
San Francisco, CA
-
K.R. Traub, D.E. Culler, K.E. Schauser, Global analysis for partitioning non-strict programs into sequential threads, in: ACM Conference on Lisp and Functional Programming, San Francisco, CA, 1992.
-
(1992)
ACM Conference on Lisp and Functional Programming
-
-
Traub, K.R.1
Culler, D.E.2
Schauser, K.E.3
-
166
-
-
84976844340
-
Direct parallelization of call statements
-
R. Triolet, F. Irigoin, P. Feautrier, Direct parallelization of call statements, in: Proceedings of the SIGPLAN Symposium on Compiler Construction, 1986, pp. 176-185.
-
(1986)
Proceedings of the SIGPLAN Symposium on Compiler Construction
, pp. 176-185
-
-
Triolet, R.1
Irigoin, F.2
Feautrier, P.3
-
167
-
-
33751027415
-
Array privatization for shared and distributed memory machines
-
in ACM SIGPLAN Notices
-
P. Tu, D. Padua, Array privatization for shared and distributed memory machines, in: Proceedings Second Workshop on Languages, Compilers, and Run-Time Environments for Distributed Memory Machines, in ACM SIGPLAN Notices, 1993.
-
(1993)
Proceedings Second Workshop on Languages, Compilers, and Run-Time Environments for Distributed Memory Machines
-
-
Tu, P.1
Padua, D.2
-
168
-
-
0010224751
-
Runtime performance of parallel array assignment: An empirical study
-
Pittsburgh, PA
-
L. Wang, J. Stichnoth, S. Chatterjee, Runtime performance of parallel array assignment: an empirical study, in: Proceedings Supercomputing 96, Pittsburgh, PA, 1996.
-
(1996)
Proceedings Supercomputing 96
-
-
Wang, L.1
Stichnoth, J.2
Chatterjee, S.3
-
170
-
-
84976692695
-
SUIF: A parallelizing and optimizing research compiler
-
R. Wilson et al., SUIF: a parallelizing and optimizing research compiler, SIGPLAN Notices 29 (12) (1994) 31-37.
-
(1994)
SIGPLAN Notices
, vol.29
, Issue.12
, pp. 31-37
-
-
Wilson, R.1
-
172
-
-
0004062640
-
-
Pitman, London and The MIT Press, Cambridge, MA
-
M.J. Wolfe, Optimizing Supercompilers for Supercomputers, Pitman, London and The MIT Press, Cambridge, MA, 1989.
-
(1989)
Optimizing Supercompilers for Supercomputers
-
-
Wolfe, M.J.1