SCOPUS 정보 검색 플랫폼

Parallel Computing

Volumn 25, Issue 13, 1999, Pages 1741-1783

Compilation techniques for parallel systems

(4) Gupta, Rajiv a Pande, Santosh b Psarris, Kleanthis c Sarkar, Vivek d

a Gould Simpson Building (United States)

b University of Cincinnati (United States)

c Science Building (United States)

d IBM T J WATSON RESEARCH CENTER (United States)

Author keywords

[No Author keywords available]

Indexed keywords

COMPUTER ARCHITECTURE; COMPUTER SYSTEMS PROGRAMMING; DATA STORAGE EQUIPMENT; OPTIMIZATION; PROGRAM COMPILERS; RESPONSE TIME (COMPUTER SYSTEMS);

INSTRUCTION LEVEL PARALLELISM (ILP); SHARED-MEMORY MULTIPROCESSORS (SMP);

PARALLEL PROCESSING SYSTEMS;

EID: 0033283421 PISSN: 01678191 EISSN: None Source Type: Journal
DOI: 10.1016/S0167-8191(99)00086-1 Document Type: Article

Times cited : (28)

References (174)

1
- 0031623811
- Using integer sets for data-parallel analysis and optimization
- Montreal, Canada
- V. Adve, J. Mellor-Crummey, Using integer sets for data-parallel analysis and optimization, in: ACM SIGPLAN Conference on Programming Language Design and Implementation, Montreal, Canada, 1998, pp. 186-198.
- (1998) ACM SIGPLAN Conference on Programming Language Design and Implementation , pp. 186-198
- Adve, V.¹ Mellor-Crummey, J.²

2
- 0029373981
- Automatic partitioning of parallel loops and data arrays for distributed shared-memory multiprocessors
- A. Agarwal, D. Kranz, V. Natrajan, Automatic partitioning of parallel loops and data arrays for distributed shared-memory multiprocessors, IEEE Transactions on Parallel and Distributed Systems 6 (9) (1995) 943-962.
- (1995) IEEE Transactions on Parallel and Distributed Systems , vol.6 , Issue.9 , pp. 943-962
- Agarwal, A.¹ Kranz, D.² Natrajan, V.³

3
- 0032114494
- Interprocedural partial redundancy elimination with application to distributed memory compilation
- G. Agrawal, Interprocedural partial redundancy elimination with application to distributed memory compilation, IEEE Transactions on Parallel and Distributed Systems 9 (7) (1998) 609-625.
- (1998) IEEE Transactions on Parallel and Distributed Systems , vol.9 , Issue.7 , pp. 609-625
- Agrawal, G.¹

4
- 0004072686
- Addison-Wesley, Reading, MA
- A. Aho, R. Sethi, J. Ullman, Compilers: Principles, Techniques and Tools, Addison-Wesley, Reading, MA, 1988.
- (1988) Compilers: Principles, Techniques and Tools
- Aho, A.¹ Sethi, R.² Ullman, J.³

5
- 0007941219
- A development environment for horizontal microcode
- A. Aiken, A. Nicolau, A development environment for horizontal microcode, IEEE Transactions on Software Engineering 14 (5) (1988) 584-594.
- (1988) IEEE Transactions on Software Engineering , vol.14 , Issue.5 , pp. 584-594
- Aiken, A.¹ Nicolau, A.²

6
- 0023438847
- Automatic translation of FORTRAN programs to vector form
- R. Allen, K. Kennedy, Automatic translation of FORTRAN programs to vector form, ACM Transactions on Programming Languages and Systems 9 (4) (1987) 491-592.
- (1987) ACM Transactions on Programming Languages and Systems , vol.9 , Issue.4 , pp. 491-592
- Allen, R.¹ Kennedy, K.²

7
- 0342363542
- Vector register allocation
- Rice University, Houston, TX, December
- R. Allen, K. Kennedy, Vector register allocation, Technical Report TR86-45, Rice University, Houston, TX, December 1986.
- (1986) Technical Report TR86-45
- Allen, R.¹ Kennedy, K.²

8
- 0012777446
- Ph.D. Thesis, McGill University, Montreal, Quebec
- E.R. Altman, Optimal software pipelining with functional unit and register constraints, Ph.D. Thesis, McGill University, Montreal, Quebec, 1995.
- (1995) Optimal Software Pipelining with Functional Unit and Register Constraints
- Altman, E.R.¹

9
- 0042193410
- Computer Systems Laboratory, Stanford University, January
- S.P. Amarasinghe, Parallelizing Compiler Techniques Based on Linear Inequalities, Computer Systems Laboratory, Stanford University, January 1997.
- (1997) Parallelizing Compiler Techniques Based on Linear Inequalities
- Amarasinghe, S.P.¹

10
- 0027802136
- Communication optimization and code generation for distributed memory machines
- Albuquerque, New Mexico, June
- S.P. Amarasinghe, M.S. Lam, Communication optimization and code generation for distributed memory machines, in: Proceedings ACM SIGPLAN'93 Conference on Programming Language Design and Implementation, Albuquerque, New Mexico, June 1993.
- (1993) Proceedings ACM SIGPLAN'93 Conference on Programming Language Design and Implementation
- Amarasinghe, S.P.¹ Lam, M.S.²

11
- 84976766536
- Scanning polyhedra with do loops
- Williamsburg, VA, April
- C. Ancourt, F. Irigoin, Scanning polyhedra with do loops, in: Proceedings of the Third ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, Williamsburg, VA, April 1991.
- (1991) Proceedings of the Third ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming
- Ancourt, C.¹ Irigoin, F.²

12
- 0031104380
- A linear algebra framework for static HPF code distribution
- A. Ancourt, F. Coelho, F. Irigoin, R. Keryell, A linear algebra framework for static HPF code distribution, Scientific Programming 6 (1) (1997) 3-28.
- (1997) Scientific Programming , vol.6 , Issue.1 , pp. 3-28
- Ancourt, A.¹ Coelho, F.² Irigoin, F.³ Keryell, R.⁴

13
- 0027870804
- Global optimizations for parallelism and locality on scalable parallel machines
- June
- J.M. Anderson, M.S. Lam, Global optimizations for parallelism and locality on scalable parallel machines, in: Proceedings of the ACM SIGPLAN Conference on Programming Language Design and Implementation (PLDI), June 1993, pp. 112-125.
- (1993) Proceedings of the ACM SIGPLAN Conference on Programming Language Design and Implementation (PLDI) , pp. 112-125
- Anderson, J.M.¹ Lam, M.S.²

14
- 0029710317
- Using register-transfer paths in code generation for heterogeneous memory-register architectures
- G. Araujo, S. Malik, M. Lee, Using register-transfer paths in code generation for heterogeneous memory-register architectures, in: Proceedings of the 33rd ACM/IEEE Design Automation Conference, 1996, pp. 591-596.
- (1996) Proceedings of the 33rd ACM/IEEE Design Automation Conference , pp. 591-596
- Araujo, G.¹ Malik, S.² Lee, M.³

15
- 85008031722
- Instruction set design and optimization for address computation in DSP architectures
- G. Araujo, A. Sudarsanam, S. Malik, Instruction set design and optimization for address computation in DSP architectures, in: Proceedings of the Ninth International Symposium on System Synthesis, 1997, pp. 31-37.
- (1997) Proceedings of the Ninth International Symposium on System Synthesis , pp. 31-37
- Araujo, G.¹ Sudarsanam, A.² Malik, S.³

16
- 0003487052
- Addison-Wesley, Reading, MA
- K. Arnold, J. Gosling, The Java Programming Language, Addison-Wesley, Reading, MA, 1996.
- (1996) The Java Programming Language
- Arnold, K.¹ Gosling, J.²

17
- 0031359056
- A framework for balancing control flow and predication
- Research Triangle Park, North Carolina, December
- D.I. August, W.W. Hwu, S.A. Mahlke, A framework for balancing control flow and predication, in: Proceedings of the 30th Annual International Symposium on Microarchitecture, Research Triangle Park, North Carolina, December 1997, pp. 92-103.
- (1997) Proceedings of the 30th Annual International Symposium on Microarchitecture , pp. 92-103
- August, D.I.¹ Hwu, W.W.² Mahlke, S.A.³

18
- 0004270780
- Kluwer Academic Publishers, Norwell, MA
- U. Banerjee, Dependence Analysis for Supercomputing, Kluwer Academic Publishers, Norwell, MA, 1988.
- (1988) Dependence Analysis for Supercomputing
- Banerjee, U.¹

19
- 0003612724
- Kluwer Academic Publishers, Boston, MA
- U. Banerjee, Loop Transformations for Restructuring Compilers: The Foundations, Kluwer Academic Publishers, Boston, MA, 1993.
- (1993) Loop Transformations for Restructuring Compilers: The Foundations
- Banerjee, U.¹

20
- 0029394470
- The PARADIGM compiler for distributed-memory multicomputers
- P. Banerjee, J.A. Chandy, M. Gupta, E.W. Hodges IV, J.G. Holm, A. Lain, D.J. Palermo, S. Ramaswamy, E. Su, The PARADIGM compiler for distributed-memory multicomputers, IEEE Computer 28 (10) (1995) 37-47.
- (1995) IEEE Computer , vol.28 , Issue.10 , pp. 37-47
- Banerjee, P.¹ Chandy, J.A.² Gupta, M.³ Hodges E.W. IV⁴ Holm, J.G.⁵ Lain, A.⁶ Palermo, D.J.⁷ Ramaswamy, S.⁸ Su, E.⁹

21
- 0026817662
- Optimizing stack frame accesses for processors with restricted addressing modes
- D. Bartley, Optimizing stack frame accesses for processors with restricted addressing modes, Software Practice and Experience 22 (2) (1992) 101-110.
- (1992) Software Practice and Experience , vol.22 , Issue.2 , pp. 101-110
- Bartley, D.¹

22
- 84947776744
- Solving alignment using elementary linear algebra
- Proceedings of the Seventh Workshop on Languages and Compilers for Parallel Computing, Ithica, NY, Springer, Berlin
- D. Bau, I. Koduklula, V. Kotlyar, K. Pingali, P. Stodghill, Solving alignment using elementary linear algebra, in: Proceedings of the Seventh Workshop on Languages and Compilers for Parallel Computing, Lecture Notes in Computer Science, vol. 892, Ithica, NY, 1994, Springer, Berlin, 1995, pp. 46-60.
- (1994) Lecture Notes in Computer Science , vol.892 , pp. 46-60
- Bau, D.¹ Koduklula, I.² Kotlyar, V.³ Pingali, K.⁴ Stodghill, P.⁵

23
- 0027001568
- Vienna Fortran 90
- Williamsburg, VA, April
- S. Benkner, B. Chapman, H. Zima, Vienna Fortran 90, in: Proceedings of the 1992 Scalable High Performance Computing Conference, Williamsburg, VA, April 1992.
- (1992) Proceedings of the 1992 Scalable High Performance Computing Conference
- Benkner, S.¹ Chapman, B.² Zima, H.³

24
- 0029718941
- Tulip: A portable run-time system for object-parallel systems
- April
- P. Beckman, D. Gannon, Tulip: a portable run-time system for object-parallel systems, in: Proceedings of the 10th International Parallel Processing Symposium, April 1996.
- (1996) Proceedings of the 10th International Parallel Processing Symposium
- Beckman, P.¹ Gannon, D.²

25
- 0028594328
- Resource spackling: A framework for integrating register allocation in local and global schedulers
- D.A. Berson, R. Gupta, M.L. Soffa, Resource spackling: a framework for integrating register allocation in local and global schedulers, in: Proceedings of IFIP WG 10.3 Working Conference on Parallel Architectures and Compilation Techniques, 1994, pp. 135-146.
- (1994) Proceedings of IFIP WG 10.3 Working Conference on Parallel Architectures and Compilation Techniques , pp. 135-146
- Berson, D.A.¹ Gupta, R.² Soffa, M.L.³

26
- 0028583166
- Automatic data layout using 0-1 integer programming
- Montréal, Canada, August
- R. Bixby, K. Kennedy, U. Kremer, Automatic data layout using 0-1 integer programming, in: Proceedings of the 1994 International Conference on Parallel Architectures and Compilation Techniques, Montréal, Canada, August 1994, pp. 111-122.
- (1994) Proceedings of the 1994 International Conference on Parallel Architectures and Compilation Techniques , pp. 111-122
- Bixby, R.¹ Kennedy, K.² Kremer, U.³

27
- 0031679132
- Escape analysis: Correctness, proof, implementation and experimental results
- San Diego, CA, January
- B. Blanchet, Escape analysis: correctness, proof, implementation and experimental results, in: Proceedings of the 25th Annual ACM Symposium on Principles of Programming Languages, San Diego, CA, January 1998, pp. 25-37.
- (1998) Proceedings of the 25th Annual ACM Symposium on Principles of Programming Languages , pp. 25-37
- Blanchet, B.¹

28
- 0032313172
- Nonlinear and symbolic data dependence testing
- W. Blume, R. Eigenmann, Nonlinear and symbolic data dependence testing, IEEE Transactions on Parallel and Distributed Systems 9 (12) (1998).
- (1998) IEEE Transactions on Parallel and Distributed Systems , vol.9 , Issue.12
- Blume, W.¹ Eigenmann, R.²

29
- 0348137596
- Complete removal of redundant expressions
- Montreal, Canada, June
- R. Bodik, R. Gupta, M.L. Soffa, Complete removal of redundant expressions, in: Proceedings of the ACM SIGPLAN Conference on Programming Language Design and Implementation, Montreal, Canada, June 1998, pp. 1-14.
- (1998) Proceedings of the ACM SIGPLAN Conference on Programming Language Design and Implementation , pp. 1-14
- Bodik, R.¹ Gupta, R.² Soffa, M.L.³

30
- 0030645017
- Partial dead code elimination using slicing transformations
- Las Vegas, Nevada, June
- R. Bodik, R. Gupta, Partial dead code elimination using slicing transformations, in: Proceedings of the ACM SIGPLAN Conference on Programming Language Design and Implementation, Las Vegas, Nevada, June 1997, pp. 159-170.
- (1997) Proceedings of the ACM SIGPLAN Conference on Programming Language Design and Implementation , pp. 159-170
- Bodik, R.¹ Gupta, R.²

31
- 84976782196
- Interprocedural dependence analysis and parallelization
- July
- M. Burke, R. Cytron, Interprocedural dependence analysis and parallelization, in: Proceedings of the SIGPLAN Symposium on Compiler Construction, July 1986, pp. 162-175.
- (1986) Proceedings of the SIGPLAN Symposium on Compiler Construction , pp. 162-175
- Burke, M.¹ Cytron, R.²

32
- 0013045469
- Ph.D. Thesis, Massachussetts Institute of Technology
- A. Caro, Generating multithreaded code from parallel Haskell for symmetric multiprocessors, Ph.D. Thesis, Massachussetts Institute of Technology, 1999.
- (1999) Generating Multithreaded Code from Parallel Haskell for Symmetric Multiprocessors
- Caro, A.¹

33
- 17144418556
- Global communication analysis and optimization
- Philadelphia, PA, May
- S. Chakrabarti, M. Gupta, J.-D. Choi, Global communication analysis and optimization, in: Proceedings ACM SIGPLAN Conference on Programming Language Design and Implementation, Philadelphia, PA, May 1996.
- (1996) Proceedings ACM SIGPLAN Conference on Programming Language Design and Implementation
- Chakrabarti, S.¹ Gupta, M.² Choi, J.-D.³

34
- 0011611816
- CC++: A declarative concurrent object-oriented programming notation
- MIT Press, Cambridge, MA
- K.M. Chandy, C. Kesselman, CC++: a declarative concurrent object-oriented programming notation, in: Research Directions in Concurrent Object Oriented Programming, MIT Press, Cambridge, MA, 1993.
- (1993) Research Directions in Concurrent Object Oriented Programming
- Chandy, K.M.¹ Kesselman, C.²

35
- 0343668689
- Automatic support for data distribution on distributed memory multiprocessor systems
- Proceedings of the Sixth Workshop on Languages and Compilers for Parallel Computing, Portland, OR, Aug. Springer, Berlin
- B. Chapman, T. Fahringer, H. Zima, Automatic support for data distribution on distributed memory multiprocessor systems, in: Proceedings of the Sixth Workshop on Languages and Compilers for Parallel Computing, Lecture Notes in Computer Science, vol. 768, Portland, OR, Aug. 1993, Springer, Berlin, 1994, pp. 184-199.
- (1993) Lecture Notes in Computer Science , vol.768 , pp. 184-199
- Chapman, B.¹ Fahringer, T.² Zima, H.³

36
- 0343668692
- The alignment-distribution graph
- Languages and Compilers for Parallel Computing, Sixth International Workshop, Springer, Berlin
- S. Chatterjee, J. Gilbert, R. Schreiber, The alignment-distribution graph, in: Languages and Compilers for Parallel Computing, Sixth International Workshop, Lecture Notes in Computer Science, vol. 768, Springer, Berlin, 1993.
- (1993) Lecture Notes in Computer Science , vol.768
- Chatterjee, S.¹ Gilbert, J.² Schreiber, R.³

37
- 0002742410
- Generating local addresses and communication sets for data parallel programs
- S. Chatterjee, J. Gilbert, F. Long, R. Schreiber, S. Teng, Generating local addresses and communication sets for data parallel programs, Journal of Parallel and Distributed Computing 26 (1) (1995) 72-84.
- (1995) Journal of Parallel and Distributed Computing , vol.26 , Issue.1 , pp. 72-84
- Chatterjee, S.¹ Gilbert, J.² Long, F.³ Schreiber, R.⁴ Teng, S.⁵

38
- 0028499023
- Communication-free data allocation techniques for parallelizing compilers on multicomputers
- T.S. Chen, J.P. Sheu, Communication-free data allocation techniques for parallelizing compilers on multicomputers, IEEE Transactions on Parallel and Distributed Systems 5 (9) (1994) 924-938.
- (1994) IEEE Transactions on Parallel and Distributed Systems , vol.5 , Issue.9 , pp. 924-938
- Chen, T.S.¹ Sheu, J.P.²

39
- 0031594025
- Memory dependence prediction using store sets
- Barcelona, Spain, July
- G. Chrysos, J. Emer, Memory dependence prediction using store sets, in: Proceedings of the ACM/ IEEE 25th International Symposium on Computer Architecture, Barcelona, Spain, July 1998, pp. 142-154.
- (1998) Proceedings of the ACM/ IEEE 25th International Symposium on Computer Architecture , pp. 142-154
- Chrysos, G.¹ Emer, J.²

40
- 0343668688
- Evolutionary compilation to long instruction superscalar microarchitectures for exploiting parallelism at all levels
- T.M. Conte, Evolutionary compilation to long instruction superscalar microarchitectures for exploiting parallelism at all levels, in: ASPLOS Wild and Crazy Idea Session, 1998.
- (1998) ASPLOS Wild and Crazy Idea Session
- Conte, T.M.¹

41
- 84976666650
- Efficient computation of flow insensitive interprocedural summary information
- June
- K. Cooper, K. Kennedy, Efficient computation of flow insensitive interprocedural summary information, in: Proceedings of the ACM SIGPLAN'84 Symposium on Compiler Construction, June 1984.
- (1984) Proceedings of the ACM SIGPLAN'84 Symposium on Compiler Construction
- Cooper, K.¹ Kennedy, K.²

42
- 0022793229
- The impact of interprocedural analysis and optimization in the Rn programming environment
- K.D. Cooper, K. Kennedy, L. Torczon, The impact of interprocedural analysis and optimization in the Rn programming environment, ACM Transactions on Programming Languages and Systems 8 (4) (1986) 491-523.
- (1986) ACM Transactions on Programming Languages and Systems , vol.8 , Issue.4 , pp. 491-523
- Cooper, K.D.¹ Kennedy, K.² Torczon, L.³

43
- 84958956033
- Non-local instruction scheduling with limited code growth
- K. Cooper, P. Schielke, Non-local instruction scheduling with limited code growth, in: Proceedings of Languages, Compilers and Tools for Embedded Systems, 1998, pp. 193-207.
- (1998) Proceedings of Languages, Compilers and Tools for Embedded Systems , pp. 193-207
- Cooper, K.¹ Schielke, P.²

44
- 0027710762
- Parallel programming in Split-C
- D. Culler, A. Dusseau, S. Goldstein, A. Krishnamurthy, S. Lumetta, T. von Eicken, K. Yelick, Parallel programming in Split-C, in: Proceedings of Supercomputing'93, 1993.
- (1993) Proceedings of Supercomputing'93
- Culler, D.¹ Dusseau, A.² Goldstein, S.³ Krishnamurthy, A.⁴ Lumetta, S.⁵ Von Eicken, T.⁶ Yelick, K.⁷

45
- 49549162745
- Fourier-Motzkin Elimination and its Dual
- G. Dantzig, B. Eaves, Fourier-Motzkin Elimination and its Dual, Journal of Combinatorial Theory (A) 14 (1973).
- (1973) Journal of Combinatorial Theory , vol.14 , Issue.A
- Dantzig, G.¹ Eaves, B.²

46
- 0022920260
- The KAP/S-1: An advanced source-to-source vectorizer for the S-1 Mark IIa Supercomputer
- St. Charles, Illinois, August
- J. Davies, C. Huson, T. Macke, B. Leasure, M. Wolfe, The KAP/S-1: an advanced source-to-source vectorizer for the S-1 Mark IIa Supercomputer, in: Proceedings of the 1986 International Conference on Parallel Processing, St. Charles, Illinois, August 1986, pp. 833-835.
- (1986) Proceedings of the 1986 International Conference on Parallel Processing , pp. 833-835
- Davies, J.¹ Huson, C.² Macke, T.³ Leasure, B.⁴ Wolfe, M.⁵

47
- 0342798381
- How to optimize residual communications?
- M. Dion, C. Randriamaro, Y. Robert, How to optimize residual communications? (special issue), Journal of Parallel and Distributed Computing on Compilation Techniques for Distributed Memory Systems 38 (1996).
- (1996) Journal of Parallel and Distributed Computing on Compilation Techniques for Distributed Memory Systems , vol.38 , Issue.SPEC. ISSUE
- Dion, M.¹ Randriamaro, C.² Robert, Y.³

48
- 0032123777
- The IA-64 architecture at work
- C. Dulong, The IA-64 architecture at work, IEEE Computer (1998) 24-32.
- (1998) IEEE Computer , pp. 24-32
- Dulong, C.¹

49
- 0030645966
- DAISY: Dynamic compilation for 100% architectural compatibility
- Denver, Colorado
- K. Ebcioglu, E. Altman, DAISY: dynamic compilation for 100% architectural compatibility, in: Proceedings of the International Symposium on Computer Architecture, Denver, Colorado, 1997, pp. 26-37.
- (1997) Proceedings of the International Symposium on Computer Architecture , pp. 26-37
- Ebcioglu, K.¹ Altman, E.²

50
- 38149020846
- A report on sisal language project
- J.T. Feo, D.C. Cann, R.R. Oldehoeft, A report on sisal language project, Journal of Parallel and Distributed Computing 10 (4) (1990) 349-366.
- (1990) Journal of Parallel and Distributed Computing , vol.10 , Issue.4 , pp. 349-366
- Feo, J.T.¹ Cann, D.C.² Oldehoeft, R.R.³

51
- 0023385308
- The program dependence graph and its use in optimization
- J. Ferrante, K. Ottenstein, J. Warren, The program dependence graph and its use in optimization, ACM Transactions on Programming Languages and Systems 9 (3) (1987) 319-349.
- (1987) ACM Transactions on Programming Languages and Systems , vol.9 , Issue.3 , pp. 319-349
- Ferrante, J.¹ Ottenstein, K.² Warren, J.³

52
- 0019596071
- Trace scheduling: A technique for global microcode compaction
- J.A. Fisher, Trace scheduling: a technique for global microcode compaction, IEEE Transactions on Computers 30 (7) (1981) 478-490.
- (1981) IEEE Transactions on Computers , vol.30 , Issue.7 , pp. 478-490
- Fisher, J.A.¹

53
- 0031237815
- Walk-time techniques: Catalyst for architectural change
- J.A. Fisher, Walk-time techniques: catalyst for architectural change, IEEE Computer 30 (9) (1997) 40-42.
- (1997) IEEE Computer , vol.30 , Issue.9 , pp. 40-42
- Fisher, J.A.¹

54
- 0028461905
- Avoidance and suppression of compensation code in a trace scheduling compiler
- S. Freudenberger, T. Gross, P.G. Lowney, Avoidance and suppression of compensation code in a trace scheduling compiler, ACM Transactions on Programming Languages and Systems 16 (4) (1994) 1156-1214.
- (1994) ACM Transactions on Programming Languages and Systems , vol.16 , Issue.4 , pp. 1156-1214
- Freudenberger, S.¹ Gross, T.² Lowney, P.G.³

55
- 0032312214
- Putting the fill unit to work: Dynamic optimizations for trace cache microprocessors
- D.H. Friendly, S.J. Patel, Y.N. Patt, Putting the fill unit to work: dynamic optimizations for trace cache microprocessors, in: Proceedings of the 31st Annual ACM/IEEE Symposium on Microarchitecture, 1998, pp. 173-181.
- (1998) Proceedings of the 31st Annual ACM/IEEE Symposium on Microarchitecture , pp. 173-181
- Friendly, D.H.¹ Patel, S.J.² Patt, Y.N.³

56
- 0031611718
- Value speculation scheduling for high performance processors
- C. Fu, M.D. Jennings, S.Y. Larin, T.M. Conte, Value speculation scheduling for high performance processors, in: Proceedings of the International Conference on Architectural Support for Programming Languages and Operating Systems, 1998, pp. 262-271.
- (1998) Proceedings of the International Conference on Architectural Support for Programming Languages and Operating Systems , pp. 262-271
- Fu, C.¹ Jennings, M.D.² Larin, S.Y.³ Conte, T.M.⁴

57
- 0343668683
- The use of BLAS3 in linear algebra on a parallel processor with a hierarchical memory
- Center for Supercomputing Res. and Dev., University of Illinois, October
- K. Gallivan, W. Jalby, U. Meier, The use of BLAS3 in linear algebra on a parallel processor with a hierarchical memory, Technical Report CSRD Rpt. No. 610, Center for Supercomputing Res. and Dev., University of Illinois, October 1986.
- (1986) Technical Report CSRD Rpt. No. 610 , vol.610
- Gallivan, K.¹ Jalby, W.² Meier, U.³

58
- 84957095558
- A novel approach towards automatic data distribution
- Houston, TX, April
- J. Garcia, E. Ayguadé, J. Labarta, A novel approach towards automatic data distribution, in: Proceedings of the Workshop on Automatic Data Layout and Performance Prediction, Houston, TX, April 1995.
- (1995) Proceedings of the Workshop on Automatic Data Layout and Performance Prediction
- Garcia, J.¹ Ayguadé, E.² Labarta, J.³

59
- 0342798380
- SUPERB: Experiences and future research
- North-Holland, Amsterdam, The Netherlands
- M. Gerndt, H. Zima, SUPERB: Experiences and future research, in: Proceedings of the Workshop on Languages, Compilers, and Run-Time Environments for Distributed Memory Machines, North-Holland, Amsterdam, The Netherlands, 1992.
- (1992) Proceedings of the Workshop on Languages, Compilers, and Run-Time Environments for Distributed Memory Machines
- Gerndt, M.¹ Zima, H.²

60
- 0026829045
- The HTG: An intermediate representation for programs based on control and data dependences
- M. Girkar, C. Polychronopoulos, The HTG: An intermediate representation for programs based on control and data dependences, IEEE Transactions on Parallel and Distributed Systems 3 (2) (1992).
- (1992) IEEE Transactions on Parallel and Distributed Systems , vol.3 , Issue.2
- Girkar, M.¹ Polychronopoulos, C.²

61
- 84976790479
- Practical dependence testing
- Toronto, Canada
- G. Golf, K. Kennedy, C.W. Tseng, Practical dependence testing, in: Proceedings of the SIGPLAN'91 Conference on Programming Language Design and Implementation, Toronto, Canada, 1991.
- (1991) Proceedings of the SIGPLAN'91 Conference on Programming Language Design and Implementation
- Golf, G.¹ Kennedy, K.² Tseng, C.W.³

62
- 84947738468
- Compilation techniques for optimizing communication in distributed-memory systems
- St. Charles, IL, August
- C. Gong, R. Gupta, R. Melhem, Compilation techniques for optimizing communication in distributed-memory systems, in: Proceedings 1993 International Conference on Parallel Processing, St. Charles, IL, August 1993.
- (1993) Proceedings 1993 International Conference on Parallel Processing
- Gong, C.¹ Gupta, R.² Melhem, R.³

63
- 84968879727
- Code scheduling and register allocation in large basic blocks
- J.R. Goodman, W-C. Hsu, Code scheduling and register allocation in large basic blocks, in: Proceedings of ACM Supercomputing Conference, 1988, pp. 442-452.
- (1988) Proceedings of ACM Supercomputing Conference , pp. 442-452
- Goodman, J.R.¹ Hsu, W.-C.²

64
- 85027593837
- A methodology for high-level synthesis of communication for multicomputers
- Washington, DC
- M. Gupta, P. Banerjee, A methodology for high-level synthesis of communication for multicomputers, in: Proceedings of the ACM International Conference on Supercomputing, Washington, DC, 1992.
- (1992) Proceedings of the ACM International Conference on Supercomputing
- Gupta, M.¹ Banerjee, P.²

65
- 0030075551
- On compiling array expressions for efficient execution on distributed-memory machines
- S.K.S. Gupta, S.D. Kaushik, C.-H. Huang, P. Sadayappan, On compiling array expressions for efficient execution on distributed-memory machines, Journal of Parallel and Distributed Computing 32 (2) (1996) 155-172.
- (1996) Journal of Parallel and Distributed Computing , vol.32 , Issue.2 , pp. 155-172
- Gupta, S.K.S.¹ Kaushik, S.D.² Huang, C.-H.³ Sadayappan, P.⁴

66
- 0025413768
- Region scheduling: An approach for detecting and redistributing parallelism
- R. Gupta, M.L. Soffa, Region scheduling: an approach for detecting and redistributing parallelism, IEEE Transactions on Software Engineering 16 (4) (1990) 421-431.
- (1990) IEEE Transactions on Software Engineering , vol.16 , Issue.4 , pp. 421-431
- Gupta, R.¹ Soffa, M.L.²

67
- 0031364964
- Code optimization as a side effect of instruction scheduling
- Bangalore, India
- R. Gupta, Code optimization as a side effect of instruction scheduling, in: Proceedings of the International Conference on High Performance Computing, Bangalore, India, 1997, pp. 370-377.
- (1997) Proceedings of the International Conference on High Performance Computing , pp. 370-377
- Gupta, R.¹

68
- 84949185314
- Register pressure sensitive redundancy elimination
- Proceedings of the International Conference on Compiler Construction, Springer, Amsterdam, Netherlands
- R. Gupta, R. Bodik, Register pressure sensitive redundancy elimination, in: Proceedings of the International Conference on Compiler Construction, Lecture Notes in Computer Science, vol. 1575, Springer, Amsterdam, Netherlands, pp. 107-121.
- Lecture Notes in Computer Science , vol.1575 , pp. 107-121
- Gupta, R.¹ Bodik, R.²

69
- 0031372826
- Resource-sensitive profile-directed data flow analysis for code optimization
- Research Triangle Park, North Carolina
- R. Gupta, D. Berson, J.Z. Fang, Resource-sensitive profile-directed data flow analysis for code optimization, in: Proceedings of the 30th Annual IEEE/ACM International Symposium on Microarchitecture, Research Triangle Park, North Carolina, 1997, pp. 358-368.
- (1997) Proceedings of the 30th Annual IEEE/ACM International Symposium on Microarchitecture , pp. 358-368
- Gupta, R.¹ Berson, D.² Fang, J.Z.³

70
- 0031627691
- Path profile guided partial redundancy elimination using speculation
- Chicago, Illinois
- R. Gupta, D. Berson, J.Z. Fang, Path profile guided partial redundancy elimination using speculation, in: Proceedings of the IEEE International Conference on Computer Languages, Chicago, Illinois, 1998, pp. 230-239.
- (1998) Proceedings of the IEEE International Conference on Computer Languages , pp. 230-239
- Gupta, R.¹ Berson, D.² Fang, J.Z.³

71
- 0031372538
- Path profile guided partial dead code elimination using predication
- San Francisco, California
- R. Gupta, D. Berson, J.Z. Fang, Path profile guided partial dead code elimination using predication, in: Proceedings of the International Conference on Parallel Architectures and Compilation Techniques, San Francisco, California, 1997, pp. 102-115.
- (1997) Proceedings of the International Conference on Parallel Architectures and Compilation Techniques , pp. 102-115
- Gupta, R.¹ Berson, D.² Fang, J.Z.³

72
- 0030190872
- Symbolic analysis for parallelizing compilers
- M. Haghighat, C. Polychronopoulos, Symbolic analysis for parallelizing compilers, ACM Transactions on Programming Languages and Systems 18 (4) (1996).
- (1996) ACM Transactions on Programming Languages and Systems , vol.18 , Issue.4
- Haghighat, M.¹ Polychronopoulos, C.²

73
- 0029545359
- Region-based compilation: An introduction and motivation
- R.E. Hank, W-M.W. Hwu, B.R. Rau, Region-based compilation: an introduction and motivation, in: Proceedings of the 28th Annual IEEE/ACM International Symposium on Microarchitecture, 1995.
- (1995) Proceedings of the 28th Annual IEEE/ACM International Symposium on Microarchitecture
- Hank, R.E.¹ Hwu, W.-M.W.² Rau, B.R.³

74
- 84976702373
- Give-n-take - A balanced code placement framework
- Orlando, Florida, June
- R.v. Hanxleden, K. Kennedy, Give-n-take - a balanced code placement framework, in: Proceedings of the ACM SIGPLAN '94 Conference on Programming Language Design and Implementation, Orlando, Florida, June 1994.
- (1994) Proceedings of the ACM SIGPLAN '94 Conference on Programming Language Design and Implementation
- Hanxleden, R.V.¹ Kennedy, K.²

75
- 0043058118
- Compiler analysis for irregular problems in Fortran D
- New Haven, CT, August
- R.v. Hanxleden, K. Kennedy, C. Koelbel, R. Das, J. Saltz, Compiler analysis for irregular problems in Fortran D, in: Proceedings Fifth Workshop on Languages and Compilers for Parallel Computing, New Haven, CT, August 1992.
- (1992) Proceedings Fifth Workshop on Languages and Compilers for Parallel Computing
- Hanxleden, R.V.¹ Kennedy, K.² Koelbel, C.³ Das, R.⁴ Saltz, J.⁵

76
- 0003609682
- The MIT Press, Cambridge, MA
- P. Hatcher, M. Quinn, Data-Parallel Programming on MIMD Com;uters, The MIT Press, Cambridge, MA, 1991.
- (1991) Data-Parallel Programming on MIMD Com;uters
- Hatcher, P.¹ Quinn, M.²

77
- 0004302191
- Morgan Kaufmann, Los Altos, CA
- J.L. Hennessy, D.A. Patterson, Computer Architecture: A Quantitative Approach, Morgan Kaufmann, Los Altos, CA, 1990.
- (1990) Computer Architecture: A Quantitative Approach
- Hennessy, J.L.¹ Patterson, D.A.²

78
- 0003565855
- High Performance Fortran Forum, High Performance Fortran language specification, version 2.0
- Center for Research on Parallel Computation, Rice University, Houston, TX, January
- High Performance Fortran Forum, High Performance Fortran language specification, version 2.0. Technical Report CRPC-TR92225, Center for Research on Parallel Computation, Rice University, Houston, TX, January 1997.
- (1997) Technical Report CRPC-TR92225

79
- 84976706957
- Interprocedural compilation of fortran D for MIMD distributed-memory machines
- Minneapolis, MN
- M.W. Hall, S. Hiranandani, K. Kennedy, C. Tseng, Interprocedural compilation of fortran D for MIMD distributed-memory machines, in: Proceedings of Supercomputing'92, Minneapolis, MN, 1992, pp. 522-534.
- (1992) Proceedings of Supercomputing'92 , pp. 522-534
- Hall, M.W.¹ Hiranandani, S.² Kennedy, K.³ Tseng, C.⁴

80
- 84976813879
- Compiling Fortran D for MIMD distributed-memory machines
- S. Hiranandani, K. Kennedy, C.W. Tseng, Compiling Fortran D for MIMD distributed-memory machines, Communications of the ACM 35 (8) (1992) 66-80.
- (1992) Communications of the ACM , vol.35 , Issue.8 , pp. 66-80
- Hiranandani, S.¹ Kennedy, K.² Tseng, C.W.³

81
- 0027711187
- Preliminary experiences with the fortran D compiler
- Portland, OR, November
- S. Hiranandani, K. Kennedy, C.-W. Tseng, Preliminary experiences with the fortran D compiler, in: Proceedings of Supercomputing'93, Portland, OR, November 1993.
- (1993) Proceedings of Supercomputing'93
- Hiranandani, S.¹ Kennedy, K.² Tseng, C.-W.³

82
- 0026891897
- Partitioning and labeling of loops by unimodular transformations
- E. D'Hollander, Partitioning and labeling of loops by unimodular transformations, IEEE Transactions on Parallel and Distributed Systems 3 (4) (1992) 465-476.
- (1992) IEEE Transactions on Parallel and Distributed Systems , vol.3 , Issue.4 , pp. 465-476
- D'Hollander, E.¹

83
- 38249000489
- Communication-free hyperplane partitioning of nested loops
- C.-H. Huang, P. Sadayappan, Communication-free hyperplane partitioning of nested loops, Journal of Parallel and Distributed Computing 19 (1993) 90-102.
- (1993) Journal of Parallel and Distributed Computing , vol.19 , pp. 90-102
- Huang, C.-H.¹ Sadayappan, P.²

84
- 0342798375
- Compiling parallel loops for high performance computers -partitioning
- Kluwer Academic Publishers, Boston, MA
- D.E. Hudak, S.G. Abraham, Compiling parallel loops for high performance computers -partitioning, in: Data Assignment and Remapping, Kluwer Academic Publishers, Boston, MA, 1993.
- (1993) Data Assignment and Remapping
- Hudak, D.E.¹ Abraham, S.G.²

85
- 0342363503
- Technology outlook: Introduction to predicated execution
- W-m. Hwu, Technology outlook: introduction to predicated execution, IEEE Computer 31 (1) (1998) 49-50.
- (1998) IEEE Computer , vol.31 , Issue.1 , pp. 49-50
- Hwu, W.-M.¹

86
- 0027595384
- The superblock: An effective technique for VLIW and superscalar compilation
- W-M. Hwu, S.A. Mahlke, W.Y. Chen, P.P. Chang, N.J. Warter, R.A. Bringmann, R.G. Ouellette, R.E. Hank, T. Kiyohara, G.E. Haab, J.G. Holm, D.M. Lavery, The superblock: an effective technique for VLIW and superscalar compilation, Journal of Supercomputing A (1993) 229-248.
- (1993) Journal of Supercomputing A , pp. 229-248
- Hwu, W.-M.¹ Mahlke, S.A.² Chen, W.Y.³ Chang, P.P.⁴ Warter, N.J.⁵ Bringmann, R.A.⁶ Ouellette, R.G.⁷ Hank, R.E.⁸ Kiyohara, T.⁹ Haab, G.E.¹⁰ Holm, J.G.¹¹ Lavery, D.M.¹²

87
- 85031523618
- Document SC23-0526-01
- IBM, Engineering and Scientific Subroutine Library (ESSL), Guide and Reference, Document SC23-0526-01, 1994.
- (1994) Guide and Reference

88
- 85026986651
- Supernode partitioning
- F. Irigoin, R. Triolet, Supernode partitioning, in: Proceedings of the 15th ACM Symposium on Principles of Programming Languages, 1988.
- (1988) Proceedings of the 15th ACM Symposium on Principles of Programming Languages
- Irigoin, F.¹ Triolet, R.²

89
- 0008434041
- Multiple threads template library
- Real World Computing Partnership, September
- Y. Ishikawa, Multiple threads template library, Technical Report TR-96-012, Real World Computing Partnership, September 1996.
- (1996) Technical Report TR-96-012
- Ishikawa, Y.¹

90
- 84976816559
- Circular scheduling: A new technique to perform software pipelining
- Toronto, Canada
- S. Jain, Circular scheduling: A new technique to perform software pipelining, in: Proceedings of the ACM SIGPLAN Conference on Programming Language Design and Implementation, Toronto, Canada, 1991, pp. 219-228.
- (1991) Proceedings of the ACM SIGPLAN Conference on Programming Language Design and Implementation , pp. 219-228
- Jain, S.¹

91
- 85031535123
- Code motion for generating compact code on embedded DSPs'
- Washington, DC, 4-6 December'98
- V. Jain, S. Pande, Code motion for generating compact code on embedded DSPs', 1998 Workshop on Compiler and architecture support for embedded systems, Washington, DC, 4-6 December'98. Available under publications link at http://www.ececs.uc.edu/ compiler.
- 1998 Workshop on Compiler and Architecture Support for Embedded Systems
- Jain, V.¹ Pande, S.²

92
- 0342798373
- HPC++: Experiments with the parallel standard template library
- Indiana University, Department of Computer Science, December
- E. Johnson, D. Gannon, HPC++: Experiments with the parallel standard template library, Technical Report TR-96-51, Indiana University, Department of Computer Science, December 1996.
- (1996) Technical Report TR-96-51
- Johnson, E.¹ Gannon, D.²

93
- 0030379259
- Analysis techniques for predicated code
- R. Johnson, M. Schlansker, Analysis techniques for predicated code, in: Proceedings of the 29th Annual International Symposium on Microarchitecture, 1996, pp. 100-113.
- (1996) Proceedings of the 29th Annual International Symposium on Microarchitecture , pp. 100-113
- Johnson, R.¹ Schlansker, M.²

94
- 0343668673
- Minimizing data and synchronization costs in one-way communication
- M. Kandemir, N. Shenoy, P. Banerjee, J. Ramanujam, A. Choudhary, Minimizing data and synchronization costs in one-way communication, in: International Conference on Parallel Processing, 1998, pp. 180-188.
- (1998) International Conference on Parallel Processing , pp. 180-188
- Kandemir, M.¹ Shenoy, N.² Banerjee, P.³ Ramanujam, J.⁴ Choudhary, A.⁵

95
- 85027602455
- Optimizing for parallelism and data locality
- July
- K. Kennedy, K.S. McKinley, Optimizing for parallelism and data locality, in: Proceedings of the ACM 1992 International Conference on Supercomputing, July 1992.
- (1992) Proceedings of the ACM 1992 International Conference on Supercomputing
- Kennedy, K.¹ McKinley, K.S.²

96
- 0029192689
- A linear-time algorithm for computing the memory access sequence in data-parallel programs
- Santa Barbara, CA
- K. Kennedy, N. Nedeljkovic, A. Sethi, A linear-time algorithm for computing the memory access sequence in data-parallel programs, in: Proceedings of Fifth ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, Santa Barbara, CA, 1995, pp. 102-111.
- (1995) Proceedings of Fifth ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming , pp. 102-111
- Kennedy, K.¹ Nedeljkovic, N.² Sethi, A.³

97
- 0029231874
- Combining dependence and data-flow analyses to optimize communication
- Santa Barbara, CA
- K. Kennedy, N. Nedeljkovic, Combining dependence and data-flow analyses to optimize communication, in: Proceedings Ninth International Parallel Processing Symposium, Santa Barbara, CA, 1995.
- (1995) Proceedings Ninth International Parallel Processing Symposium
- Kennedy, K.¹ Nedeljkovic, N.²

98
- 0042805977
- Resource-based communication placement analysis
- San Jose, CA, August
- K. Kennedy, A. Sethi, Resource-based communication placement analysis, in: Proceedings Ninth Workshop on Languages and Compilers for Parallel Computing, San Jose, CA, August 1996.
- (1996) Proceedings Ninth Workshop on Languages and Compilers for Parallel Computing
- Kennedy, K.¹ Sethi, A.²

99
- 0027677302
- Optimization techniques for SIMD fortran compilers
- K. Knobe, J. Lukas, M. Weiss, Optimization techniques for SIMD fortran compilers, Concurrency: Practice and Experience 5 (7) (1993) 527-552.
- (1993) Concurrency: Practice and Experience , vol.5 , Issue.7 , pp. 527-552
- Knobe, K.¹ Lukas, J.² Weiss, M.³

100
- 0031705047
- Array SSA form and its use in parallelization
- San Diego, California, January
- K. Knobe, V. Sarkar, Array SSA form and its use in parallelization, in: Proceedings of the 25th ACM Symposium on Principles of Programming Languages, San Diego, California, January 1998.
- (1998) Proceedings of the 25th ACM Symposium on Principles of Programming Languages
- Knobe, K.¹ Sarkar, V.²

101
- 0030685988
- Data centric multi-level blocking
- I. Kodukula, N. Ahmed, K. Pingali, Data centric multi-level blocking, in: Proceedings of the SIGPLAN ACM Conference on Programming Language Design and Implementation, 1997.
- (1997) Proceedings of the SIGPLAN ACM Conference on Programming Language Design and Implementation
- Kodukula, I.¹ Ahmed, N.² Pingali, K.³

102
- 0026294380
- Compile-time Generation of Communication for scientific programs
- Albuquerque, NM
- C. Koelbel, Compile-time Generation of Communication for scientific programs, in: Proceedings of Supercomputing '91, Albuquerque, NM, 1991, pp. 101-110.
- (1991) Proceedings of Supercomputing '91 , pp. 101-110
- Koelbel, C.¹

103
- 0026190245
- The I-Test: An improved dependence test for automatic parallelization and vectorization
- IEEE Transactions on Parallel and Distributed Systems
- X. Kong, D. Klappholz, K. Psarris, The I-Test: An improved dependence test for automatic parallelization and vectorization, IEEE Transactions on Parallel and Distributed Systems, Special Issue on Parallel Languages and Compilers 2 (3) (1991).
- (1991) Parallel Languages and Compilers , vol.2 , Issue.3 SPEC. ISSUE
- Kong, X.¹ Klappholz, D.² Psarris, K.³

104
- 0003657590
- Addison-Wesley, Reading, MA
- D. Knuth, The Art of Computer Programming, vol. 2, Seminumerical Algorithms, Addison-Wesley, Reading, MA, 1981.
- (1981) The Art of Computer Programming, Vol. 2, Seminumerical Algorithms , vol.2
- Knuth, D.¹

105
- 85033186207
- Document #9603001, Champaign, IL
- Kuck and Associates, Inc., KAP for IBM fortran, user's guide version 3.3, Document #9603001, Champaign, IL, 1996.
- (1996) KAP for IBM Fortran, User's Guide Version 3.3

106
- 0016026944
- The parallel execution of DO loops
- L. Lamport, The parallel execution of DO loops, Communications of the ACM 17 (2) (1974) 83-93.
- (1974) Communications of the ACM , vol.17 , Issue.2 , pp. 83-93
- Lamport, L.¹

107
- 0003327314
- Concurrent static single assignment form and constant propagation for explicitly parallel Programs
- Proceedings of the 10th International Workshop on Languages and Compilers for Parallel Computing, Springer, Minneapolis, MN, August
- J. Lee, S.P. Midkiff, D.A. Padua, Concurrent static single assignment form and constant propagation for explicitly parallel Programs, in: Proceedings of the 10th International Workshop on Languages and Compilers for Parallel Computing, Lecture Notes in Computer Science, Springer, Minneapolis, MN, August 1997.
- (1997) Lecture Notes in Computer Science
- Lee, J.¹ Midkiff, S.P.² Padua, D.A.³

108
- 0026187669
- Compiling communication-efficient programs for massively parallel machines
- J. Li, M. Chen, Compiling communication-efficient programs for massively parallel machines, IEEE Transactions on Parallel and Distributed Systems 2 (3) (1991) 361-376.
- (1991) IEEE Transactions on Parallel and Distributed Systems , vol.2 , Issue.3 , pp. 361-376
- Li, J.¹ Chen, M.²

109
- 0343232962
- Efficient interprocedural analysis for program parallelization and restructuring
- New Haven, CT, July
- Z. Li, P. Yew, Efficient interprocedural analysis for program parallelization and restructuring, in: Proceedings of the ACM/SIGPLAN Symposium on Parallel Programming, New Haven, CT, July 1998.
- (1998) Proceedings of the ACM/SIGPLAN Symposium on Parallel Programming
- Li, Z.¹ Yew, P.²

110
- 0025229934
- An efficient data dependence analysis for parallelizing compilers
- Z. Li, P. Yew, C. Zhu, An efficient data dependence analysis for parallelizing compilers, IEEE Transactions on Parallel and Distributed Systems 1 (1) (1990).
- (1990) IEEE Transactions on Parallel and Distributed Systems , vol.1 , Issue.1
- Li, Z.¹ Yew, P.² Zhu, C.³

111
- 0030149574
- Storage assignment to decrease code size
- S. Liao et al., Storage assignment to decrease code size, ACM Transactions on Programming Languages and Systems 18 (3) (1996) 235-253.
- (1996) ACM Transactions on Programming Languages and Systems , vol.18 , Issue.3 , pp. 235-253
- Liao, S.¹

112
- 0002909225
- Instruction selection using binate covering for code size optimization
- S. Liao et al., Instruction selection using binate covering for code size optimization, in: Proceedings of the 1995 International Conference on Computer-Aided Design, 1995.
- (1995) Proceedings of the 1995 International Conference on Computer-Aided Design
- Liao, S.¹

113
- 0040348165
- Communication-free parallelization via affine transformations
- August
- A.W. Lim, M.S. Lam, Communication-free parallelization via affine transformations, in: Proceedings of the Seventh Workshop on Languages and Compilers for Parallel Computing, August 1994.
- (1994) Proceedings of the Seventh Workshop on Languages and Compilers for Parallel Computing
- Lim, A.W.¹ Lam, M.S.²

114
- 0030265013
- Value locality and load value prediction
- Cambridge, MA
- M.H. Lipasti, C.B. Wilkerson, J.P. Shen, Value locality and load value prediction, in: Proceedings of the Seventh International Conference on Architectural Support for Programming Languages and Operating Systems, Cambridge, MA, 1996, pp. 138-149.
- (1996) Proceedings of the Seventh International Conference on Architectural Support for Programming Languages and Operating Systems , pp. 138-149
- Lipasti, M.H.¹ Wilkerson, C.B.² Shen, J.P.³

115
- 0011598990
- Addressing in Cray research's MPP Fortran
- Vienna, Austria
- T. MacDonald, D. Pase, A. Meltzer, Addressing in Cray research's MPP Fortran, in: Proceedings of the Third Workshop on Compilers for Parallel Computers, Vienna, Austria, 1992, pp. 161-172.
- (1992) Proceedings of the Third Workshop on Compilers for Parallel Computers , pp. 161-172
- MacDonald, T.¹ Pase, D.² Meltzer, A.³

116
- 0030190854
- Improving data locality with loop transformations
- K.S. McKinley, S. Carr, C-W. Tsseng, Improving data locality with loop transformations, ACM Transactions on Programming Languages and Systems 18 (1996) 423-453.
- (1996) ACM Transactions on Programming Languages and Systems , vol.18 , pp. 423-453
- McKinley, K.S.¹ Carr, S.² Tsseng, C.-W.³

117
- 0002394225
- Local iteration set computation for block-cyclic distributions
- S. Midkiff, Local iteration set computation for block-cyclic distributions, Proceedings International Conference on Parallel Processing II (1995) 77-84.
- (1995) Proceedings International Conference on Parallel Processing , vol.2 , pp. 77-84
- Midkiff, S.¹

118
- 0030717767
- Dynamic speculation and synchronization of data dependences
- A.I. Moshovos, S.E. Breach, T.N. Vijaykumar, G.S. Sohi, Dynamic speculation and synchronization of data dependences, in: Proceedings of the 24th International Symposium on Computer Architecture, 1997.
- (1997) Proceedings of the 24th International Symposium on Computer Architecture
- Moshovos, A.I.¹ Breach, S.E.² Vijaykumar, T.N.³ Sohi, G.S.⁴

119
- 0030674213
- Exploiting instruction level parallelism in processors by caching scheduled groups
- Denver, Colorado
- R. Nair, M.E. Hopkins, Exploiting instruction level parallelism in processors by caching scheduled groups, in: Proceedings of the International Symposium on Computer Architecture, Denver, Colorado, 1997, pp. 13-25.
- (1997) Proceedings of the International Symposium on Computer Architecture , pp. 13-25
- Nair, R.¹ Hopkins, M.E.²

120
- 0032672879
- Value prediction in VLIW machines
- Atlanta, Georgia
- T. Nakra, R. Gupta, M.L. Soffa, Value prediction in VLIW machines, in: Proceedings of the ACM/ IEEE 26th International Symposium on Computer Architecture, Atlanta, Georgia, 1999.
- (1999) Proceedings of the ACM/ IEEE 26th International Symposium on Computer Architecture
- Nakra, T.¹ Gupta, R.² Soffa, M.L.³

121
- 0027767071
- A scheduler-sensitive global register allocator
- Portland, Oregon
- C. Norris, L.L. Pollock, A scheduler-sensitive global register allocator, in: Proceedings of Supercomputing'93, Portland, Oregon, 1993, pp. 804-813.
- (1993) Proceedings of Supercomputing'93 , pp. 804-813
- Norris, C.¹ Pollock, L.L.²

122
- 28444453374
- Superscalar execution with direct data forwarding
- Paris, France
- S. Onder, R. Gupta, Superscalar execution with direct data forwarding, in: Proceedings of the International Conference on Parallel Architectures and Compilation Techniques, Paris, France, 1998, pp. 130-135.
- (1998) Proceedings of the International Conference on Parallel Architectures and Compilation Techniques , pp. 130-135
- Onder, S.¹ Gupta, R.²

123
- 0003701628
- Stanford University Computer Systems Lab, Technical Report CSL-TR-97-715, February
- J. Oplinger, D. Heine, S-W. Liao, B.A. Nayfeh, M.S. Lam, K. Olukotun, Software and hardware for exploiting speculative parallelism with a multiprocessor, Stanford University Computer Systems Lab, Technical Report CSL-TR-97-715, February 1997.
- (1997) Software and Hardware for Exploiting Speculative Parallelism with a Multiprocessor
- Oplinger, J.¹ Heine, D.² Liao, S.-W.³ Nayfeh, B.A.⁴ Lam, M.S.⁵ Olukotun, K.⁶

124
- 0040822529
- Ph.D. Thesis, Department of Electrical and Computer Engineering, University of Ilinois, Urbana, IL, June
- D.J. Palermo, Compiler techniques for optimizing communication and data distribution for distributed-memory multicomputers, Ph.D. Thesis, Department of Electrical and Computer Engineering, University of Ilinois, Urbana, IL, June 1996.
- (1996) Compiler Techniques for Optimizing Communication and Data Distribution for Distributed-memory Multicomputers
- Palermo, D.J.¹

125
- 0030295493
- Dynamic data partitioning for distributed-memory multicomputers
- D.J. Palermo, E.W. Hodges IV, P. Banerjee, Dynamic data partitioning for distributed-memory multicomputers, Journal of Parallel and Distributed Computing 38 (2) (1996) 158-175.
- (1996) Journal of Parallel and Distributed Computing , vol.38 , Issue.2 , pp. 158-175
- Palermo, D.J.¹ Hodges E.W. IV² Banerjee, P.³

126
- 85031535440
- A computation + communication load balanced loop partitioning method for distributed memory systems
- to appear
- S. Pande, T. Bali, A computation + communication load balanced loop partitioning method for distributed memory systems, Journal of Parallel and Distributed Computing, to appear.
- Journal of Parallel and Distributed Computing
- Pande, S.¹ Bali, T.²

127
- 0342363494
- Compilation techniques for distributed memory systems: Guest editorial introduction
- S. Pande, D.P. Agrawal, Compilation techniques for distributed memory systems: Guest editorial introduction (special issue), Journal of Parallel and Distributed Computing on Compilation Techniques for Distributed Memory Systems 38 (1996) 107-113.
- (1996) Journal of Parallel and Distributed Computing on Compilation Techniques for Distributed Memory Systems , vol.38 , Issue.SPEC. ISSUE , pp. 107-113
- Pande, S.¹ Agrawal, D.P.²

128
- 0002524997
- A compile time partitioning method for DOALL loops on distributed memory systems
- IEEE Computer Society Press, Silver Spring, MD
- S. Pande, A compile time partitioning method for DOALL loops on distributed memory systems, in: International Conference on Parallel Processing, vol. III, IEEE Computer Society Press, Silver Spring, MD, 1996, pp. 35-44.
- (1996) International Conference on Parallel Processing , vol.3 , pp. 35-44
- Pande, S.¹

129
- 0342798360
- Workshop on challenges compiling for scalable parallel systems
- October
- S. Pande, J. Ramanujam, Y. Robert, Workshop on challenges compiling for scalable parallel systems, in: Eighth IEEE Symposium on Parallel and Distributed Systems, October 1996.
- (1996) Eighth IEEE Symposium on Parallel and Distributed Systems
- Pande, S.¹ Ramanujam, J.² Robert, Y.³

130
- 84984058313
- Dependence flow graphs: An algebraic approach to program dependences
- K. Pingali, M. Beck, R. Johnson, M. Moudgill, P. Stodghill, Dependence flow graphs: An algebraic approach to program dependences, in: Proceedings of the ACM Symposium on Principles of Programming Languages, 1991.
- (1991) Proceedings of the ACM Symposium on Principles of Programming Languages
- Pingali, K.¹ Beck, M.² Johnson, R.³ Moudgill, M.⁴ Stodghill, P.⁵

131
- 0030076621
- The Banerjee-Wolfe and GCD tests on exact data dependence information
- K. Psarris, The Banerjee-Wolfe and GCD tests on exact data dependence information, Journal of Parallel and Distributed Computing 32 (2) (1996).
- (1996) Journal of Parallel and Distributed Computing , vol.32 , Issue.2
- Psarris, K.¹

132
- 33745192514
- On the accuracy of the Banerjee test, Journal of Parallel and Distributed Computing
- K. Psarris, D. Klappholz, X. Kong, On the accuracy of the Banerjee test, Journal of Parallel and Distributed Computing, Special Issue on Shared Memory Multiprocessors 12 (2) (1991).
- (1991) Shared Memory Multiprocessors , vol.12 , Issue.2 SPEC. ISSUE
- Psarris, K.¹ Klappholz, D.² Kong, X.³

133
- 79851482676
- An empirical study of the I test for exact data dependence
- St. Charles, IL, August
- K. Psarris, S. Pande, An empirical study of the I test for exact data dependence, in: Proceedings of the 1994 International Conference on Parallel Processing, St. Charles, IL, August 1994.
- (1994) Proceedings of the 1994 International Conference on Parallel Processing
- Psarris, K.¹ Pande, S.²

134
- 0027698554
- The direction vector I test
- K. Psarris, X. Kong, D. Klappholz, The direction vector I test, IEEE Transactions on Parallel and Distributed Systems 4 (11) (1993).
- (1993) IEEE Transactions on Parallel and Distributed Systems , vol.4 , Issue.11
- Psarris, K.¹ Kong, X.² Klappholz, D.³

135
- 84976676720
- A practical algorithm for exact array dependence analysis
- W. Pugh, A practical algorithm for exact array dependence analysis, Communications of the ACM 35 (8) (1992).
- (1992) Communications of the ACM , vol.35 , Issue.8
- Pugh, W.¹

136
- 0026231056
- Compile-time techniques for data distribution in distributed memory machines
- J. Ramanujam, P. Sadayappan, Compile-time techniques for data distribution in distributed memory machines, IEEE Transactions on Parallel and Distributed Systems 2 (4) (1991) 472-482.
- (1991) IEEE Transactions on Parallel and Distributed Systems , vol.2 , Issue.4 , pp. 472-482
- Ramanujam, J.¹ Sadayappan, P.²

137
- 84966585509
- Optimal task scheduling to minimize inter-tile latencies
- F. Rastello, A. Rao, S. Pande, Optimal task scheduling to minimize inter-tile latencies, in: International Conference on Parallel Processing, 1998, pp. 172-179.
- (1998) International Conference on Parallel Processing , pp. 172-179
- Rastello, F.¹ Rao, A.² Pande, S.³

138
- 0003015894
- Some scheduling techniques and an easily schedulable horizontal architecture for high performance scientific computing
- MA
- B.R. Rau, C.D. Glaser, Some scheduling techniques and an easily schedulable horizontal architecture for high performance scientific computing, in: Proceedings of the 14th Annual Microprogramming Workshop Chatham, MA, 1981, pp. 183-198.
- (1981) Proceedings of the 14th Annual Microprogramming Workshop Chatham , pp. 183-198
- Rau, B.R.¹ Glaser, C.D.²

139
- 84976823223
- The LRPD test: Speculative run-time parallelization of loops with privatization and reduction parallelization
- L. Rauchwerger, D. Padua, The LRPD test: Speculative run-time parallelization of loops with privatization and reduction parallelization, in: Proceedings of the ACM SIGPLAN Conference on Programming Language Design and Implementation, 1995.
- (1995) Proceedings of the ACM SIGPLAN Conference on Programming Language Design and Implementation
- Rauchwerger, L.¹ Padua, D.²

140
- 0030247075
- An implementation framework for HPF distributed arrays on message-passing parallel computer systems
- C. van Reeuwijk, H. Sips, W. Denissen, E. Paalvast, An implementation framework for HPF distributed arrays on message-passing parallel computer systems, IEEE Transactions on Parallel and Distributed Systems 7 (9) (1996) 897-914.
- (1996) IEEE Transactions on Parallel and Distributed Systems , vol.7 , Issue.9 , pp. 897-914
- Van Reeuwijk, C.¹ Sips, H.² Denissen, W.³ Paalvast, E.⁴

141
- 85031529201
- Document Number VA061, Santa Monica, CA
- Pacific-Sierra Research Corporation, VAST-2 for XL Fortran, User's Guide, Edition 1.2, Document Number VA061, Santa Monica, CA, 1994.
- (1994) VAST-2 for XL Fortran, User's Guide, Edition 1.2

142
- 17244382472
- Storage assignment optimizations to generate compact and efficient code on embedded DSPs
- Atlanta
- A. Rao, S. Pande, Storage assignment optimizations to generate compact and efficient code on embedded DSPs, in: ACM SIGPLAN Conference on Programming Language Design and Implementation, Atlanta, 1999.
- (1999) ACM SIGPLAN Conference on Programming Language Design and Implementation
- Rao, A.¹ Pande, S.²

143
- 17144391554
- Software pipelining showdown: Optimal vs. heuristic methods in a production compiler
- Philadelphia, Pennsylvania
- J. Ruttenberg, G.R. Gao, A. Stoutchinin, W. Lichtenstein, Software pipelining showdown: optimal vs. heuristic methods in a production compiler, in: Proceedings of the ACM SIGPLAN Conference on Programming Language Design and Implementation, Philadelphia, Pennsylvania, 1996, pp. 1-11.
- (1996) Proceedings of the ACM SIGPLAN Conference on Programming Language Design and Implementation , pp. 1-11
- Ruttenberg, J.¹ Gao, G.R.² Stoutchinin, A.³ Lichtenstein, W.⁴

144
- 0343668653
- Optimizing CM Fortran compiler for connection machine computers
- G. Sabot, Optimizing CM Fortran compiler for connection machine computers, Journal of Parallel and Distributed Computing 23 (1) (1994) 224-238.
- (1994) Journal of Parallel and Distributed Computing , vol.23 , Issue.1 , pp. 224-238
- Sabot, G.¹

145
- 0025416470
- Run-time scheduling and execution of loops on message passing machines
- J. Saltz, K. Crowley, R. Mirchandaney, H. Berryman, Run-time scheduling and execution of loops on message passing machines, Journal of Parallel and Distributed Computing 8 (4) (1990).
- (1990) Journal of Parallel and Distributed Computing , vol.8 , Issue.4
- Saltz, J.¹ Crowley, K.² Mirchandaney, R.³ Berryman, H.⁴

146
- 0003493010
- Pitman, London and The MIT Press, Cambridge, MA
- V. Sarkar, Partitioning and scheduling parallel programs for multiprocessors, Pitman, London and The MIT Press, Cambridge, MA, 1989.
- (1989) Partitioning and Scheduling Parallel Programs for Multiprocessors
- Sarkar, V.¹

147
- 0026213832
- Automatic partitioning of a program dependence graph into parallel tasks
- V. Sarkar, Automatic partitioning of a program dependence graph into parallel tasks, IBM Journal of Research and Development, 35 (5/6) (1991).
- (1991) IBM Journal of Research and Development , vol.35 , Issue.5-6
- Sarkar, V.¹

148
- 0026991030
- A general framework for iteration-reordering loop transformations
- San Francisco, California
- V. Sarkar, R. Thekkath, A general framework for iteration-reordering loop transformations, in: Proceedings of the ACM SIGPLAN Conference on Programming Language Design and Implementation, San Francisco, California, 1992, pp. 175-187.
- (1992) Proceedings of the ACM SIGPLAN Conference on Programming Language Design and Implementation , pp. 175-187
- Sarkar, V.¹ Thekkath, R.²

149
- 0031140581
- Automatic selection of high order transformations in the IBM XL Fortran Compilers
- V. Sarkar, Automatic selection of high order transformations in the IBM XL Fortran Compilers, IBM Journal of Research and Development 41 (3) (1997).
- (1997) IBM Journal of Research and Development , vol.41 , Issue.3
- Sarkar, V.¹

150
- 18844400618
- Analysis and optimization of explicitly parallel programs using the parallel program graph representation
- LNCS Springer, Minneapolis, MN
- V. Sarkar, Analysis and optimization of explicitly parallel programs using the parallel program graph representation, in: Proceedings of the 10th International Workshop on Languages and Compilers for Parallel Computing, LNCS Springer, Minneapolis, MN, 1997.
- (1997) Proceedings of the 10th International Workshop on Languages and Compilers for Parallel Computing
- Sarkar, V.¹

151
- 0022676201
- A vectorizing Fortran compiler
- R.G. Scarborough, H.G. Kolsky, A vectorizing Fortran compiler, IBM Journal of Research and Development 30 (2) (1986) 163-171.
- (1986) IBM Journal of Research and Development , vol.30 , Issue.2 , pp. 163-171
- Scarborough, R.G.¹ Kolsky, H.G.²

152
- 0029511540
- Critical path reduction for scalar programs
- M. Schlansker, V. Kathail, Critical path reduction for scalar programs, in: 28th Annual IEEE/ACM International Symposium on Microarchitecture, 1995.
- (1995) 28th Annual IEEE/ACM International Symposium on Microarchitecture
- Schlansker, M.¹ Kathail, V.²

153
- 84955559042
- Efficient Distribution Analysis via Graph Contraction
- Proceedings of the Eighth Workshop on Languages and Compilers for Parallel Computing, Columbus, OH, August Springer, Berlin
- T.J. Sheffler, R. Schreiber, J.R. Gilbert, W. Pugh, Efficient Distribution Analysis via Graph Contraction, in: Proceedings of the Eighth Workshop on Languages and Compilers for Parallel Computing, Lecture Notes in Computer Science 1033, Columbus, OH, August 1995. Springer, Berlin, 1996, pp. 377-391.
- (1995) Lecture Notes in Computer Science , vol.1033 , pp. 377-391
- Sheffler, T.J.¹ Schreiber, R.² Gilbert, J.R.³ Pugh, W.⁴

154
- 33751427820
- Statement-level communication-free partitioning techniques for parallelizing compilers
- K.-P. Shih, J.-P. Sheu, C.-H. Huang, Statement-level communication-free partitioning techniques for parallelizing compilers, in: Proceedings of the Ninth Workshop on Languages and Compilers for Parallel Computing, 1996.
- (1996) Proceedings of the Ninth Workshop on Languages and Compilers for Parallel Computing
- Shih, K.-P.¹ Sheu, J.-P.² Huang, C.-H.³

155
- 0029228631
- The communication software and parallel environment of the IBM SP2
- M. Snir et al., The communication software and parallel environment of the IBM SP2, IBM Systems Journal 34 (2) (1995) 205-221.
- (1995) IBM Systems Journal , vol.34 , Issue.2 , pp. 205-221
- Snir, M.¹

156
- 0343232951
- SPAM Research Group, SPAM compiler user's manual, 1997, http://www.ee.princeton.edu/spam.
- (1997) SPAM Compiler User's Manual

157
- 0342798352
- Ph.D. Thesis, Department of Computer Science, University of Colorado, Denver, Colorado
- H. Srinivasan, Optimizing explicitly parallel programs, Ph.D. Thesis, Department of Computer Science, University of Colorado, Denver, Colorado, 1994.
- (1994) Optimizing Explicitly Parallel Programs
- Srinivasan, H.¹

158
- 0000412263
- Generating communication for array statements: Design, implementation and evaluation
- J. Stichnoth, D. O'Hallaron, T. Gross, Generating communication for array statements: Design, implementation and evaluation, Journal of Parallel and Distributed Computing 21 (1) (1994) 150-159.
- (1994) Journal of Parallel and Distributed Computing , vol.21 , Issue.1 , pp. 150-159
- Stichnoth, J.¹ O'Hallaron, D.² Gross, T.³

159
- 85031528022
- Efficient program partitioning based on compiler controlled communication
- to appear
- R. Subramanian, S. Pande, Efficient program partitioning based on compiler controlled communication, in: Proceedings of the Fourth International Workshop on High-Level Parallel Programming Models and Supportive Environments, to appear.
- Proceedings of the Fourth International Workshop on High-Level Parallel Programming Models and Supportive Environments
- Subramanian, R.¹ Pande, S.²

160
- 0030676530
- Optimization of embedded DSP programs using post-pass data-flow analysis
- A. Sudarsanam et al., Optimization of embedded DSP programs using post-pass data-flow analysis, in: Proceedings of International Conference on Acoustics, Speech, and Signal Processing, 1997.
- (1997) Proceedings of International Conference on Acoustics, Speech, and Signal Processing
- Sudarsanam, A.¹

161
- 0029487859
- Memory bank and register allocation in software synthesis for ASIPs
- A. Sudarsanam, S. Malik. Memory bank and register allocation in software synthesis for ASIPs, in: Proceedings of International Conference on Computer Aided Design, 1995, pp. 388-392.
- (1995) Proceedings of International Conference on Computer Aided Design , pp. 388-392
- Sudarsanam, A.¹ Malik, S.²

162
- 0000606960
- Fast address sequence generation for data-parallel programs using integer lattices
- Proceedings of the languages and compilers for parallel computing, Springer, Berlin
- A. Thirumalai, J. Ramanujam, Fast address sequence generation for data-parallel programs using integer lattices, in: Proceedings of the languages and compilers for parallel computing, Lecture Notes in Computer Science 1033, Springer, Berlin, 1996, pp. 191-208.
- (1996) Lecture Notes in Computer Science , vol.1033 , pp. 191-208
- Thirumalai, A.¹ Ramanujam, J.²

163
- 0030295507
- Efficient computation of address sequences in data-parallel programs using closed forms for basis vectors
- A. Thirumalai, J. Ramanujam, Efficient computation of address sequences in data-parallel programs using closed forms for basis vectors, Journal of Parallel and Distributed Computing 38 (2) (1996) 188-203.
- (1996) Journal of Parallel and Distributed Computing , vol.38 , Issue.2 , pp. 188-203
- Thirumalai, A.¹ Ramanujam, J.²

164
- 84886627169
- Dataflow analysis driven dynamic data partitioning
- Fourth Workshop on Languages, Compilers, and Run-Time Systems for Scalable Computers, Springer, Pittsburgh, PA, May
- J. Tims, R. Gupta, M.L. Soffa, Dataflow analysis driven dynamic data partitioning, in: Fourth Workshop on Languages, Compilers, and Run-Time Systems for Scalable Computers, Lecture Notes in Computer Science, vol. 1511, Springer, Pittsburgh, PA, May 1998, pp. 75-90.
- (1998) Lecture Notes in Computer Science , vol.1511 , pp. 75-90
- Tims, J.¹ Gupta, R.² Soffa, M.L.³

165
- 0026986882
- Global analysis for partitioning non-strict programs into sequential threads
- San Francisco, CA
- K.R. Traub, D.E. Culler, K.E. Schauser, Global analysis for partitioning non-strict programs into sequential threads, in: ACM Conference on Lisp and Functional Programming, San Francisco, CA, 1992.
- (1992) ACM Conference on Lisp and Functional Programming
- Traub, K.R.¹ Culler, D.E.² Schauser, K.E.³

166
- 84976844340
- Direct parallelization of call statements
- R. Triolet, F. Irigoin, P. Feautrier, Direct parallelization of call statements, in: Proceedings of the Sigplan Symposium on Compiler Construction, 1986, pp. 176-185.
- (1986) Proceedings of the Sigplan Symposium on Compiler Construction , pp. 176-185
- Triolet, R.¹ Irigoin, F.² Feautrier, P.³

167
- 33751027415
- Array privatization for shared and distributed memory machines
- in ACM SIGPLAN Notices
- P. Tu, D. Padua, Array privatization for shared and distributed memory machines, in: Proceedings Second Workshop on Languages, Compilers, and Run-Time Environments for Distributed Memory Machines, in ACM SIGPLAN Notices, 1993.
- (1993) Proceedings Second Workshop on Languages, Compilers, and Run-Time Environments for Distributed Memory Machines
- Tu, P.¹ Padua, D.²

168
- 0010224751
- Runtime performance of parallel array assignment: An empirical study
- Pittsburgh, PA
- L. Wang, J. Stichnoth, S. Chatterjee, Runtime performance of parallel array assignment: an empirical study, in: Proceedings Supercomputing 96, Pittsburgh, PA, 1996.
- (1996) Proceedings Supercomputing 96
- Wang, L.¹ Stichnoth, J.² Chatterjee, S.³

169
- 0004324297
- Kluwer Academic Publishers, Dordrecht
- H. Wijshoff, Data Organization in Parallel Computers, Kluwer Academic Publishers, Dordrecht, 1989.
- (1989) Data Organization in Parallel Computers
- Wijshoff, H.¹

170
- 84976692695
- SUIF: A parallelizing and optimizing research compiler
- R. Wilson et al., SUIF: a parallelizing and optimizing research compiler, SIGPLAN Notices 29 (12) (1994) 31-37.
- (1994) SIGPLAN Notices , vol.29 , Issue.12 , pp. 31-37
- Wilson, R.¹

171
- 0026232450
- A loop transformation theory and an algorithm to maximize parallelism
- M.E. Wolf, M.S. Lam, A loop transformation theory and an algorithm to maximize parallelism, IEEE Transactions on Parallel and Distributed Systems 2 (4) (1991) 452-471.
- (1991) IEEE Transactions on Parallel and Distributed Systems , vol.2 , Issue.4 , pp. 452-471
- Wolf, M.E.¹ Lam, M.S.²

172
- 0004062640
- Pitman, London and The MIT Press, Cambridge, MA
- M.J. Wolfe, Optimizing Supercompilers for Supercomputers, Pitman, London and The MIT Press, Cambridge, MA, 1989.
- (1989) Optimizing Supercompilers for Supercomputers
- Wolfe, M.J.¹

173
- 0002433589
- Iteration space tiling for memory hierarchies
- M.J. Wolfe, Iteration space tiling for memory hierarchies, in: Proceedings of the Third SIAM Conference on Parallel Processing for Scientific Computing, 1987, pp. 357-361.
- (1987) Proceedings of the Third SIAM Conference on Parallel Processing for Scientific Computing , pp. 357-361
- Wolfe, M.J.¹

174
- 0026923851
- The power test for data dependence
- M. Wolfe, C. Tseng, The power test for data dependence, IEEE Transactions on Parallel and Distributed Systems 3 (5) (1992).
- (1992) IEEE Transactions on Parallel and Distributed Systems , vol.3 , Issue.5
- Wolfe, M.¹ Tseng, C.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.