Volume 25, Issue 13, 1999, Pages 1741-1783

Compilation techniques for parallel systems

Author keywords

[No Author keywords available]

Indexed keywords

COMPUTER ARCHITECTURE; COMPUTER SYSTEMS PROGRAMMING; DATA STORAGE EQUIPMENT; OPTIMIZATION; PROGRAM COMPILERS; RESPONSE TIME (COMPUTER SYSTEMS)

EID: 0033283421     PISSN: 01678191     EISSN: None     Source Type: Journal    
DOI: 10.1016/S0167-8191(99)00086-1     Document Type: Article
Times cited: 28

References (174)
  • 2
    • 0029373981
    • Automatic partitioning of parallel loops and data arrays for distributed shared-memory multiprocessors
    • A. Agarwal, D. Kranz, V. Natarajan, Automatic partitioning of parallel loops and data arrays for distributed shared-memory multiprocessors, IEEE Transactions on Parallel and Distributed Systems 6 (9) (1995) 943-962.
    • (1995) IEEE Transactions on Parallel and Distributed Systems , vol.6 , Issue.9 , pp. 943-962
    • Agarwal, A.1    Kranz, D.2    Natarajan, V.3
  • 3
    • 0032114494
    • Interprocedural partial redundancy elimination with application to distributed memory compilation
    • G. Agrawal, Interprocedural partial redundancy elimination with application to distributed memory compilation, IEEE Transactions on Parallel and Distributed Systems 9 (7) (1998) 609-625.
    • (1998) IEEE Transactions on Parallel and Distributed Systems , vol.9 , Issue.7 , pp. 609-625
    • Agrawal, G.1
  • 7
    • 0342363542
    • Vector register allocation
    • Rice University, Houston, TX, December
    • R. Allen, K. Kennedy, Vector register allocation, Technical Report TR86-45, Rice University, Houston, TX, December 1986.
    • (1986) Technical Report TR86-45
    • Allen, R.1    Kennedy, K.2
  • 21
    • 0026817662
    • Optimizing stack frame accesses for processors with restricted addressing modes
    • D. Bartley, Optimizing stack frame accesses for processors with restricted addressing modes, Software Practice and Experience 22 (2) (1992) 101-110.
    • (1992) Software Practice and Experience , vol.22 , Issue.2 , pp. 101-110
    • Bartley, D.1
  • 22
    • 84947776744
    • Solving alignment using elementary linear algebra
    • Proceedings of the Seventh Workshop on Languages and Compilers for Parallel Computing, Ithaca, NY, Springer, Berlin
    • D. Bau, I. Kodukula, V. Kotlyar, K. Pingali, P. Stodghill, Solving alignment using elementary linear algebra, in: Proceedings of the Seventh Workshop on Languages and Compilers for Parallel Computing, Lecture Notes in Computer Science, vol. 892, Ithaca, NY, 1994, Springer, Berlin, 1995, pp. 46-60.
    • (1994) Lecture Notes in Computer Science , vol.892 , pp. 46-60
    • Bau, D.1    Kodukula, I.2    Kotlyar, V.3    Pingali, K.4    Stodghill, P.5
  • 35
    • 0343668689
    • Automatic support for data distribution on distributed memory multiprocessor systems
    • Proceedings of the Sixth Workshop on Languages and Compilers for Parallel Computing, Portland, OR, Aug. Springer, Berlin
    • B. Chapman, T. Fahringer, H. Zima, Automatic support for data distribution on distributed memory multiprocessor systems, in: Proceedings of the Sixth Workshop on Languages and Compilers for Parallel Computing, Lecture Notes in Computer Science, vol. 768, Portland, OR, Aug. 1993, Springer, Berlin, 1994, pp. 184-199.
    • (1993) Lecture Notes in Computer Science , vol.768 , pp. 184-199
    • Chapman, B.1    Fahringer, T.2    Zima, H.3
  • 36
    • 0343668692
    • The alignment-distribution graph
    • Languages and Compilers for Parallel Computing, Sixth International Workshop, Springer, Berlin
    • S. Chatterjee, J. Gilbert, R. Schreiber, The alignment-distribution graph, in: Languages and Compilers for Parallel Computing, Sixth International Workshop, Lecture Notes in Computer Science, vol. 768, Springer, Berlin, 1993.
    • (1993) Lecture Notes in Computer Science , vol.768
    • Chatterjee, S.1    Gilbert, J.2    Schreiber, R.3
  • 38
    • 0028499023
    • Communication-free data allocation techniques for parallelizing compilers on multicomputers
    • T.S. Chen, J.P. Sheu, Communication-free data allocation techniques for parallelizing compilers on multicomputers, IEEE Transactions on Parallel and Distributed Systems 5 (9) (1994) 924-938.
    • (1994) IEEE Transactions on Parallel and Distributed Systems , vol.5 , Issue.9 , pp. 924-938
    • Chen, T.S.1    Sheu, J.P.2
  • 40
    • 0343668688
    • Evolutionary compilation to long instruction superscalar microarchitectures for exploiting parallelism at all levels
    • T.M. Conte, Evolutionary compilation to long instruction superscalar microarchitectures for exploiting parallelism at all levels, in: ASPLOS Wild and Crazy Idea Session, 1998.
    • (1998) ASPLOS Wild and Crazy Idea Session
    • Conte, T.M.1
  • 48
    • 0032123777
    • The IA-64 architecture at work
    • C. Dulong, The IA-64 architecture at work, IEEE Computer (1998) 24-32.
    • (1998) IEEE Computer , pp. 24-32
    • Dulong, C.1
  • 52
    • 0019596071
    • Trace scheduling: A technique for global microcode compaction
    • J.A. Fisher, Trace scheduling: a technique for global microcode compaction, IEEE Transactions on Computers 30 (7) (1981) 478-490.
    • (1981) IEEE Transactions on Computers , vol.30 , Issue.7 , pp. 478-490
    • Fisher, J.A.1
  • 53
    • 0031237815
    • Walk-time techniques: Catalyst for architectural change
    • J.A. Fisher, Walk-time techniques: catalyst for architectural change, IEEE Computer 30 (9) (1997) 40-42.
    • (1997) IEEE Computer , vol.30 , Issue.9 , pp. 40-42
    • Fisher, J.A.1
  • 57
    • 0343668683
    • The use of BLAS3 in linear algebra on a parallel processor with a hierarchical memory
    • Center for Supercomputing Res. and Dev., University of Illinois, October
    • K. Gallivan, W. Jalby, U. Meier, The use of BLAS3 in linear algebra on a parallel processor with a hierarchical memory, Technical Report CSRD Rpt. No. 610, Center for Supercomputing Res. and Dev., University of Illinois, October 1986.
    • (1986) Technical Report CSRD Rpt. No. 610 , vol.610
    • Gallivan, K.1    Jalby, W.2    Meier, U.3
  • 66
    • 0025413768
    • Region scheduling: An approach for detecting and redistributing parallelism
    • R. Gupta, M.L. Soffa, Region scheduling: an approach for detecting and redistributing parallelism, IEEE Transactions on Software Engineering 16 (4) (1990) 421-431.
    • (1990) IEEE Transactions on Software Engineering , vol.16 , Issue.4 , pp. 421-431
    • Gupta, R.1    Soffa, M.L.2
  • 68
    • 84949185314
    • Register pressure sensitive redundancy elimination
    • Proceedings of the International Conference on Compiler Construction, Springer, Amsterdam, Netherlands
    • R. Gupta, R. Bodik, Register pressure sensitive redundancy elimination, in: Proceedings of the International Conference on Compiler Construction, Lecture Notes in Computer Science, vol. 1575, Springer, Amsterdam, Netherlands, pp. 107-121.
    • Lecture Notes in Computer Science , vol.1575 , pp. 107-121
    • Gupta, R.1    Bodik, R.2
  • 78
    • 0003565855
    • High Performance Fortran Forum, High Performance Fortran language specification, version 2.0
    • Center for Research on Parallel Computation, Rice University, Houston, TX, January
    • High Performance Fortran Forum, High Performance Fortran language specification, version 2.0. Technical Report CRPC-TR92225, Center for Research on Parallel Computation, Rice University, Houston, TX, January 1997.
    • (1997) Technical Report CRPC-TR92225
  • 79
    • 84976706957
    • Interprocedural compilation of Fortran D for MIMD distributed-memory machines
    • Minneapolis, MN
    • M.W. Hall, S. Hiranandani, K. Kennedy, C. Tseng, Interprocedural compilation of Fortran D for MIMD distributed-memory machines, in: Proceedings of Supercomputing'92, Minneapolis, MN, 1992, pp. 522-534.
    • (1992) Proceedings of Supercomputing'92 , pp. 522-534
    • Hall, M.W.1    Hiranandani, S.2    Kennedy, K.3    Tseng, C.4
  • 80
    • 84976813879
    • Compiling Fortran D for MIMD distributed-memory machines
    • S. Hiranandani, K. Kennedy, C.W. Tseng, Compiling Fortran D for MIMD distributed-memory machines, Communications of the ACM 35 (8) (1992) 66-80.
    • (1992) Communications of the ACM , vol.35 , Issue.8 , pp. 66-80
    • Hiranandani, S.1    Kennedy, K.2    Tseng, C.W.3
  • 84
    • 0342798375
    • Compiling parallel loops for high performance computers -partitioning
    • Kluwer Academic Publishers, Boston, MA
    • D.E. Hudak, S.G. Abraham, Compiling parallel loops for high performance computers -partitioning, in: Data Assignment and Remapping, Kluwer Academic Publishers, Boston, MA, 1993.
    • (1993) Data Assignment and Remapping
    • Hudak, D.E.1    Abraham, S.G.2
  • 85
    • 0342363503
    • Technology outlook: Introduction to predicated execution
    • W-m. Hwu, Technology outlook: introduction to predicated execution, IEEE Computer 31 (1) (1998) 49-50.
    • (1998) IEEE Computer , vol.31 , Issue.1 , pp. 49-50
    • Hwu, W.-M.1
  • 87
    • 85031523618
    • Document SC23-0526-01
    • IBM, Engineering and Scientific Subroutine Library (ESSL), Guide and Reference, Document SC23-0526-01, 1994.
    • (1994) Guide and Reference
  • 89
    • 0008434041
    • Multiple threads template library
    • Real World Computing Partnership, September
    • Y. Ishikawa, Multiple threads template library, Technical Report TR-96-012, Real World Computing Partnership, September 1996.
    • (1996) Technical Report TR-96-012
    • Ishikawa, Y.1
  • 91
    • 85031535123
    • Code motion for generating compact code on embedded DSPs
    • Washington, DC, 4-6 December'98
    • V. Jain, S. Pande, Code motion for generating compact code on embedded DSPs, 1998 Workshop on Compiler and architecture support for embedded systems, Washington, DC, 4-6 December'98. Available under publications link at http://www.ececs.uc.edu/ compiler.
    • 1998 Workshop on Compiler and Architecture Support for Embedded Systems
    • Jain, V.1    Pande, S.2
  • 92
    • 0342798373
    • HPC++: Experiments with the parallel standard template library
    • Indiana University, Department of Computer Science, December
    • E. Johnson, D. Gannon, HPC++: Experiments with the parallel standard template library, Technical Report TR-96-51, Indiana University, Department of Computer Science, December 1996.
    • (1996) Technical Report TR-96-51
    • Johnson, E.1    Gannon, D.2
  • 102
    • 0026294380
    • Compile-time Generation of Communication for scientific programs
    • Albuquerque, NM
    • C. Koelbel, Compile-time Generation of Communication for scientific programs, in: Proceedings of Supercomputing '91, Albuquerque, NM, 1991, pp. 101-110.
    • (1991) Proceedings of Supercomputing '91 , pp. 101-110
    • Koelbel, C.1
  • 103
    • 0026190245
    • The I-Test: An improved dependence test for automatic parallelization and vectorization
    • IEEE Transactions on Parallel and Distributed Systems
    • X. Kong, D. Klappholz, K. Psarris, The I-Test: An improved dependence test for automatic parallelization and vectorization, IEEE Transactions on Parallel and Distributed Systems, Special Issue on Parallel Languages and Compilers 2 (3) (1991).
    • (1991) Parallel Languages and Compilers , vol.2 , Issue.3 SPEC. ISSUE
    • Kong, X.1    Klappholz, D.2    Psarris, K.3
  • 106
    • 0016026944
    • The parallel execution of DO loops
    • L. Lamport, The parallel execution of DO loops, Communications of the ACM 17 (2) (1974) 83-93.
    • (1974) Communications of the ACM , vol.17 , Issue.2 , pp. 83-93
    • Lamport, L.1
  • 107
    • 0003327314
    • Concurrent static single assignment form and constant propagation for explicitly parallel programs
    • Proceedings of the 10th International Workshop on Languages and Compilers for Parallel Computing, Springer, Minneapolis, MN, August
    • J. Lee, S.P. Midkiff, D.A. Padua, Concurrent static single assignment form and constant propagation for explicitly parallel programs, in: Proceedings of the 10th International Workshop on Languages and Compilers for Parallel Computing, Lecture Notes in Computer Science, Springer, Minneapolis, MN, August 1997.
    • (1997) Lecture Notes in Computer Science
    • Lee, J.1    Midkiff, S.P.2    Padua, D.A.3
  • 108
    • 0026187669
    • Compiling communication-efficient programs for massively parallel machines
    • J. Li, M. Chen, Compiling communication-efficient programs for massively parallel machines, IEEE Transactions on Parallel and Distributed Systems 2 (3) (1991) 361-376.
    • (1991) IEEE Transactions on Parallel and Distributed Systems , vol.2 , Issue.3 , pp. 361-376
    • Li, J.1    Chen, M.2
  • 109
    • 0343232962
    • Efficient interprocedural analysis for program parallelization and restructuring
    • New Haven, CT, July
    • Z. Li, P. Yew, Efficient interprocedural analysis for program parallelization and restructuring, in: Proceedings of the ACM/SIGPLAN Symposium on Parallel Programming, New Haven, CT, July 1998.
    • (1998) Proceedings of the ACM/SIGPLAN Symposium on Parallel Programming
    • Li, Z.1    Yew, P.2
  • 121
    • 0027767071
    • A scheduler-sensitive global register allocator
    • Portland, Oregon
    • C. Norris, L.L. Pollock, A scheduler-sensitive global register allocator, in: Proceedings of Supercomputing'93, Portland, Oregon, 1993, pp. 804-813.
    • (1993) Proceedings of Supercomputing'93 , pp. 804-813
    • Norris, C.1    Pollock, L.L.2
  • 126
    • 85031535440
    • A computation + communication load balanced loop partitioning method for distributed memory systems
    • to appear
    • S. Pande, T. Bali, A computation + communication load balanced loop partitioning method for distributed memory systems, Journal of Parallel and Distributed Computing, to appear.
    • Journal of Parallel and Distributed Computing
    • Pande, S.1    Bali, T.2
  • 128
    • 0002524997
    • A compile time partitioning method for DOALL loops on distributed memory systems
    • IEEE Computer Society Press, Silver Spring, MD
    • S. Pande, A compile time partitioning method for DOALL loops on distributed memory systems, in: International Conference on Parallel Processing, vol. III, IEEE Computer Society Press, Silver Spring, MD, 1996, pp. 35-44.
    • (1996) International Conference on Parallel Processing , vol.3 , pp. 35-44
    • Pande, S.1
  • 131
    • 0030076621
    • The Banerjee-Wolfe and GCD tests on exact data dependence information
    • K. Psarris, The Banerjee-Wolfe and GCD tests on exact data dependence information, Journal of Parallel and Distributed Computing 32 (2) (1996).
    • (1996) Journal of Parallel and Distributed Computing , vol.32 , Issue.2
    • Psarris, K.1
  • 132
    • 33745192514
    • On the accuracy of the Banerjee test
    • Journal of Parallel and Distributed Computing
    • K. Psarris, D. Klappholz, X. Kong, On the accuracy of the Banerjee test, Journal of Parallel and Distributed Computing, Special Issue on Shared Memory Multiprocessors 12 (2) (1991).
    • (1991) Shared Memory Multiprocessors , vol.12 , Issue.2 SPEC. ISSUE
    • Psarris, K.1    Klappholz, D.2    Kong, X.3
  • 135
    • 84976676720
    • A practical algorithm for exact array dependence analysis
    • W. Pugh, A practical algorithm for exact array dependence analysis, Communications of the ACM 35 (8) (1992).
    • (1992) Communications of the ACM , vol.35 , Issue.8
    • Pugh, W.1
  • 138
    • 0003015894
    • Some scheduling techniques and an easily schedulable horizontal architecture for high performance scientific computing
    • Chatham, MA
    • B.R. Rau, C.D. Glaeser, Some scheduling techniques and an easily schedulable horizontal architecture for high performance scientific computing, in: Proceedings of the 14th Annual Microprogramming Workshop, Chatham, MA, 1981, pp. 183-198.
    • (1981) Proceedings of the 14th Annual Microprogramming Workshop , pp. 183-198
    • Rau, B.R.1    Glaeser, C.D.2
  • 144
    • 0343668653
    • Optimizing CM Fortran compiler for connection machine computers
    • G. Sabot, Optimizing CM Fortran compiler for connection machine computers, Journal of Parallel and Distributed Computing 23 (1) (1994) 224-238.
    • (1994) Journal of Parallel and Distributed Computing , vol.23 , Issue.1 , pp. 224-238
    • Sabot, G.1
  • 147
    • 0026213832
    • Automatic partitioning of a program dependence graph into parallel tasks
    • V. Sarkar, Automatic partitioning of a program dependence graph into parallel tasks, IBM Journal of Research and Development, 35 (5/6) (1991).
    • (1991) IBM Journal of Research and Development , vol.35 , Issue.5-6
    • Sarkar, V.1
  • 149
    • 0031140581
    • Automatic selection of high order transformations in the IBM XL Fortran Compilers
    • V. Sarkar, Automatic selection of high order transformations in the IBM XL Fortran Compilers, IBM Journal of Research and Development 41 (3) (1997).
    • (1997) IBM Journal of Research and Development , vol.41 , Issue.3
    • Sarkar, V.1
  • 153
    • 84955559042
    • Efficient Distribution Analysis via Graph Contraction
    • Proceedings of the Eighth Workshop on Languages and Compilers for Parallel Computing, Columbus, OH, August Springer, Berlin
    • T.J. Sheffler, R. Schreiber, J.R. Gilbert, W. Pugh, Efficient Distribution Analysis via Graph Contraction, in: Proceedings of the Eighth Workshop on Languages and Compilers for Parallel Computing, Lecture Notes in Computer Science 1033, Columbus, OH, August 1995. Springer, Berlin, 1996, pp. 377-391.
    • (1995) Lecture Notes in Computer Science , vol.1033 , pp. 377-391
    • Sheffler, T.J.1    Schreiber, R.2    Gilbert, J.R.3    Pugh, W.4
  • 155
    • 0029228631
    • The communication software and parallel environment of the IBM SP2
    • M. Snir et al., The communication software and parallel environment of the IBM SP2, IBM Systems Journal 34 (2) (1995) 205-221.
    • (1995) IBM Systems Journal , vol.34 , Issue.2 , pp. 205-221
    • Snir, M.1
  • 157
    • 0342798352
    • Ph.D. Thesis, Department of Computer Science, University of Colorado, Denver, Colorado
    • H. Srinivasan, Optimizing explicitly parallel programs, Ph.D. Thesis, Department of Computer Science, University of Colorado, Denver, Colorado, 1994.
    • (1994) Optimizing Explicitly Parallel Programs
    • Srinivasan, H.1
  • 158
    • 0000412263
    • Generating communication for array statements: Design, implementation and evaluation
    • J. Stichnoth, D. O'Hallaron, T. Gross, Generating communication for array statements: Design, implementation and evaluation, Journal of Parallel and Distributed Computing 21 (1) (1994) 150-159.
    • (1994) Journal of Parallel and Distributed Computing , vol.21 , Issue.1 , pp. 150-159
    • Stichnoth, J.1    O'Hallaron, D.2    Gross, T.3
  • 162
    • 0000606960
    • Fast address sequence generation for data-parallel programs using integer lattices
    • Proceedings of the languages and compilers for parallel computing, Springer, Berlin
    • A. Thirumalai, J. Ramanujam, Fast address sequence generation for data-parallel programs using integer lattices, in: Proceedings of the languages and compilers for parallel computing, Lecture Notes in Computer Science 1033, Springer, Berlin, 1996, pp. 191-208.
    • (1996) Lecture Notes in Computer Science , vol.1033 , pp. 191-208
    • Thirumalai, A.1    Ramanujam, J.2
  • 163
    • 0030295507
    • Efficient computation of address sequences in data-parallel programs using closed forms for basis vectors
    • A. Thirumalai, J. Ramanujam, Efficient computation of address sequences in data-parallel programs using closed forms for basis vectors, Journal of Parallel and Distributed Computing 38 (2) (1996) 188-203.
    • (1996) Journal of Parallel and Distributed Computing , vol.38 , Issue.2 , pp. 188-203
    • Thirumalai, A.1    Ramanujam, J.2
  • 164
    • 84886627169
    • Dataflow analysis driven dynamic data partitioning
    • Fourth Workshop on Languages, Compilers, and Run-Time Systems for Scalable Computers, Springer, Pittsburgh, PA, May
    • J. Tims, R. Gupta, M.L. Soffa, Dataflow analysis driven dynamic data partitioning, in: Fourth Workshop on Languages, Compilers, and Run-Time Systems for Scalable Computers, Lecture Notes in Computer Science, vol. 1511, Springer, Pittsburgh, PA, May 1998, pp. 75-90.
    • (1998) Lecture Notes in Computer Science , vol.1511 , pp. 75-90
    • Tims, J.1    Gupta, R.2    Soffa, M.L.3
  • 168
    • 0010224751
    • Runtime performance of parallel array assignment: An empirical study
    • Pittsburgh, PA
    • L. Wang, J. Stichnoth, S. Chatterjee, Runtime performance of parallel array assignment: an empirical study, in: Proceedings Supercomputing 96, Pittsburgh, PA, 1996.
    • (1996) Proceedings Supercomputing 96
    • Wang, L.1    Stichnoth, J.2    Chatterjee, S.3
  • 170
    • 84976692695
    • SUIF: A parallelizing and optimizing research compiler
    • R. Wilson et al., SUIF: a parallelizing and optimizing research compiler, SIGPLAN Notices 29 (12) (1994) 31-37.
    • (1994) SIGPLAN Notices , vol.29 , Issue.12 , pp. 31-37
    • Wilson, R.1


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.