메뉴 건너뛰기




Volumn 24, Issue 3-4, 1998, Pages 421-444

Loop parallelization algorithms: From parallelism extraction to code generation

Author keywords

Automatic parallelization; Code generation; Loop fusion; Nested loops; Parallelization algorithms

Indexed keywords

CODES (SYMBOLS); COMPUTER SYSTEMS PROGRAMMING; PARALLEL PROCESSING SYSTEMS;

EID: 0032066690     PISSN: 01678191     EISSN: None     Source Type: Journal    
DOI: 10.1016/S0167-8191(98)00020-9     Document Type: Article
Times cited : (58)

References (39)
  • 1
    • 0028743437 scopus 로고
    • Compiler transformations for high-performance computing
    • D.F. Bacon, S.L. Graham, O.J. Sharp, Compiler transformations for high-performance computing, ACM Comput. Surveys 26 (4) (1994) .
    • (1994) ACM Comput. Surveys , vol.26 , Issue.4
    • Bacon, D.F.1    Graham, S.L.2    Sharp, O.J.3
  • 3
    • 0029352643 scopus 로고
    • Minimal data dependence abstractions for loop transformations
    • Y.-Q. Yang, C. Ancourt, F. Irigoin, Minimal data dependence abstractions for loop transformations, Int. J. Parallel Programming 23 (4) (1995) 359-388.
    • (1995) Int. J. Parallel Programming , vol.23 , Issue.4 , pp. 359-388
    • Yang, Y.-Q.1    Ancourt, C.2    Irigoin, F.3
  • 4
    • 0347387866 scopus 로고    scopus 로고
    • Parallelizing nested loops with approximation of distance vectors: A survey
    • Alain Darte, Frédéric Vivien, Parallelizing nested loops with approximation of distance vectors: a survey, Parallel Process. Lett. (1997).
    • (1997) Parallel Process. Lett.
    • Darte, A.1    Vivien, F.2
  • 5
    • 0003565855 scopus 로고    scopus 로고
    • High performance Fortran forum, high performance Fortran language specification
    • Rice University, January
    • High performance Fortran forum, high performance Fortran language specification, Technical Report 2.0, Rice University, January 1997.
    • (1997) Technical Report 2.0
  • 6
    • 0041836397 scopus 로고    scopus 로고
    • German National Research Institute for Computer Science, March
    • Thomas Brandes, ADAPTOR Programmer's Guide-Version 4.0, German National Research Institute for Computer Science, March 1996.
    • (1996) ADAPTOR Programmer's Guide-Version 4.0
    • Brandes, T.1
  • 7
    • 0023438847 scopus 로고
    • Automatic translation of Fortran programs to vector form
    • J.R. Allen, K. Kennedy, Automatic translation of Fortran programs to vector form, ACM Trans. Programming Languages Syst. 9 (4) (1987) 491-542.
    • (1987) ACM Trans. Programming Languages Syst. , vol.9 , Issue.4 , pp. 491-542
    • Allen, J.R.1    Kennedy, K.2
  • 8
    • 0026232450 scopus 로고
    • A loop transformation theory and an algorithm to maximize parallelism
    • M.E. Wolf, M.S. Lam, A loop transformation theory and an algorithm to maximize parallelism, IEEE Trans. Parallel Distributed Syst. 2 (4) (1991) 452-471.
    • (1991) IEEE Trans. Parallel Distributed Syst. , vol.2 , Issue.4 , pp. 452-471
    • Wolf, M.E.1    Lam, M.S.2
  • 9
    • 0029747936 scopus 로고    scopus 로고
    • Optimal fine and medium grain parallelism detection in polyhedral reduced dependence graphs
    • IEEE Computer Society Press, Boston, MA, October
    • Alain Darte, Frédéric Vivien, Optimal fine and medium grain parallelism detection in polyhedral reduced dependence graphs, Proceedings of PACT '96, IEEE Computer Society Press, Boston, MA, October 1996.
    • (1996) Proceedings of PACT '96
    • Darte, A.1    Vivien, F.2
  • 10
    • 0001448065 scopus 로고
    • Some efficient solutions to the affine scheduling problem: Part II. Multidimensional time
    • P. Feautrier, Some efficient solutions to the affine scheduling problem: Part II. Multidimensional time, Int. J. Parallel Programming 21 (6) (1992) 389-420.
    • (1992) Int. J. Parallel Programming , vol.21 , Issue.6 , pp. 389-420
    • Feautrier, P.1
  • 12
    • 84938023119 scopus 로고
    • Analysis of programs for parallel processing
    • A.J. Bernstein, Analysis of programs for parallel processing, IEEE Trans. Electron. Comput. 15 (1966) 757-762.
    • (1966) IEEE Trans. Electron. Comput. , vol.15 , pp. 757-762
    • Bernstein, A.J.1
  • 13
    • 0141513025 scopus 로고
    • PFC: A program to convert Fortran to parallel form
    • Rice University, Houston, TX, USA
    • John R. Allen, Ken Kennedy, PFC: a program to convert Fortran to parallel form, Technical Report MASC-TR82-6, Rice University, Houston, TX, USA, 1982.
    • (1982) Technical Report MASC-TR82-6
    • Allen, J.R.1    Kennedy, K.2
  • 14
    • 0016026944 scopus 로고
    • The parallel execution of DO loops
    • L. Lamport, The parallel execution of DO loops, Commun. ACM 17 (2) (1974) 83-93.
    • (1974) Commun. ACM , vol.17 , Issue.2 , pp. 83-93
    • Lamport, L.1
  • 16
    • 0345703819 scopus 로고
    • Computing dependence direction vectors and dependence cones with linear systems
    • Ecole des Mines de Paris, Fontainebleau, France
    • François Irigoin, Rémy Triolet, Computing dependence direction vectors and dependence cones with linear systems, Technical Report ENSMP-CAI-87-E94, Ecole des Mines de Paris, Fontainebleau, France, 1987.
    • (1987) Technical Report ENSMP-CAI-87-E94
    • Irigoin, F.1    Triolet, R.2
  • 17
    • 0026109335 scopus 로고
    • Dataflow analysis of array and scalar references
    • P. Feautrier, Dataflow analysis of array and scalar references, Int. J. Parallel Programming 20 (1) (1991) 23-51.
    • (1991) Int. J. Parallel Programming , vol.20 , Issue.1 , pp. 23-51
    • Feautrier, P.1
  • 19
    • 0343099860 scopus 로고    scopus 로고
    • On the optimality of Allen and Kennedy's algorithm for parallelism extraction in nested loops
    • Special issue on Optimizing Compilers for Parallel Languages
    • A. Darte, F. Vivien, On the optimality of Allen and Kennedy's algorithm for parallelism extraction in nested loops, J. Parallel Algorithms Applicat. 12 (1-3) (1997) 83-112, Special issue on Optimizing Compilers for Parallel Languages.
    • (1997) J. Parallel Algorithms Applicat. , vol.12 , Issue.1-3 , pp. 83-112
    • Darte, A.1    Vivien, F.2
  • 20
    • 0026933251 scopus 로고
    • Some efficient solutions to the affine scheduling problem: Part I. One-dimensional time
    • P. Feautrier, Some efficient solutions to the affine scheduling problem: Part I. One-dimensional time, Int. J. Parallel Programming 21 (5) (1992) 313-348.
    • (1992) Int. J. Parallel Programming , vol.21 , Issue.5 , pp. 313-348
    • Feautrier, P.1
  • 22
    • 0342782260 scopus 로고    scopus 로고
    • Combining retiming and scheduling techniques for loop parallelization and loop tiling
    • to appear. Also available as Tech. Rep. LIP, ENS-Lyon, RR96-34
    • Alain Darte, Georges-André Silber, Frédéric Vivien, Combining retiming and scheduling techniques for loop parallelization and loop tiling, Parallel Process. Lett. (1997) Special issue, to appear. Also available as Tech. Rep. LIP, ENS-Lyon, RR96-34.
    • (1997) Parallel Process. Lett. , Issue.SPEC. ISSUE
    • Darte, A.1    Silber, G.-A.2    Vivien, F.3
  • 25
    • 84976676720 scopus 로고
    • A practical algorithm for exact array dependence analysis
    • W. Pugh, A practical algorithm for exact array dependence analysis, Commun. ACM 8 (1992) 27-47.
    • (1992) Commun. ACM , vol.8 , pp. 27-47
    • Pugh, W.1
  • 27
    • 0342782294 scopus 로고
    • Construction of DO loops from systems of affine constraints
    • J.-F. Collard, P. Feautrier, T. Risset, Construction of DO loops from systems of affine constraints, Parallel Process. Lett. 5 (3) (1995) 421-436.
    • (1995) Parallel Process. Lett. , vol.5 , Issue.3 , pp. 421-436
    • Collard, J.-F.1    Feautrier, P.2    Risset, T.3
  • 29
    • 0347387865 scopus 로고    scopus 로고
    • The group of Pr. Lengauer, The loopo project
    • The group of Pr. Lengauer, The loopo project, World Wide Web document, URL: http://brahms.fmi.uni-passau.de/cl/loopo/index.html.
  • 32
    • 0003378935 scopus 로고
    • A singular loop transformation framework based on non-singular matrices
    • Yale University, August
    • Wei Li, Keshav Pingali, A singular loop transformation framework based on non-singular matrices, 5th Workshop on Languages and Compilers for Parallel Computing, Yale University, August 1992, pp. 249-260.
    • (1992) 5th Workshop on Languages and Compilers for Parallel Computing , pp. 249-260
    • Li, W.1    Pingali, K.2
  • 33
    • 0028434044 scopus 로고
    • Automatic non-unimodular transformations of loop nests
    • J. Xue, Automatic non-unimodular transformations of loop nests, Parallel Comput. 20 (5) (1994) 711-728.
    • (1994) Parallel Comput. , vol.20 , Issue.5 , pp. 711-728
    • Xue, J.1
  • 34
    • 0028014881 scopus 로고
    • Code generation in automatic parallelizers
    • Claude Girault (Ed.), IFIP WG 10.3, North Holland, April
    • Jean-François Collard, Code generation in automatic parallelizers, in: Claude Girault (Ed.), Proc. Int. Conf. on Application in Parallel and Distributed Computing, IFIP WG 10.3, North Holland, April 1994, pp. 185-194.
    • (1994) Proc. Int. Conf. on Application in Parallel and Distributed Computing , pp. 185-194
    • Collard, J.-F.1
  • 36
    • 0346126915 scopus 로고    scopus 로고
    • William Pugh, the Omega Team, World Wide Web document, url:http://www.cs.umd.edu/projects/omega/.
    • Pugh, W.1
  • 37
    • 0348018586 scopus 로고    scopus 로고
    • Loop parallelization algorithms: From parallelism extraction to code generation
    • LIP, ENS-Lyon, France, June
    • Pierre Boulet, Alain Darte, Georges-André Silber, Frédéric Vivien, Loop parallelization algorithms: from parallelism extraction to code generation, Technical Report 97-17, LIP, ENS-Lyon, France, June 1997.
    • (1997) Technical Report 97-17
    • Boulet, P.1    Darte, A.2    Silber, G.-A.3    Vivien, F.4
  • 38
    • 0001465739 scopus 로고
    • Maximizing loop parallelism and improving data locality via loop fusion and distribution
    • U. Banerjee, D. Gelernter, A. Nicolau, D. Padua, (Eds.), The Sixth Annual Languages and Compiler for Parallelism Workshop, Springer-Verlag
    • Kathryn S. McKinley, Ken Kennedy, Maximizing loop parallelism and improving data locality via loop fusion and distribution, in: U. Banerjee, D. Gelernter, A. Nicolau, D. Padua, (Eds.), The Sixth Annual Languages and Compiler for Parallelism Workshop, Number 768 in Lecture Notes in Computer Science, Springer-Verlag, 1993, pp. 301-320.
    • (1993) Lecture Notes in Computer Science , vol.768 , pp. 301-320
    • McKinley, K.S.1    Kennedy, K.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.