Volume 43, Issue 6, 2008, Pages 101-113

A practical automatic polyhedral parallelizer and locality optimizer

Author keywords

Affine transformations; Automatic parallelization; Locality optimization; Loop transformations; Polyhedral model; Tiling

Indexed keywords

APPLICATION PROGRAMMING INTERFACES (API); INTEGER PROGRAMMING; LINEAR PROGRAMMING; LINEAR TRANSFORMATIONS; MATHEMATICAL TRANSFORMATIONS; PROGRAM COMPILERS;

EID: 67650079888     PISSN: 15232867     EISSN: None     Source Type: Journal    
DOI: 10.1145/1379022.1375595     Document Type: Article
Times cited : (244)

References (63)
  • 2
    • 10844259103
    • Synthesizing transformations for locality enhancement of imperfectly-nested loops
    • Oct
    • N. Ahmed, N. Mateev, and K. Pingali. Synthesizing transformations for locality enhancement of imperfectly-nested loops. Intl. J. of Parallel Programming, 29(5), Oct. 2001.
    • (2001) Intl. J. of Parallel Programming , vol.29 , Issue.5
    • Ahmed, N.1    Mateev, N.2    Pingali, K.3
  • 4
    • 84976766536
    • Scanning polyhedra with do loops
    • C. Ancourt and F. Irigoin. Scanning polyhedra with do loops. In ACM SIGPLAN PPoPP'91, pages 39-50, 1991.
    • (1991) ACM SIGPLAN PPoPP'91 , pp. 39-50
    • Ancourt, C.1    Irigoin, F.2
  • 6
    • 10444289646
    • Code generation in the polyhedral model is easier than you think
    • Sept
    • C. Bastoul. Code generation in the polyhedral model is easier than you think. In IEEE PACT, pages 7-16, Sept. 2004.
    • (2004) IEEE PACT , pp. 7-16
    • Bastoul, C.1
  • 8
    • 57349110181
    • Affine transformations for communication minimal parallelization and locality optimization of arbitrarily-nested loop sequences
    • Technical Report OSU-CISRC-5/07-TR43, The Ohio State University, May
    • U. Bondhugula, M. Baskaran, S. Krishnamoorthy, J. Ramanujam, A. Rountev, and P. Sadayappan. Affine transformations for communication minimal parallelization and locality optimization of arbitrarily-nested loop sequences. Technical Report OSU-CISRC-5/07-TR43, The Ohio State University, May 2007.
    • (2007)
    • Bondhugula, U.1    Baskaran, M.2    Krishnamoorthy, S.3    Ramanujam, J.4    Rountev, A.5    Sadayappan, P.6
  • 10
    • 47249137843
    • Pluto: A practical and fully automatic polyhedral parallelizer and locality optimizer
    • Technical Report OSU-CISRC-10/07-TR70, The Ohio State University, Oct
    • U. Bondhugula, J. Ramanujam, and P. Sadayappan. Pluto: A practical and fully automatic polyhedral parallelizer and locality optimizer. Technical Report OSU-CISRC-10/07-TR70, The Ohio State University, Oct. 2007.
    • (2007)
    • Bondhugula, U.1    Ramanujam, J.2    Sadayappan, P.3
  • 12
    • 0032066690
    • Loop parallelization algorithms: From parallelism extraction to code generation
    • P. Boulet, A. Darte, G.-A. Silber, and F. Vivien. Loop parallelization algorithms: From parallelism extraction to code generation. Parallel Computing, 24(3-4):421-444, 1998.
    • (1998) Parallel Computing , vol.24 , Issue.3-4 , pp. 421-444
    • Boulet, P.1    Darte, A.2    Silber, G.-A.3    Vivien, F.4
  • 16
    • 0342782260
    • Combining retiming and scheduling techniques for loop parallelization and loop tiling
    • A. Darte, G.-A. Silber, and F. Vivien. Combining retiming and scheduling techniques for loop parallelization and loop tiling. Parallel Processing Letters, 7(4):379-392, 1997.
    • (1997) Parallel Processing Letters , vol.7 , Issue.4 , pp. 379-392
    • Darte, A.1    Silber, G.-A.2    Vivien, F.3
  • 17
    • 0031358458
    • Optimal fine and medium grain parallelism detection in polyhedral reduced dependence graphs
    • Dec
    • A. Darte and F. Vivien. Optimal fine and medium grain parallelism detection in polyhedral reduced dependence graphs. Intl. J. Parallel Programming, 25(6):447-496, Dec. 1997.
    • (1997) Intl. J. Parallel Programming , vol.25 , Issue.6 , pp. 447-496
    • Darte, A.1    Vivien, F.2
  • 18
  • 19
    • 0026109335
    • Dataflow analysis of scalar and array references
    • Feb
    • P. Feautrier. Dataflow analysis of scalar and array references. Intl. J. of Parallel Programming, 20(1):23-53, Feb. 1991.
    • (1991) Intl. J. of Parallel Programming , vol.20 , Issue.1 , pp. 23-53
    • Feautrier, P.1
  • 20
    • 0026933251
    • Some efficient solutions to the affine scheduling problem: I. one-dimensional time
    • P. Feautrier. Some efficient solutions to the affine scheduling problem: I. one-dimensional time. Intl. J. of Parallel Programming, 21(5):313-348, 1992.
    • (1992) Intl. J. of Parallel Programming , vol.21 , Issue.5 , pp. 313-348
    • Feautrier, P.1
  • 21
    • 0001448065
    • Some efficient solutions to the affine scheduling problem, part II. multidimensional time
    • P. Feautrier. Some efficient solutions to the affine scheduling problem, part II. multidimensional time. Intl. J. of Parallel Programming, 21(6):389-420, 1992.
    • (1992) Intl. J. of Parallel Programming , vol.21 , Issue.6 , pp. 389-420
    • Feautrier, P.1
  • 24
    • 67650089094
    • Automatic Parallelization of Loop Programs for Distributed Memory Architectures. University of Passau, Habilitation thesis
    • M. Griebl. Automatic Parallelization of Loop Programs for Distributed Memory Architectures. University of Passau, 2004. Habilitation thesis.
    • (2004)
    • Griebl, M.1
  • 25
    • 84936955817
    • Code generation in the polytope model
    • M. Griebl, C. Lengauer, and S. Wetzel. Code generation in the polytope model. In IEEE PACT, pages 106-111, 1998.
    • (1998) IEEE PACT , pp. 106-111
    • Griebl, M.1    Lengauer, C.2    Wetzel, S.3
  • 26
    • 0036958653
    • On time optimal supernode shape
    • E. Hodzic and W. Shang. On time optimal supernode shape. IEEE Trans. Par. & Dist. Sys., 13(12):1220-1233, 2002.
    • (2002) IEEE Trans. Par. & Dist. Sys , vol.13 , Issue.12 , pp. 1220-1233
    • Hodzic, E.1    Shang, W.2
  • 27
    • 0032642196
    • Selecting tile shape for minimal execution time
    • K. Hogstedt, L. Carter, and J. Ferrante. Selecting tile shape for minimal execution time. In SPAA, pages 201-211, 1999.
    • (1999) SPAA , pp. 201-211
    • Hogstedt, K.1    Carter, L.2    Ferrante, J.3
  • 30
    • 0038220747
    • A unifying framework for iteration reordering transformations
    • Technical Report CS-TR-3430, Dept. of Computer Science, University of Maryland, College Park
    • W. Kelly and W. Pugh. A unifying framework for iteration reordering transformations. Technical Report CS-TR-3430, Dept. of Computer Science, University of Maryland, College Park, 1995.
    • (1995)
    • Kelly, W.1    Pugh, W.2
  • 34
    • 0028409782
    • A singular loop transformation framework based on non-singular matrices
    • W. Li and K. Pingali. A singular loop transformation framework based on non-singular matrices. Intl. J. of Parallel Programming, 22(2): 183-205, 1994.
    • (1994) Intl. J. of Parallel Programming , vol.22 , Issue.2 , pp. 183-205
    • Li, W.1    Pingali, K.2
  • 35
    • 0034823777
    • Blocking and array contraction across arbitrarily nested loops using affine partitioning
    • A. Lim, S. Liao, and M. Lam. Blocking and array contraction across arbitrarily nested loops using affine partitioning. In ACM SIGPLAN PPoPP, pages 103-112, 2001.
    • (2001) ACM SIGPLAN PPoPP , pp. 103-112
    • Lim, A.1    Liao, S.2    Lam, M.3
  • 36
    • 0032662841
    • An affine partitioning algorithm to maximize parallelism and minimize communication
    • A. W. Lim, G. I. Cheong, and M. S. Lam. An affine partitioning algorithm to maximize parallelism and minimize communication. In ACM Intl. Conf. on Supercomputing, pages 228-237, 1999.
    • (1999) ACM Intl. Conf. on Supercomputing , pp. 228-237
    • Lim, A.W.1    Cheong, G.I.2    Lam, M.S.3
  • 37
    • 0032067773
    • Maximizing parallelism and minimizing synchronization with affine partitions
    • A. W. Lim and M. S. Lam. Maximizing parallelism and minimizing synchronization with affine partitions. Parallel Computing, 24(3-4):445-475, 1998.
    • (1998) Parallel Computing , vol.24 , Issue.3-4 , pp. 445-475
    • Lim, A.W.1    Lam, M.S.2
  • 38
    • 84869377033
    • The LooPo Project, Loop parallelization in the polytope model
    • The LooPo Project - Loop parallelization in the polytope model. http://www.fmi.uni-passau.de/loopo.
  • 44
    • 57349167317
    • Iterative optimization in the polyhedral model: Part II, multidimensional time
    • Tucson, Arizona, June
    • L.-N. Pouchet, C. Bastoul, J. Cavazos, and A. Cohen. Iterative optimization in the polyhedral model: Part II, multidimensional time. In PLDI'08, Tucson, Arizona, June 2008.
    • (2008) PLDI'08
    • Pouchet, L.-N.1    Bastoul, C.2    Cavazos, J.3    Cohen, A.4
  • 45
    • 51049121269
    • Iterative optimization in the polyhedral model: Part I, one-dimensional time
    • Mar
    • L.-N. Pouchet, C. Bastoul, A. Cohen, and N. Vasilache. Iterative optimization in the polyhedral model: Part I, one-dimensional time. In ACM CGO, Mar. 2007.
    • (2007) ACM CGO
    • Pouchet, L.-N.1    Bastoul, C.2    Cohen, A.3    Vasilache, N.4
  • 46
    • 84976676720
    • The omega test: A fast and practical integer programming algorithm for dependence analysis
    • Aug
    • W. Pugh. The omega test: a fast and practical integer programming algorithm for dependence analysis. Communications of the ACM, 8:102-114, Aug. 1992.
    • (1992) Communications of the ACM , vol.8 , pp. 102-114
    • Pugh, W.1
  • 48
    • 38249009019
    • Tiling multidimensional iteration spaces for multicomputers
    • J. Ramanujam and P. Sadayappan. Tiling multidimensional iteration spaces for multicomputers. JPDC, 16(2): 108-230, 1992.
    • (1992) JPDC , vol.16 , Issue.2 , pp. 108-230
    • Ramanujam, J.1    Sadayappan, P.2
  • 50
    • 0003929457
    • Automatic blocking of nested loops
    • Technical report, University of Tennessee, Knoxville, TN, Aug
    • R. Schreiber and J. Dongarra. Automatic blocking of nested loops. Technical report, University of Tennessee, Knoxville, TN, Aug. 1990.
    • (1990)
    • Schreiber, R.1    Dongarra, J.2
  • 52
    • 0032635362
    • New tiling techniques to improve cache temporal locality
    • Y. Song and Z. Li. New tiling techniques to improve cache temporal locality. In PLDI, pages 215-228, 1999.
    • (1999) PLDI , pp. 215-228
    • Song, Y.1    Li, Z.2
  • 53
    • 57349127962
    • Scalable Program Optimization Techniques in the Polyhedral Model. PhD thesis, Universite de Paris-Sud, INRIA Futurs, Sept
    • N. Vasilache. Scalable Program Optimization Techniques in the Polyhedral Model. PhD thesis, Universite de Paris-Sud, INRIA Futurs, Sept. 2007.
    • (2007)
    • Vasilache, N.1
  • 56
    • 0035064349
    • Automated Empirical Optimizations of Software and the ATLAS Project
    • R. Whaley, A. Petitet, and J. Dongarra. Automated Empirical Optimizations of Software and the ATLAS Project. Parallel Computing, 2000.
    • (2000) Parallel Computing
    • Whaley, R.1    Petitet, A.2    Dongarra, J.3
  • 57
    • 0004005802
    • A library for doing polyhedral operations
    • Technical Report RR-2157, IRISA, 1993
    • D. K. Wilde. A library for doing polyhedral operations. Technical Report RR-2157, IRISA, 1993.
    • Wilde, D.K.1
  • 58
    • 84976827033
    • A data locality optimizing algorithm
    • M. Wolf and M. S. Lam. A data locality optimizing algorithm. In ACM SIGPLAN PLDI '91, pages 30-44, 1991.
    • (1991) ACM SIGPLAN PLDI '91 , pp. 30-44
    • Wolf, M.1    Lam, M.S.2
  • 59
    • 0026232450
    • A loop transformation theory and an algorithm to maximize parallelism
    • M. Wolf and M. S. Lam. A loop transformation theory and an algorithm to maximize parallelism. IEEE Trans. Parallel Distrib. Syst., 2(4):452-471, 1991.
    • (1991) IEEE Trans. Parallel Distrib. Syst , vol.2 , Issue.4 , pp. 452-471
    • Wolf, M.1    Lam, M.S.2
  • 60
    • 0003125942
    • Communication-minimal tiling of uniform dependence loops
    • J. Xue. Communication-minimal tiling of uniform dependence loops. JPDC, 42(1):42-59, 1997.
    • (1997) JPDC , vol.42 , Issue.1 , pp. 42-59
    • Xue, J.1
  • 61
    • 0442303278
    • Kluwer Academic Publishers, Norwell, MA, USA
    • J. Xue. Loop tiling for parallelism. Kluwer Academic Publishers, Norwell, MA, USA, 2000.
    • (2000) Loop tiling for parallelism
    • Xue, J.1
  • 62
    • 1242285116
    • Transforming complex loop nests for locality
    • Q. Yi, K. Kennedy, and V. Adve. Transforming complex loop nests for locality. J. of Supercomputing, 27(3):219-264, 2004.
    • (2004) J. of Supercomputing , vol.27 , Issue.3 , pp. 219-264
    • Yi, Q.1    Kennedy, K.2    Adve, V.3


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.