메뉴 건너뛰기




Volumn , Issue , 2013, Pages 127-138

When polyhedral transformations meet SIMD code generation

Author keywords

Affine Scheduling; Autotuning; Compiler Optimization; Loop Transformations; Program synthesis

Indexed keywords

AUTOTUNING; COMPILER OPTIMIZATIONS; FINE GRAIN PARALLELISM; HIGH-LEVEL TRANSFORMATIONS; LOOP TRANSFORMATION; MULTIDIMENSIONAL DATA; POLYHEDRAL TRANSFORMATIONS; PROGRAM SYNTHESIS;

EID: 84883089997     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1145/2462156.2462187     Document Type: Conference Paper
Times cited : (72)

References (43)
  • 2
    • 84883120973 scopus 로고    scopus 로고
    • PolyOpt/C. http://hpcrl.cse.ohio-state.edu/wiki/index.php/polyopt/c.
    • PolyOpt/C
  • 3
    • 84883124392 scopus 로고    scopus 로고
    • www.spiral.net/software/stencilgen.html.
  • 7
    • 32844469860 scopus 로고    scopus 로고
    • More legal transformations for locality
    • LNCS 3149 Pisa, august
    • C. Bastoul and P. Feautrier. More legal transformations for locality. In Euro-Par'10 Intl. Euro-Par conference, LNCS 3149, pages 272-283, Pisa, august 2004.
    • (2004) Euro-Par'10 Intl. Euro-Par Conference , pp. 272-283
    • Bastoul, C.1    Feautrier, P.2
  • 10
    • 84976711318 scopus 로고
    • Programming pearls: Little languages
    • J. Bentley. Programming pearls: little languages. Communications of the ACM, 29(8):711-721, 1986.
    • (1986) Communications of the ACM , vol.29 , Issue.8 , pp. 711-721
    • Bentley, J.1
  • 11
    • 74049164978 scopus 로고    scopus 로고
    • A practical automatic polyhedral program optimization system
    • June
    • U. Bondhugula, A. Hartono, J. Ramanujam, and P. Sadayappan. A practical automatic polyhedral program optimization system. In PLDI, June 2008.
    • (2008) PLDI
    • Bondhugula, U.1    Hartono, A.2    Ramanujam, J.3    Sadayappan, P.4
  • 15
    • 8344245462 scopus 로고    scopus 로고
    • Vectorization for simd architectures with alignment constraints
    • A. Eichenberger, P. Wu, and K. O'Brien. Vectorization for simd architectures with alignment constraints. In PLDI, 2004.
    • (2004) PLDI
    • Eichenberger, A.1    Wu, P.2    O'Brien, K.3
  • 16
    • 0001448065 scopus 로고
    • Some efficient solutions to the affine scheduling problem, Part II: Multidimensional time
    • Dec.
    • P. Feautrier. Some efficient solutions to the affine scheduling problem, part II: multidimensional time. Intl. J. of Parallel Programming, 21(6):389-420, Dec. 1992.
    • (1992) Intl. J. of Parallel Programming , vol.21 , Issue.6 , pp. 389-420
    • Feautrier, P.1
  • 17
    • 0348209599 scopus 로고    scopus 로고
    • A fast fourier transform compiler
    • M. Frigo. A fast Fourier transform compiler. In PLDI, pages 169-180, 1999.
    • (1999) PLDI , pp. 169-180
    • Frigo, M.1
  • 18
    • 20744449792 scopus 로고    scopus 로고
    • The design and implementation of FFTW3
    • M. Frigo and S. G. Johnson. The design and implementation of FFTW3. Proc. of the IEEE, 93(2):216-231, 2005.
    • (2005) Proc. of the IEEE , vol.93 , Issue.2 , pp. 216-231
    • Frigo, M.1    Johnson, S.G.2
  • 20
    • 67649530725 scopus 로고
    • Little language processing, an alternative to courses on compiler construction
    • K. J. Gough. Little language processing, an alternative to courses on compiler construction. SIGCSE Bulletin, 13(3):31-34, 1981.
    • (1981) SIGCSE Bulletin , vol.13 , Issue.3 , pp. 31-34
    • Gough, K.J.1
  • 28
    • 0034446825 scopus 로고    scopus 로고
    • Exploiting superword level parallelism with multimedia instruction sets
    • S. Larsen and S. P. Amarasinghe. Exploiting superword level parallelism with multimedia instruction sets. In PLDI, 2000.
    • (2000) PLDI
    • Larsen, S.1    Amarasinghe, S.P.2
  • 29
    • 0030645995 scopus 로고    scopus 로고
    • Maximizing parallelism and minimizing synchronization with affine transforms
    • A. W. Lim and M. S. Lam. Maximizing parallelism and minimizing synchronization with affine transforms. In POPL, pages 201-214, 1997.
    • (1997) POPL , pp. 201-214
    • Lim, A.W.1    Lam, M.S.2
  • 30
    • 33746034953 scopus 로고    scopus 로고
    • Auto-vectorization of interleaved data for simd
    • D. Nuzman, I. Rosen, and A. Zaks. Auto-vectorization of interleaved data for simd. In PLDI, 2006.
    • (2006) PLDI
    • Nuzman, D.1    Rosen, I.2    Zaks, A.3
  • 31
    • 63549093768 scopus 로고    scopus 로고
    • Outer-loop vectorization: Revisited for short simd architectures
    • D. Nuzman and A. Zaks. Outer-loop vectorization: revisited for short simd architectures. In PACT, 2008.
    • (2008) PACT
    • Nuzman, D.1    Zaks, A.2
  • 32
    • 57349167317 scopus 로고    scopus 로고
    • Iterative optimization in the polyhedral model: Part II, multidimensional time
    • ACM Press
    • L.-N. Pouchet, C. Bastoul, A. Cohen, and J. Cavazos. Iterative optimization in the polyhedral model: Part II, multidimensional time. In PLDI, pages 90-100. ACM Press, 2008.
    • (2008) PLDI , pp. 90-100
    • Pouchet, L.-N.1    Bastoul, C.2    Cohen, A.3    Cavazos, J.4
  • 36
    • 18944384585 scopus 로고    scopus 로고
    • Mechanizing the development of software
    • M. Broy, editor NATO ASI Series, IOS Press Kestrel Institute Technical Report KES.U.99.1
    • D. R. Smith. Mechanizing the development of software. In M. Broy, editor, Calculational System Design, Proc. of the International Summer School Marktoberdorf. NATO ASI Series, IOS Press, 1999. Kestrel Institute Technical Report KES.U.99.1.
    • (1999) Calculational System Design, Proc. of the International Summer School Marktoberdorf
    • Smith, D.R.1
  • 38
  • 40
    • 84872972843 scopus 로고    scopus 로고
    • Joint scheduling and layout optimization to enable multi-level vectorization
    • Jan.
    • N. Vasilache, B. Meister, M. Baskaran, and R. Lethin. Joint scheduling and layout optimization to enable multi-level vectorization. In Proc. of IMPACT'12, Jan. 2012.
    • (2012) Proc. of IMPACT'12
    • Vasilache, N.1    Meister, B.2    Baskaran, M.3    Lethin, R.4
  • 41
    • 58649099625 scopus 로고    scopus 로고
    • Algebraic signal processing theory: Cooley-tukey type algorithms for real dfts
    • Y. Voronenko and M. Püschel. Algebraic signal processing theory: Cooley-tukey type algorithms for real dfts. IEEE Transactions on Signal Processing, 57(1), 2009.
    • (2009) IEEE Transactions on Signal Processing , vol.57 , Issue.1
    • Voronenko, Y.1    Püschel, M.2
  • 42
    • 0003278639 scopus 로고    scopus 로고
    • Automatically tuned linear algebra software (ATLAS)
    • math-atlas. sourceforge.net
    • R. C. Whaley and J. Dongarra. Automatically Tuned Linear Algebra Software (ATLAS). In Proc. Supercomputing, 1998. math-atlas. sourceforge.net.
    • (1998) Proc. Supercomputing
    • Whaley, R.C.1    Dongarra, J.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.