2009, Pages 327-337

Polyhedral-model guided loop-nest auto-vectorization

Author keywords

[No Author keywords available]

Indexed keywords

ACCURATE PREDICTION; COST MODELS; LOOP TRANSFORMATION; LOOP VECTORIZATION; MULTIPLE TRANSFORMATION; OPTIMIZING COMPILERS; PERFORMANCE IMPACT; POLYHEDRAL FRAMEWORK; POLYHEDRAL MODELS; POLYHEDRAL REPRESENTATION; PREDICTIVE MODELLING; VECTORIZATION

EID: 70449626135     PISSN: 1089795X     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/PACT.2009.18     Document Type: Conference Paper
Times cited : (105)

References (26)
  • 1
    • 32844466554
    • An integrated simdization framework using virtual vectors
    • P. Wu, A. E. Eichenberger, A. Wang, and P. Zhao, "An integrated Simdization framework using virtual vectors," in ICS, 2005.
    • (2005) ICS
    • Wu, P.1    Eichenberger, A.E.2    Wang, A.3    Zhao, P.4
  • 3
    • 0344908850
    • Automatic intra-register vectorization for the Intel architecture
    • A. J. C. Bik, M. Girkar, P. M. Grey, and X. Tian, "Automatic intra-register vectorization for the Intel architecture," IJPP, vol. 30, no. 2, pp. 65-98, 2002.
    • (2002) IJPP , vol.30 , Issue.2 , pp. 65-98
    • Bik, A.J.C.1    Girkar, M.2    Grey, P.M.3    Tian, X.4
  • 4
    • 37149019455
    • Autovectorization in GCC - Two years later
    • June
    • D. Nuzman and A. Zaks, "Autovectorization in GCC - two years later," in the GCC Developer's summit, June 2006.
    • (2006) GCC Developer's summit
    • Nuzman, D.1    Zaks, A.2
  • 5
    • 33646554301
    • Superword-level parallelism in the presence of control flow
    • March
    • J. Shin, M. Hall, and J. Chame, "Superword-level parallelism in the presence of control flow," in CGO, March 2005.
    • (2005) CGO
    • Shin, J.1    Hall, M.2    Chame, J.3
  • 6
    • 63549093768
    • Outer-loop vectorization - Revisited for short SIMD architectures
    • October
    • D. Nuzman and A. Zaks, "Outer-loop vectorization - revisited for short SIMD architectures," in PACT, October 2008.
    • (2008) PACT
    • Nuzman, D.1    Zaks, A.2
  • 8
    • 74049164978
    • A practical automatic polyhedral parallelization and locality optimization system
    • Jun.
    • U. Bondhugula, A. Hartono, J. Ramanujam, and P. Sadayappan, "A practical automatic polyhedral parallelization and locality optimization system," in PLDI, Jun. 2008.
    • (2008) PLDI
    • Bondhugula, U.1    Hartono, A.2    Ramanujam, J.3    Sadayappan, P.4
  • 9
    • 57349167317
    • Iterative optimization in the polyhedral model: Part II, multidimensional time
    • Jun.
    • L.-N. Pouchet, C. Bastoul, A. Cohen, and J. Cavazos, "Iterative optimization in the polyhedral model: Part II, multidimensional time," in PLDI, Jun. 2008.
    • (2008) PLDI
    • Pouchet, L.-N.1    Bastoul, C.2    Cohen, A.3    Cavazos, J.4
  • 10
    • 0023438847
    • Automatic translation of Fortran programs to vector form
    • R. Allen and K. Kennedy, "Automatic translation of Fortran programs to vector form," ACM Tr. on Prog. Lang. and Systems, vol. 9, no. 4, pp. 491-542, 1987.
    • (1987) ACM Tr. on Prog. Lang. and Systems , vol.9 , Issue.4 , pp. 491-542
    • Allen, R.1    Kennedy, K.2
  • 12
    • 84948740064
    • Compiler-controlled caching in superword register files for multimedia extension architectures
    • September
    • J. Shin, J. Chame, and M. W. Hall, "Compiler-controlled caching in superword register files for multimedia extension architectures," in PACT, September 2002.
    • (2002) PACT
    • Shin, J.1    Chame, J.2    Hall, M.W.3
  • 13
    • 33746034953
    • Auto-vectorization of interleaved data for SIMD
    • D. Nuzman, I. Rosen, and A. Zaks, "Auto-vectorization of interleaved data for SIMD," in PLDI, 2006.
    • (2006) PLDI
    • Nuzman, D.1    Rosen, I.2    Zaks, A.3
  • 16
    • 0037340135
    • Probabilistic miss equations: Evaluating memory hierarchy performance
    • B. B. Fraguela, R. Doallo, and E. L. Zapata, "Probabilistic miss equations: Evaluating memory hierarchy performance," IEEE Trans. Comput., vol. 52, no. 3, pp. 321-336, 2003.
    • (2003) IEEE Trans. Comput. , vol.52 , Issue.3 , pp. 321-336
    • Fraguela, B.B.1    Doallo, R.2    Zapata, E.L.3
  • 17
    • 84958731989
    • Array expansion
    • St. Malo, France, Jul.
    • P. Feautrier, "Array expansion," in ICS, St. Malo, France, Jul. 1988.
    • (1988) ICS
    • Feautrier, P.1
  • 18
    • 33746593747
    • Semi-automatic composition of loop transformations for deep parallelism and memory hierarchies
    • Jun. special issue on Microgrids.
    • S. Girbal, N. Vasilache, C. Bastoul, A. Cohen, D. Parello, M. Sigler, and O. Temam, "Semi-automatic composition of loop transformations for deep parallelism and memory hierarchies," Intl. J. of Parallel Programming, vol. 34, no. 3, pp. 261-317, Jun. 2006, special issue on Microgrids.
    • (2006) Intl. J. of Parallel Programming , vol.34 , Issue.3 , pp. 261-317
    • Girbal, S.1    Vasilache, N.2    Bastoul, C.3    Cohen, A.4    Parello, D.5    Sigler, M.6    Temam, O.7
  • 19
    • 0001448065
    • Some efficient solutions to the affine scheduling problem, part II, multidimensional time
    • Dec.
    • P. Feautrier, "Some efficient solutions to the affine scheduling problem, part II, multidimensional time," Intl. J. of Parallel Programming, vol. 21, no. 6, pp. 389-420, Dec. 1992.
    • (1992) Intl. J. of Parallel Programming , vol.21 , Issue.6 , pp. 389-420
    • Feautrier, P.1
  • 20
    • 35048864273
    • see also Part I
    • see also Part I, one dimensional time, 21(5):315-348.
    • One Dimensional Time , vol.21 , Issue.5 , pp. 315-348
  • 22
    • 0030645995
    • Maximizing parallelism and minimizing synchronization with affine transforms
    • Paris, Jan.
    • A. Lim and M. Lam, "Maximizing parallelism and minimizing synchronization with affine transforms," in PoPL'24, Paris, Jan. 1997, pp. 201-214.
    • (1997) PoPL'24 , pp. 201-214
    • Lim, A.1    Lam, M.2
  • 23
    • 10444289646
    • Code generation in the polyhedral model is easier than you think
    • Sep.
    • C. Bastoul, "Code generation in the polyhedral model is easier than you think," in PACT, Sep. 2004.
    • (2004) PACT
    • Bastoul, C.1
  • 24
    • 63549147948
    • C. G. Lee, "UTDSP benchmarks," http://www.eecg.toronto.edu/~corinna/DSP/infrastructure/UTDSP.html, 1998.
    • (1998) UTDSP Benchmarks
    • Lee, C.G.1


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.