Volume 62, Issue 1, 2006, Pages 25-46

In search of a program generator to implement generic transformations for high-performance computing

Author keywords

Adaptive libraries; Application specific program generators; Loop transformations; Multi stage programming

Indexed keywords

ABSTRACTING; AUTOMATIC PROGRAMMING; CODES (SYMBOLS); COMPUTATION THEORY; COMPUTATIONAL COMPLEXITY; COMPUTER ARCHITECTURE; COMPUTER PROGRAMMING LANGUAGES; MICROPROCESSOR CHIPS; OPTIMIZATION; SYNTACTICS;

EID: 33745665061     PISSN: 01676423     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.scico.2005.10.013     Document Type: Article
Times cited : (37)

References (54)
  • 1
    • 0343462141
    • Automated empirical optimizations of software and the ATLAS project
    • Whaley R.C., Petitet A., and Dongarra J.J. Automated empirical optimizations of software and the ATLAS project. Parallel Comput. 27 1-2 (2001) 3-35
    • (2001) Parallel Comput. , vol.27 , Issue.1-2 , pp. 3-35
    • Whaley, R.C.1    Petitet, A.2    Dongarra, J.J.3
  • 3
    • 0036679993
    • Adaptive optimizing compilers for the 21st century
    • Cooper K.D., Subramanian D., and Torczon L. Adaptive optimizing compilers for the 21st century. J. Supercomput. 23 1 (2002) 7-22
    • (2002) J. Supercomput. , vol.23 , Issue.1 , pp. 7-22
    • Cooper, K.D.1    Subramanian, D.2    Torczon, L.3
  • 4
    • 84934324885
    • D. Parello, O. Temam, A. Cohen, J.-M. Verdun, Towards a systematic, pragmatic and architecture-aware program optimization process for complex processors, in: ACM Supercomputing'04, Pittsburgh, Pennsylvania, 2004, p. 15
  • 5
    • 0344908850
    • Automatic intra-register vectorization for the Intel architecture
    • Bik A.J.C., Girkar M., Grey P.M., and Tian X. Automatic intra-register vectorization for the Intel architecture. Int. J. Parallel Programming 30 2 (2002) 65-98
    • (2002) Int. J. Parallel Programming , vol.30 , Issue.2 , pp. 65-98
    • Bik, A.J.C.1    Girkar, M.2    Grey, P.M.3    Tian, X.4
  • 6
    • 33745672359
    • Lengauer C., Batory D., Consel C., and Odersky M. (Eds), Springer-Verlag
    • In: Lengauer C., Batory D., Consel C., and Odersky M. (Eds). Domain-Specific Program Generation. LNCS vol. 3016 (2003), Springer-Verlag
    • (2003) LNCS , vol.3016
  • 7
    • 0031594006
    • D.I. August, D.A. Connors, S.A. Mahlke, J.W. Sias, K.M. Crozier, B.-C. Cheng, P.R. Eaton, Q.B. Olaniran, W.-M. Hwu, Integrated predicated and speculative execution in the IMPACT EPIC architecture, in: Proceedings of the 25th Intl. Symp. on Computer Architecture, 1998, pp. 227-237
  • 8
    • 0033076827
    • The LRPD test: Speculative run-time parallelization of loops with privatization and reduction parallelization
    • Compilers and Languages for Parallel and Distributed Computers (special issue)
    • Rauchwerger L., and Padua D. The LRPD test: Speculative run-time parallelization of loops with privatization and reduction parallelization. Compilers and Languages for Parallel and Distributed Computers. IEEE Trans. Parallel Distrib. Syst. 10 2 (1999) 160-180 (special issue)
    • (1999) IEEE Trans. Parallel Distrib. Syst. , vol.10 , Issue.2 , pp. 160-180
    • Rauchwerger, L.1    Padua, D.2
  • 9
    • 35048850130
    • A polyhedral approach to ease the composition of program transformations
    • Euro-Par'04, Springer-Verlag, Pisa, Italy
    • Cohen A., Girbal S., and Temam O. A polyhedral approach to ease the composition of program transformations. Euro-Par'04. LNCS vol. 3149 (2004), Springer-Verlag, Pisa, Italy 292-303
    • (2004) LNCS , vol.3149 , pp. 292-303
    • Cohen, A.1    Girbal, S.2    Temam, O.3
  • 10
    • 3042561893
    • X. Li, M.-J. Garzaran, D. Padua, A dynamically tuned sorting library, in: ACM Conf. on Code Generation and Optimization, CGO'04, San Jose, CA, 2004, pp. 111-124
  • 11
    • 33745675546
    • T. Veldhuizen, D. Gannon, Active libraries: Rethinking the roles of compilers and libraries, in: SIAM Workshop on Object Oriented Methods for Inter-operable Scientific and Engineering Computing, 1998, pp. 21-23
  • 12
    • 17244380718
    • M.D. Smith, Overcoming the challenges to feedback-directed optimization, in: ACM SIGPLAN Workshop on Dynamic and Adaptive Compilation and Optimization, 2000, pp. 1-11 (Keynote Talk)
  • 13
    • 10744232785
    • K. Yotov, X. Li, G. Ren, M. Cibulskis, G. DeJong, M. Garzaran, D. Padua, K. Pingali, P. Stodghill, P. Wu, A comparison of empirical and model-driven optimization, in: ACM Symp. on Programming Language Design and Implementation, PLDI'03, San Diego, CA, 2003, pp. 63-76
  • 14
    • 0031636309
    • M. Frigo, S.G. Johnson, FFTW: An adaptive software architecture for the FFT, in: Proc. of the ICASSP Conf., vol. 3, 1998, pp. 1381-1384
  • 15
    • 0037882884
    • Techniques for the translation of MATLAB programs into Fortran90
    • De Rose L., and Padua D. Techniques for the translation of MATLAB programs into Fortran90. ACM Trans. Programming Languages Systems 21 2 (1999) 286-323
    • (1999) ACM Trans. Programming Languages Systems , vol.21 , Issue.2 , pp. 286-323
    • De Rose, L.1    Padua, D.2
  • 16
    • 0034825496
    • A. Chauhan, K. Kennedy, Optimizing strategies for telescoping languages: Procedure strength reduction and procedure vectorization, in: ACM Intl. Conf. on Supercomput., ICS'01, 2001, pp. 92-101
  • 18
    • 33745650325
    • Open research compiler, http://ipf-orc.sourceforge.net
  • 19
    • 30944439123
    • GENERIC and GIMPLE: A new tree representation for entire functions
    • Merrill J. GENERIC and GIMPLE: A new tree representation for entire functions. Proceedings of the 2003 GCC Developers Summit (2003) 171-180. http://www.gccsummit.org/2003
    • (2003) Proceedings of the 2003 GCC Developers Summit , pp. 171-180
    • Merrill, J.1
  • 20
    • 32844473507
    • A. Cohen, S. Girbal, D. Parello, M. Sigler, O. Temam, N. Vasilache, Facilitating the search for compositions of program transformations, in: ACM Intl. Conf. on Supercomputing, ICS'05, Boston, Massachusetts, 2005, pp. 151-160
  • 21
    • 0033906425
    • K. Kennedy, Telescoping languages: A compiler strategy for implementation of high-level domain-specific programming systems, in: Proc. Intl. Parallel and Distributed Processing Symposium, IPDPS'00, 2000, pp. 297-304
  • 22
    • 35248851156
    • C. Calcagno, W. Taha, L. Huang, X. Leroy, Implementing multi-stage languages using ASTs, Gensym, and reflection, in: ACM SIGPLAN/SIGSOFT Intl. Conf. Generative Programming and Component Engineering, GPCE'03, 2003, pp. 57-76
  • 23
    • 33646129674
    • Implicitly heterogeneous multi-stage programming
    • ACM SIGPLAN/SIGSOFT Intl. Conf. Generative Programming and Component Engineering. GPCE'05
    • Eckhardt J., Kaiabachev R., Pašalić E., Swadi K., and Taha W. Implicitly heterogeneous multi-stage programming. ACM SIGPLAN/SIGSOFT Intl. Conf. Generative Programming and Component Engineering. GPCE'05. LNCS vol. 3676 (2005)
    • (2005) LNCS , vol.3676
    • Eckhardt, J.1    Kaiabachev, R.2    Pašalić, E.3    Swadi, K.4    Taha, W.5
  • 24
    • 35048873677
    • Semantic-driven parallelization of loops operating on user-defined containers
    • Workshop on Languages and Compilers for Parallel Computing, Springer-Verlag
    • Quinlan D.J., Schordan M., Yi Q., and de Supinski B.R. Semantic-driven parallelization of loops operating on user-defined containers. Workshop on Languages and Compilers for Parallel Computing. LNCS vol. 2958 (2003), Springer-Verlag 524-538
    • (2003) LNCS , vol.2958 , pp. 524-538
    • Quinlan, D.J.1    Schordan, M.2    Yi, Q.3    de Supinski, B.R.4
  • 25
    • 33745656856
    • O. Beckmann, A. Houghton, P.H.J. Kelly, M. Mellor, Run-time code generation in C++ as a foundation for domain-specific optimisation, in: Proceedings of the 2003 Dagstuhl Workshop on Domain-Specific Program Generation, 2003
  • 26
    • 0033691593
    • W. Taha, A sound reduction semantics for untyped CBN multi-stage computation. Or, the theory of MetaML is non-trivial, in: Proc. of the ACM Workshop on Partial Evaluation and Semantics-Based Program Manipulation, PEPM'00, Boston, Massachusetts, 2000, pp. 34-43
  • 27
    • 0038643987
    • W. Taha, M.F. Nielsen, Environment classifiers, in: ACM Symp. on Principles of Programming Languages, PoPL'03, 2003, pp. 26-37
  • 28
    • 35048882935
    • ML-like inference for classifiers
    • European Symposium on Programming. Schmidt D. (Ed). ESOP, Springer-Verlag
    • Calcagno C., Moggi E., and Taha W. ML-like inference for classifiers. In: Schmidt D. (Ed). European Symposium on Programming. ESOP. LNCS vol. 2986 (2004), Springer-Verlag 79-93
    • (2004) LNCS , vol.2986 , pp. 79-93
    • Calcagno, C.1    Moggi, E.2    Taha, W.3
  • 29
    • 33745673440
    • Standard performance evaluation corporation, http://www.spec.org
  • 32
    • 0033413294
    • Parallelization of divide-and-conquer by translation to nested loops
    • Herrmann C.A., and Lengauer C. Parallelization of divide-and-conquer by translation to nested loops. J. Funct. Programming 9 3 (1999) 279-310
    • (1999) J. Funct. Programming , vol.9 , Issue.3 , pp. 279-310
    • Herrmann, C.A.1    Lengauer, C.2
  • 33
    • 0034197220
    • HDC: A higher-order language for divide-and-conquer
    • Herrmann C.A., and Lengauer C. HDC: A higher-order language for divide-and-conquer. Parallel Process. Lett. 10 2-3 (2000) 239-250
    • (2000) Parallel Process. Lett. , vol.10 , Issue.2-3 , pp. 239-250
    • Herrmann, C.A.1    Lengauer, C.2
  • 34
    • 24944457551
    • Generating message-passing programs from abstract specifications by partial evaluation
    • Herrmann C.A. Generating message-passing programs from abstract specifications by partial evaluation. Parallel Process. Lett. 15 3 (2005) 305-320
    • (2005) Parallel Process. Lett. , vol.15 , Issue.3 , pp. 305-320
    • Herrmann, C.A.1
  • 35
    • 0029203616
    • S. Liang, P. Hudak, M. Jones, Monad transformers and modular interpreters, in: ACM Symp. on Principles of Programming Languages, PoPL'95, 1995, pp. 333-343
  • 36
    • 0001813087
    • Domain-specific languages: An annotated bibliography
    • Van Deursen A., Klint P., and Visser J. Domain-specific languages: An annotated bibliography. ACM SIGPLAN Notices 35 6 (2000) 26-36
    • (2000) ACM SIGPLAN Notices , vol.35 , Issue.6 , pp. 26-36
    • Van Deursen, A.1    Klint, P.2    Visser, J.3
  • 37
    • 24944568375
    • O. Kiselyov, K.N. Swadi, W. Taha, A methodology for generating verified combinatorial circuits, in: Embedded Software Conf., EMSOFT'04, Pisa, Italy, 2004, pp. 249-258
  • 38
    • 33646133205
    • Multi-stage programming with functors and monads: Eliminating abstraction overhead from generic code
    • ACM SIGPLAN/SIGSOFT Intl. Conf. Generative Programming and Component Engineering. GPCE'05
    • Carette J., and Kiselyov O. Multi-stage programming with functors and monads: Eliminating abstraction overhead from generic code. ACM SIGPLAN/SIGSOFT Intl. Conf. Generative Programming and Component Engineering. GPCE'05. LNCS vol. 3676 (2005)
    • (2005) LNCS , vol.3676
    • Carette, J.1    Kiselyov, O.2
  • 39
    • 0001448065
    • Some efficient solutions to the affine scheduling problem, part II, multidimensional time
    • See also Part I, one dimensional time, 21(5):315-348
    • Feautrier P. Some efficient solutions to the affine scheduling problem, part II, multidimensional time. Int. J. Parallel Programming 21 6 (1992) 389-420 See also Part I, one dimensional time, 21(5):315-348
    • (1992) Int. J. Parallel Programming , vol.21 , Issue.6 , pp. 389-420
    • Feautrier, P.1
  • 40
    • 33745668033
    • M.E. Wolf, Improving locality and parallelism in nested loops, Ph.D. Thesis, Stanford University, published as CSL-TR-92-538 (August 1992)
  • 41
    • 33745663819
    • W. Kelly, Optimization within a unified transformation framework, Tech. Rep. CS-TR-3725, University of Maryland, 1996
  • 42
    • 33745660635
    • C. Bastoul, A. Cohen, S. Girbal, S. Sharma, O. Temam, Putting polyhedral loop transformations to work, in: Workshop on Languages and Compilers for Parallel Computing, LCPC'03, LNCS, College Station, Texas, 2003, pp. 23-30
  • 43
    • 10444289646
    • C. Bastoul, Code generation in the polyhedral model is easier than you think, in: Parallel Architectures and Compilation Techniques, PACT'04, Juan-les-Pins, 2004, pp. 7-16
  • 46
    • 0031381304
    • V. Loechner, D. Wilde, Parameterized polyhedra and their vertices, Int. J. Parallel Programming 25 (6). http://icps.u-strasbg.fr/PolyLib
  • 47
    • 84976676720
    • A practical algorithm for exact array dependence analysis
    • Pugh W. A practical algorithm for exact array dependence analysis. Comm. ACM 35 8 (1992) 27-47
    • (1992) Comm. ACM , vol.35 , Issue.8 , pp. 27-47
    • Pugh, W.1
  • 48
    • 84958731989
    • P. Feautrier, Array expansion, in: ACM Intl. Conf. on Supercomputing, St. Malo, France, 1988, pp. 429-441
  • 50
    • 0029728670
    • J. Llosa, Swing modulo scheduling: A lifetime-sensitive approach, in: Parallel Architectures and Compilation Techniques, PACT'96, 1996, pp. 80-87
  • 51
    • 0001775822
    • Using C++ template metaprograms
    • Veldhuizen T. Using C++ template metaprograms. C++ Report 7 4 (1995) 36-43
    • (1995) C++ Report , vol.7 , Issue.4 , pp. 36-43
    • Veldhuizen, T.1
  • 53
    • 0018286453
    • A new implementation technique for applicative languages
    • Turner D.A. A new implementation technique for applicative languages. Software-Practice and Experience 9 1 (1979) 31-49
    • (1979) Software-Practice and Experience , vol.9 , Issue.1 , pp. 31-49
    • Turner, D.A.1
  • 54
    • 3042658703
    • C. Lattner, V. Adve, LLVM: A compilation framework for lifelong program analysis & transformation, in: ACM Conf. on Code Generation and Optimization, CGO'04, San Jose, CA, 2004, pp. 75-88


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.