-
1
-
-
0343462141
-
Automated empirical optimizations of software and the ATLAS project
-
Whaley R.C., Petitet A., and Dongarra J.J. Automated empirical optimizations of software and the ATLAS project. Parallel Comput. 27 1-2 (2001) 3-35
-
(2001)
Parallel Comput.
, vol.27
, Issue.1-2
, pp. 3-35
-
-
Whaley, R.C.1
Petitet, A.2
Dongarra, J.J.3
-
2
-
-
0002363292
-
Iterative compilation in program optimization
-
Kisuki T., Knijnenburg P., O'Boyle M., and Wijshoff H. Iterative compilation in program optimization. Proc. CPC'10, Compilers for Parallel Computers (2000) 35-44
-
(2000)
Proc. CPC'10, Compilers for Parallel Computers
, pp. 35-44
-
-
Kisuki, T.1
Knijnenburg, P.2
O'Boyle, M.3
Wijshoff, H.4
-
3
-
-
0036679993
-
Adaptive optimizing compilers for the 21st century
-
Cooper K.D., Subramanian D., and Torczon L. Adaptive optimizing compilers for the 21st century. J. Supercomput. 23 1 (2002) 7-22
-
(2002)
J. Supercomput.
, vol.23
, Issue.1
, pp. 7-22
-
-
Cooper, K.D.1
Subramanian, D.2
Torczon, L.3
-
4
-
-
84934324885
-
-
D. Parello, O. Temam, A. Cohen, J.-M. Verdun, Towards a systematic, pragmatic and architecture-aware program optimization process for complex processors, in: ACM Supercomputing'04, Pittsburgh, Pennsylvania, 2004, p. 15
-
-
-
-
5
-
-
0344908850
-
Automatic intra-register vectorization for the Intel architecture
-
Bik A.J.C., Girkar M., Grey P.M., and Tian X. Automatic intra-register vectorization for the Intel architecture. Int. J. Parallel Programming 30 2 (2002) 65-98
-
(2002)
Int. J. Parallel Programming
, vol.30
, Issue.2
, pp. 65-98
-
-
Bik, A.J.C.1
Girkar, M.2
Grey, P.M.3
Tian, X.4
-
6
-
-
33745672359
-
-
Lengauer C., Batory D., Consel C., and Odersky M. (Eds), Springer-Verlag
-
In: Lengauer C., Batory D., Consel C., and Odersky M. (Eds). Domain-Specific Program Generation. LNCS vol. 3016 (2003), Springer-Verlag
-
(2003)
LNCS
, vol.3016
-
-
-
7
-
-
0031594006
-
-
D.I. August, D.A. Connors, S.A. Mahlke, J.W. Sias, K.M. Crozier, B.-C. Cheng, P.R. Eaton, Q.B. Olaniran, W.-M. Hwu, Integrated predicated and speculative execution in the IMPACT EPIC architecture, in: Proceedings of the 25th Intl. Symp. on Computer Architecture, 1998, pp. 227-237
-
-
-
-
8
-
-
0033076827
-
The LRPD test: Speculative run-time parallelization of loops with privatization and reduction parallelization
-
Compilers and Languages for Parallel and Distributed Computers (special issue)
-
Rauchwerger L., and Padua D. The LRPD test: Speculative run-time parallelization of loops with privatization and reduction parallelization. Compilers and Languages for Parallel and Distributed Computers. IEEE Trans. Parallel Distrib. Syst. 10 2 (1999) 160-180 (special issue)
-
(1999)
IEEE Trans. Parallel Distrib. Syst.
, vol.10
, Issue.2
, pp. 160-180
-
-
Rauchwerger, L.1
Padua, D.2
-
9
-
-
35048850130
-
A polyhedral approach to ease the composition of program transformations
-
Euro-Par'04, Springer-Verlag, Pisa, Italy
-
Cohen A., Girbal S., and Temam O. A polyhedral approach to ease the composition of program transformations. Euro-Par'04. LNCS vol. 3149 (2004), Springer-Verlag, Pisa, Italy 292-303
-
(2004)
LNCS
, vol.3149
, pp. 292-303
-
-
Cohen, A.1
Girbal, S.2
Temam, O.3
-
10
-
-
3042561893
-
-
X. Li, M.-J. Garzaran, D. Padua, A dynamically tuned sorting library, in: ACM Conf. on Code Generation and Optimization, CGO'04, San Jose, CA, 2004, pp. 111-124
-
-
-
-
11
-
-
33745675546
-
-
T. Veldhuizen, D. Gannon, Active libraries: Rethinking the roles of compilers and libraries, in: SIAM Workshop on Object Oriented Methods for Inter-operable Scientific and Engineering Computing, 1998, pp. 21-23
-
-
-
-
12
-
-
17244380718
-
-
M.D. Smith, Overcoming the challenges to feedback-directed optimization, in: ACM SIGPLAN Workshop on Dynamic and Adaptive Compilation and Optimization, 2000, pp. 1-11 (Keynote Talk)
-
-
-
-
13
-
-
10744232785
-
-
K. Yotov, X. Li, G. Ren, M. Cibulskis, G. DeJong, M. Garzaran, D. Padua, K. Pingali, P. Stodghill, P. Wu, A comparison of empirical and model-driven optimization, in: ACM Symp. on Programming Language Design and Implementation, PLDI'03, San Diego, CA, 2003, pp. 63-76
-
-
-
-
14
-
-
0031636309
-
-
M. Frigo, S.G. Johnson, FFTW: An adaptive software architecture for the FFT, in: Proc. of the ICASSP Conf., vol. 3, 1998, pp. 1381-1384
-
-
-
-
15
-
-
0037882884
-
Techniques for the translation of MATLAB programs into Fortran90
-
De Rose L., and Padua D. Techniques for the translation of MATLAB programs into Fortran90. ACM Trans. Programming Languages Systems 21 2 (1999) 286-323
-
(1999)
ACM Trans. Programming Languages Systems
, vol.21
, Issue.2
, pp. 286-323
-
-
De Rose, L.1
Padua, D.2
-
16
-
-
0034825496
-
-
A. Chauhan, K. Kennedy, Optimizing strategies for telescoping languages: Procedure strength reduction and procedure vectorization, in: ACM Intl. Conf. on Supercomput., ICS'04, 2001, pp. 92-101
-
-
-
-
17
-
-
1542396679
-
SPIRAL: A generator for platform-adapted libraries of signal processing algorithms
-
Automatic Performance Tuning (special Issue)
-
Püschel M., Singer B., Xiong J., Moura J., Johnson J., Padua D., Veloso M., and Johnson R.W. SPIRAL: A generator for platform-adapted libraries of signal processing algorithms. Automatic Performance Tuning. J. High Performance Computing and Applications 18 1 (2004) 21-45 (special Issue)
-
(2004)
J. High Performance Computing and Applications
, vol.18
, Issue.1
, pp. 21-45
-
-
Püschel, M.1
Singer, B.2
Xiong, J.3
Moura, J.4
Johnson, J.5
Padua, D.6
Veloso, M.7
Johnson, R.W.8
-
18
-
-
33745650325
-
-
Open research compiler, http://ipf-orc.sourceforge.net
-
-
-
-
19
-
-
30944439123
-
GENERIC and GIMPLE: A new tree representation for entire functions
-
Merill J. GENERIC and GIMPLE: A new tree representation for entire functions. Proceedings of the 2003 GCC Developers Summit (2003) 171-180. http://www.gccsummit.org/2003
-
(2003)
Proceedings of the 2003 GCC Developers Summit
, pp. 171-180
-
-
Merill, J.1
-
20
-
-
32844473507
-
-
A. Cohen, S. Girbal, D. Parello, M. Sigler, O. Temam, N. Vasilache, Facilitating the search for compositions of program transformations, in: ACM Intl. Conf. on Supercomputing, ICS'05, Boston, Massachusetts, 2005, pp. 151-160
-
-
-
-
21
-
-
0033906425
-
-
K. Kennedy, Telescoping languages: A compiler strategy for implementation of high-level domain-specific programming systems, in: Proc. Intl. Parallel and Distributed Processing Symposium, IPIPS'00, 2000, pp. 297-304
-
-
-
-
22
-
-
35248851156
-
-
C. Calcagno, W. Taha, L. Huang, X. Leroy, Implementing multi-stage languages using ASTs, Gensym, and reflection, in: ACM SIGPLAN/SIGSOFT Intl. Conf. Generative Programming and Component Engineering, GPCE'03, 2003, pp. 57-76
-
-
-
-
23
-
-
33646129674
-
Implicitly heterogeneous multi-stage programming
-
ACM SIGPLAN/SIGSOFT Intl. Conf. Generative Programming and Component Engineering. GPCE'05
-
Ekhardt J., Kaiabachev R., Pašalić E., Swadi K., and Taha W. Implicitly heterogeneous multi-stage programming. ACM SIGPLAN/SIGSOFT Intl. Conf. Generative Programming and Component Engineering. GPCE'05. LNCS vol. 3676 (2005)
-
(2005)
LNCS
, vol.3676
-
-
Ekhardt, J.1
Kaiabachev, R.2
Pašalić, E.3
Swadi, K.4
Taha, W.5
-
24
-
-
35048873677
-
Semantic-driven parallelization of loops operating on user-defined containers
-
Workshop on Languages and Compilers for Parallel Computing, Springer-Verlag
-
Quinlan D.J., Schordan M., Yi Q., and de Supinski B.R. Semantic-driven parallelization of loops operating on user-defined containers. Workshop on Languages and Compilers for Parallel Computing. LNCS vol. 2958 (2003), Springer-Verlag 524-538
-
(2003)
LNCS
, vol.2958
, pp. 524-538
-
-
Quinlan, D.J.1
Schordan, M.2
Yi, Q.3
de Supinski, B.R.4
-
25
-
-
33745656856
-
-
O. Beckmann, A. Houghton, P.H.J. Kelly, M. Mellor, Run-time code generation in C++ as a foundation for domain-specific optimisation, in: Proceedings of the 2003 Dagstuhl Workshop on Domain-Specific Program Generation, 2003
-
-
-
-
26
-
-
0033691593
-
-
W. Taha, A sound reduction semantics for untyped CBN mutli-stage computation. or, the theory of MetaML is non-trival, in: Proc. of the ACM workshop on Partial Evaluation and semantics-based Program Manipulation, PEPM'00, Boston, Massachusetts, 2000, pp. 34-43
-
-
-
-
27
-
-
0038643987
-
-
W. Taha, M.F. Nielsen, Environment classifiers, in: ACM Symp. on Principles of Programming Languages, PoPL'03, 2003, pp. 26-37
-
-
-
-
28
-
-
35048882935
-
ML-like inference for classifiers
-
European Symposium on Programming. Schmidt D. (Ed). ESOP, Springer-Verlag
-
Calcagno C., Moggi E., and Taha W. ML-like inference for classifiers. In: Schmidt D. (Ed). European Symposium on Programming. ESOP. LNCS vol. 2986 (2004), Springer-Verlag 79-93
-
(2004)
LNCS
, vol.2986
, pp. 79-93
-
-
Calcagno, C.1
Moggi, E.2
Taha, W.3
-
29
-
-
33745673440
-
-
Standard performance evaluation corporation, http://www.spec.org
-
-
-
-
32
-
-
0033413294
-
Parallelization of divide-and-conquer by translation to nested loops
-
Herrmann C.A., and Lengauer C. Parallelization of divide-and-conquer by translation to nested loops. J. Funct. Programming 9 3 (1999) 279-310
-
(1999)
J. Funct. Programming
, vol.9
, Issue.3
, pp. 279-310
-
-
Herrmann, C.A.1
Lengauer, C.2
-
33
-
-
0034197220
-
HDC: A higher-order language for divide-and-conquer
-
Herrmann C.A., and Lengauer C. HDC: A higher-order language for divide-and-conquer. Parallel Process. Lett. 10 2-3 (2000) 239-250
-
(2000)
Parallel Process. Lett.
, vol.10
, Issue.2-3
, pp. 239-250
-
-
Herrmann, C.A.1
Lengauer, C.2
-
34
-
-
24944457551
-
Generating message-passing programs from abstract specifications by partial evaluation
-
Herrmann C.A. Generating message-passing programs from abstract specifications by partial evaluation. Parallel Process. Lett. 15 3 (2005) 305-320
-
(2005)
Parallel Process. Lett.
, vol.15
, Issue.3
, pp. 305-320
-
-
Herrmann, C.A.1
-
35
-
-
0029203616
-
-
S. Liang, P. Hudak, M. Jones, Monad transformers and modular interpreters, in: ACM Symp. on Principles of Programming Languages, PoPL'95, 1995, pp. 333-343
-
-
-
-
36
-
-
0001813087
-
Domain-specific languages: An annotated bibliography
-
Van Deursen A., Klint P., and Visser J. Domain-specific languages: An annotated bibliography. ACM SIGPLAN Notices 35 6 (2000) 26-36
-
(2000)
ACM SIGPLAN Notices
, vol.35
, Issue.6
, pp. 26-36
-
-
Van Deursen, A.1
Klint, P.2
Visser, J.3
-
37
-
-
24944568375
-
-
O. Kiselyov, K.N. Swadi, W. Taha, A methodology for generating verified combinatorial circuits, in: Embedded Software Conf., EMSOFT'04, Pisa, Italy, 2004, pp. 249-258
-
-
-
-
38
-
-
33646133205
-
Multi-stage programming with functors and monads: Eliminating abstraction overhead from generic code
-
ACM SIGPLAN/SIGSOFT Intl. Conf. Generative Programming and Component Engineering. GPCE'05
-
Carette J., and Kiselyov O. Multi-stage programming with functors and monads: Eliminating abstraction overhead from generic code. ACM SIGPLAN/SIGSOFT Intl. Conf. Generative Programming and Component Engineering. GPCE'05. LNCS vol. 3676 (2005)
-
(2005)
LNCS
, vol.3676
-
-
Carette, J.1
Kiselyov, O.2
-
39
-
-
0001448065
-
Some efficient solutions to the affine scheduling problem, part II, multidimensional time
-
See also Part I, one dimensional time, 21(5):315-348
-
Feautrier P. Some efficient solutions to the affine scheduling problem, part II, multidimensional time. Int. J. Parallel Programming 21 6 (1992) 389-420 See also Part I, one dimensional time, 21(5):315-348
-
(1992)
Int. J. Parallel Programming
, vol.21
, Issue.6
, pp. 389-420
-
-
Feautrier, P.1
-
40
-
-
33745668033
-
-
M.E. Wolf, Improving locality and parallelism in nested loops, Ph.D. Thesis, Stanford University, published as CSL-TR-92-538 (August 1992)
-
-
-
-
41
-
-
33745663819
-
-
W. Kelly, Optimization within a unified transformation framework, Tech. Rep. CS-TR-3725, University of Maryland, 1996
-
-
-
-
42
-
-
33745660635
-
-
C. Bastoul, A. Cohen, S. Girbal, S. Sharma, O. Temam, Putting polyhedral loop transformations to work, in: Workshop on Languages and Compilers for Parallel Computing, LCPC'03, LNCS, College Station, Texas, 2003, pp. 23-30
-
-
-
-
43
-
-
10444289646
-
-
C. Bastoul, Code generation in the polyhedral model is easier than you think, in: Parallel Architectures and Compilation Techniques, PACT'04, Juan-les-Pins, 2004, pp. 7-16
-
-
-
-
46
-
-
0031381304
-
-
V. Loechner, D. Wilde, Parameterized polyhedra and their vertices, Int. J. Parallel Programming 25 (6). http://icps.u-strasbg.fr/PolyLib
-
-
-
-
47
-
-
84976676720
-
A practical algorithm for exact array dependence analysis
-
Pugh W. A practical algorithm for exact array dependence analysis. Comm. ACM 35 8 (1992) 27-47
-
(1992)
Comm. ACM
, vol.35
, Issue.8
, pp. 27-47
-
-
Pugh, W.1
-
48
-
-
84958731989
-
-
P. Feautrier, Array expansion, in: ACM Intl. Conf. on Supercomputing, St. Malo, France, 1988, pp. 429-441
-
-
-
-
50
-
-
0029728670
-
-
J. Llosa, Swing modulo scheduling: A lifetime-sensitive approach, in: Parallel Architectures and Compilation Techniques, PACT'96, 1996, pp. 80-87
-
-
-
-
51
-
-
0001775822
-
Using C++ template metaprograms
-
Veldhuizen T. Using C++ template metaprograms. C++ Report 7 4 (1995) 36-43
-
(1995)
C++ Report
, vol.7
, Issue.4
, pp. 36-43
-
-
Veldhuizen, T.1
-
52
-
-
0001379889
-
'C and tcc: A language and compiler for dynamic code generation
-
Poletto M., Hsieh W.C., Engler D.R., and Kaashoek M.F. 'C and tcc: A language and compiler for dynamic code generation. ACM Trans. Programming Languages Systems 21 2 (1999) 324-369
-
(1999)
ACM Trans. Programming Languages Systems
, vol.21
, Issue.2
, pp. 324-369
-
-
Poletto, M.1
Hsieh, W.C.2
Engler, D.R.3
Kaashoek, M.F.4
-
53
-
-
0018286453
-
A new implementation technique for applicative languages
-
Turner D.A. A new implementation technique for applicative languages. Software-Practice and Experience 9 1 (1979) 31-49
-
(1979)
Software-Practice and Experience
, vol.9
, Issue.1
, pp. 31-49
-
-
Turner, D.A.1
-
54
-
-
3042658703
-
-
C. Lattner, V. Adve, LLVM: A compilation framework for lifelong program analysis & transformation, in: ACM Conf. on Code Generation and Optimization, CGO'04, San Jose, CA, 2004, pp. 75-88
-
-
-
|