-
2
-
-
84883120973
-
-
PolyOpt/C. http://hpcrl.cse.ohio-state.edu/wiki/index.php/polyopt/c.
-
PolyOpt/C
-
-
-
3
-
-
84883124392
-
-
www.spiral.net/software/stencilgen.html.
-
-
-
-
5
-
-
77954003716
-
Parameterized tiling revisited
-
April
-
M. Baskaran, A. Hartono, S. Tavarageri, T. Henretty, J. Ramanujam, and P. Sadayappan. Parameterized tiling revisited. In CGO, April 2010.
-
(2010)
CGO
-
-
Baskaran, M.1
Hartono, A.2
Tavarageri, S.3
Henretty, T.4
Ramanujam, J.5
Sadayappan, P.6
-
7
-
-
32844469860
-
More legal transformations for locality
-
LNCS 3149, Pisa, August
-
C. Bastoul and P. Feautrier. More legal transformations for locality. In Euro-Par'10 Intl. Euro-Par Conference, LNCS 3149, pages 272-283, Pisa, August 2004.
-
(2004)
Euro-Par'10 Intl. Euro-Par Conference
, pp. 272-283
-
-
Bastoul, C.1
Feautrier, P.2
-
8
-
-
2942713451
-
Achieving extensibility through product-lines and domain-specific languages: A case study
-
D. Batory, C. Johnson, B. MacDonald, and D. von Heeder. Achieving extensibility through product-lines and domain-specific languages: A case study. ACM Transactions on Software Engineering and Methodology (TOSEM), 11(2):191-214, 2002.
-
(2002)
ACM Transactions on Software Engineering and Methodology (TOSEM)
, vol.11
, Issue.2
, pp. 191-214
-
-
Batory, D.1
Johnson, C.2
MacDonald, B.3
Von Heeder, D.4
-
10
-
-
84976711318
-
Programming pearls: Little languages
-
J. Bentley. Programming pearls: little languages. Communications of the ACM, 29(8):711-721, 1986.
-
(1986)
Communications of the ACM
, vol.29
, Issue.8
, pp. 711-721
-
-
Bentley, J.1
-
14
-
-
20744452904
-
Self adapting linear algebra algorithms and software
-
J. Demmel, J. Dongarra, V. Eijkhout, E. Fuentes, A. Petitet, R. Vuduc, C. Whaley, and K. Yelick. Self adapting linear algebra algorithms and software. Proc. of the IEEE, 93(2):293-312, 2005.
-
(2005)
Proc. of the IEEE
, vol.93
, Issue.2
, pp. 293-312
-
-
Demmel, J.1
Dongarra, J.2
Eijkhout, V.3
Fuentes, E.4
Petitet, A.5
Vuduc, R.6
Whaley, C.7
Yelick, K.8
-
15
-
-
8344245462
-
Vectorization for SIMD architectures with alignment constraints
-
A. Eichenberger, P. Wu, and K. O'Brien. Vectorization for SIMD architectures with alignment constraints. In PLDI, 2004.
-
(2004)
PLDI
-
-
Eichenberger, A.1
Wu, P.2
O'Brien, K.3
-
16
-
-
0001448065
-
Some efficient solutions to the affine scheduling problem, Part II: Multidimensional time
-
Dec.
-
P. Feautrier. Some efficient solutions to the affine scheduling problem, part II: multidimensional time. Intl. J. of Parallel Programming, 21(6):389-420, Dec. 1992.
-
(1992)
Intl. J. of Parallel Programming
, vol.21
, Issue.6
, pp. 389-420
-
-
Feautrier, P.1
-
17
-
-
0348209599
-
A fast Fourier transform compiler
-
M. Frigo. A fast Fourier transform compiler. In PLDI, pages 169-180, 1999.
-
(1999)
PLDI
, pp. 169-180
-
-
Frigo, M.1
-
18
-
-
20744449792
-
The design and implementation of FFTW3
-
M. Frigo and S. G. Johnson. The design and implementation of FFTW3. Proc. of the IEEE, 93(2):216-231, 2005.
-
(2005)
Proc. of the IEEE
, vol.93
, Issue.2
, pp. 216-231
-
-
Frigo, M.1
Johnson, S.G.2
-
19
-
-
33746593747
-
Semi-automatic composition of loop transformations
-
June
-
S. Girbal, N. Vasilache, C. Bastoul, A. Cohen, D. Parello, M. Sigler, and O. Temam. Semi-automatic composition of loop transformations. International Journal of Parallel Programming, 34(3):261-317, June 2006.
-
(2006)
International Journal of Parallel Programming
, vol.34
, Issue.3
, pp. 261-317
-
-
Girbal, S.1
Vasilache, N.2
Bastoul, C.3
Cohen, A.4
Parello, D.5
Sigler, M.6
Temam, O.7
-
20
-
-
67649530725
-
Little language processing, an alternative to courses on compiler construction
-
K. J. Gough. Little language processing, an alternative to courses on compiler construction. SIGCSE Bulletin, 13(3):31-34, 1981.
-
(1981)
SIGCSE Bulletin
, vol.13
, Issue.3
, pp. 31-34
-
-
Gough, K.J.1
-
22
-
-
70449702074
-
Parametric multi-level tiling of imperfectly nested loops
-
A. Hartono, M. Baskaran, C. Bastoul, A. Cohen, S. Krishnamoorthy, B. Norris, J. Ramanujam, and P. Sadayappan. Parametric multi-level tiling of imperfectly nested loops. In ICS, 2009.
-
(2009)
ICS
-
-
Hartono, A.1
Baskaran, M.2
Bastoul, C.3
Cohen, A.4
Krishnamoorthy, S.5
Norris, B.6
Ramanujam, J.7
Sadayappan, P.8
-
23
-
-
79953274591
-
Data layout transformation for stencil computations on short SIMD architectures
-
Saarbrücken, Germany, Mar. Springer Verlag
-
T. Henretty, K. Stock, L.-N. Pouchet, F. Franchetti, J. Ramanujam, and P. Sadayappan. Data layout transformation for stencil computations on short SIMD architectures. In ETAPS International Conference on Compiler Construction (CC'11), pages 225-245, Saarbrücken, Germany, Mar. 2011. Springer Verlag.
-
(2011)
ETAPS International Conference on Compiler Construction (CC'11)
, pp. 225-245
-
-
Henretty, T.1
Stock, K.2
Pouchet, L.-N.3
Franchetti, F.4
Ramanujam, J.5
Sadayappan, P.6
-
28
-
-
0034446825
-
Exploiting superword level parallelism with multimedia instruction sets
-
S. Larsen and S. P. Amarasinghe. Exploiting superword level parallelism with multimedia instruction sets. In PLDI, 2000.
-
(2000)
PLDI
-
-
Larsen, S.1
Amarasinghe, S.P.2
-
29
-
-
0030645995
-
Maximizing parallelism and minimizing synchronization with affine transforms
-
A. W. Lim and M. S. Lam. Maximizing parallelism and minimizing synchronization with affine transforms. In POPL, pages 201-214, 1997.
-
(1997)
POPL
, pp. 201-214
-
-
Lim, A.W.1
Lam, M.S.2
-
30
-
-
33746034953
-
Auto-vectorization of interleaved data for SIMD
-
D. Nuzman, I. Rosen, and A. Zaks. Auto-vectorization of interleaved data for SIMD. In PLDI, 2006.
-
(2006)
PLDI
-
-
Nuzman, D.1
Rosen, I.2
Zaks, A.3
-
31
-
-
63549093768
-
Outer-loop vectorization: Revisited for short SIMD architectures
-
D. Nuzman and A. Zaks. Outer-loop vectorization: revisited for short SIMD architectures. In PACT, 2008.
-
(2008)
PACT
-
-
Nuzman, D.1
Zaks, A.2
-
32
-
-
57349167317
-
Iterative optimization in the polyhedral model: Part II, multidimensional time
-
ACM Press
-
L.-N. Pouchet, C. Bastoul, A. Cohen, and J. Cavazos. Iterative optimization in the polyhedral model: Part II, multidimensional time. In PLDI, pages 90-100. ACM Press, 2008.
-
(2008)
PLDI
, pp. 90-100
-
-
Pouchet, L.-N.1
Bastoul, C.2
Cohen, A.3
Cavazos, J.4
-
33
-
-
78650842988
-
Combined iterative and model-driven optimization in an automatic parallelization framework
-
New Orleans, Louisiana, Nov.
-
L.-N. Pouchet, U. Bondhugula, C. Bastoul, A. Cohen, J. Ramanujam, and P. Sadayappan. Combined iterative and model-driven optimization in an automatic parallelization framework. In ACM Supercomputing Conf. (SC'10), New Orleans, Louisiana, Nov. 2010.
-
(2010)
ACM Supercomputing Conf. (SC'10)
-
-
Pouchet, L.-N.1
Bondhugula, U.2
Bastoul, C.3
Cohen, A.4
Ramanujam, J.5
Sadayappan, P.6
-
34
-
-
79251560668
-
Loop transformations: Convexity, pruning and optimization
-
Austin, TX, Jan.
-
L.-N. Pouchet, U. Bondhugula, C. Bastoul, A. Cohen, J. Ramanujam, P. Sadayappan, and N. Vasilache. Loop transformations: Convexity, pruning and optimization. In POPL, pages 549-562, Austin, TX, Jan. 2011.
-
(2011)
POPL
, pp. 549-562
-
-
Pouchet, L.-N.1
Bondhugula, U.2
Bastoul, C.3
Cohen, A.4
Ramanujam, J.5
Sadayappan, P.6
Vasilache, N.7
-
35
-
-
19344368072
-
SPIRAL: Code generation for DSP transforms
-
M. Püschel, J. M. F. Moura, J. Johnson, D. Padua, M. Veloso, B. Singer, J. Xiong, F. Franchetti, A. Gacic, Y. Voronenko, K. Chen, R. W. Johnson, and N. Rizzolo. SPIRAL: Code generation for DSP transforms. Proc. of the IEEE, 93(2):232-275, 2005.
-
(2005)
Proc. of the IEEE
, vol.93
, Issue.2
, pp. 232-275
-
-
Püschel, M.1
Moura, J.M.F.2
Johnson, J.3
Padua, D.4
Veloso, M.5
Singer, B.6
Xiong, J.7
Franchetti, F.8
Gacic, A.9
Voronenko, Y.10
Chen, K.11
Johnson, R.W.12
Rizzolo, N.13
-
36
-
-
18944384585
-
Mechanizing the development of software
-
M. Broy, editor NATO ASI Series, IOS Press Kestrel Institute Technical Report KES.U.99.1
-
D. R. Smith. Mechanizing the development of software. In M. Broy, editor, Calculational System Design, Proc. of the International Summer School Marktoberdorf. NATO ASI Series, IOS Press, 1999. Kestrel Institute Technical Report KES.U.99.1.
-
(1999)
Calculational System Design, Proc. of the International Summer School Marktoberdorf
-
-
Smith, D.R.1
-
38
-
-
70449626135
-
Polyhedral-model guided loop-nest auto-vectorization
-
Sept.
-
K. Trifunovic, D. Nuzman, A. Cohen, A. Zaks, and I. Rosen. Polyhedral-model guided loop-nest auto-vectorization. In PACT, Sept. 2009.
-
(2009)
PACT
-
-
Trifunovic, K.1
Nuzman, D.2
Cohen, A.3
Zaks, A.4
Rosen, I.5
-
41
-
-
58649099625
-
Algebraic signal processing theory: Cooley-Tukey type algorithms for real DFTs
-
Y. Voronenko and M. Püschel. Algebraic signal processing theory: Cooley-Tukey type algorithms for real DFTs. IEEE Transactions on Signal Processing, 57(1), 2009.
-
(2009)
IEEE Transactions on Signal Processing
, vol.57
, Issue.1
-
-
Voronenko, Y.1
Püschel, M.2
-
42
-
-
0003278639
-
Automatically tuned linear algebra software (ATLAS)
-
math-atlas.sourceforge.net
-
R. C. Whaley and J. Dongarra. Automatically Tuned Linear Algebra Software (ATLAS). In Proc. Supercomputing, 1998. math-atlas.sourceforge.net.
-
(1998)
Proc. Supercomputing
-
-
Whaley, R.C.1
Dongarra, J.2