-
1
-
-
69049096837
-
-
Intel: Integrated Performance Primitives 5.3, User Guide
-
Intel: Integrated Performance Primitives 5.3, User Guide
-
-
-
-
2
-
-
57049117343
-
How to write fast numerical code: A small introduction
-
Lämmel, R, Visser, J, Saraiva, J, eds, Generative and Transformational Techniques in Software Engineering II, Springer, Heidelberg
-
Chellappa, S., Franchetti, F., Püschel, M.: How to write fast numerical code: A small introduction. In: Lämmel, R., Visser, J., Saraiva, J. (eds.) Generative and Transformational Techniques in Software Engineering II. LNCS, vol. 5235, pp. 196-259. Springer, Heidelberg (2008)
-
(2008)
LNCS
, vol.5235
, pp. 196-259
-
-
Chellappa, S.1
Franchetti, F.2
Püschel, M.3
-
3
-
-
0034826555
-
SPL: A language and compiler for DSP algorithms
-
Xiong, J., Johnson, J., Johnson, R., Padua, D.: SPL: A language and compiler for DSP algorithms. In: Proc. Programming Language Design and Implementation (PLDI), pp. 298-308 (2001)
-
(2001)
Proc. Programming Language Design and Implementation (PLDI)
, pp. 298-308
-
-
Xiong, J.1
Johnson, J.2
Johnson, R.3
Padua, D.4
-
4
-
-
19344368072
-
SPIRAL: Code generation for DSP transforms
-
Püschel, M., Moura, J.M.F., Johnson, J., Padua, D., Veloso, M., Singer, B., Xiong, J., Franchetti, F., Gacic, A., Voronenko, Y., Chen, K., Johnson, R.W., Rizzolo, N.: SPIRAL: Code generation for DSP transforms. Proc. of the IEEE, special issue on Program Generation, Optimization, and Adaptation 93(2), 232-275 (2005)
-
(2005)
Proc. of the IEEE, special issue on Program Generation, Optimization, and Adaptation
, vol.93
, Issue.2
, pp. 232-275
-
-
Püschel, M.1
Moura, J.M.F.2
Johnson, J.3
Padua, D.4
Veloso, M.5
Singer, B.6
Xiong, J.7
Franchetti, F.8
Gacic, A.9
Voronenko, Y.10
Chen, K.11
Johnson, R.W.12
Rizzolo, N.13
-
6
-
-
38049144052
-
A rewriting system for the vectorization of signal transforms
-
Daydé, M, Palma, J.M.L.M, Coutinho, Á .L.G.A, Pacitti, E, Lopes, J.C, eds, VECPAR 2006, Springer, Heidelberg
-
Franchetti, F., Voronenko, Y., Püschel, M.: A rewriting system for the vectorization of signal transforms. In: Daydé, M., Palma, J.M.L.M., Coutinho, Á .L.G.A., Pacitti, E., Lopes, J.C. (eds.) VECPAR 2006. LNCS, vol. 4395, pp. 363-377. Springer, Heidelberg (2007)
-
(2007)
LNCS
, vol.4395
, pp. 363-377
-
-
Franchetti, F.1
Voronenko, Y.2
Püschel, M.3
-
7
-
-
69049083172
-
-
GPCE: ACM conference on generative programming and component engineering
-
GPCE: ACM conference on generative programming and component engineering
-
-
-
-
8
-
-
0004014411
-
-
Addison-Wesley, Reading
-
Czarnecki, K., Eisenecker, U.: Generative Programming: Methods, Tools, and Applications. Addison-Wesley, Reading (2000)
-
(2000)
Generative Programming: Methods, Tools, and Applications
-
-
Czarnecki, K.1
Eisenecker, U.2
-
9
-
-
2942713451
-
Achieving extensibility through product-lines and domain-specific languages: A case study
-
Batory, D., Johnson, C., MacDonald, B., von Heeder, D.: Achieving extensibility through product-lines and domain-specific languages: A case study. ACM Transactions on Software Engineering and Methodology (TOSEM) 11(2), 191-214 (2002)
-
(2002)
ACM Transactions on Software Engineering and Methodology (TOSEM)
, vol.11
, Issue.2
, pp. 191-214
-
-
Batory, D.1
Johnson, C.2
MacDonald, B.3
von Heeder, D.4
-
11
-
-
69049089562
-
-
Smith, D.R.: Mechanizing the development of software. In: Broy, M. (ed.) Calculational System Design, Proc. of the International Summer School Marktoberdorf. NATO ASI Series. IOS Press, Amsterdam (1999); Kestrel Institute Technical Report KES.U.99.1
-
Smith, D.R.: Mechanizing the development of software. In: Broy, M. (ed.) Calculational System Design, Proc. of the International Summer School Marktoberdorf. NATO ASI Series. IOS Press, Amsterdam (1999); Kestrel Institute Technical Report KES.U.99.1
-
-
-
-
12
-
-
67649530725
-
Little language processing, an alternative to courses on compiler construction
-
Gough, K.J.: Little language processing, an alternative to courses on compiler construction. SIGCSE Bulletin 13(3), 31-34 (1981)
-
(1981)
SIGCSE Bulletin
, vol.13
, Issue.3
, pp. 31-34
-
-
Gough, K.J.1
-
13
-
-
84976711318
-
Programming pearls: Little languages
-
Bentley, J.: Programming pearls: little languages. Communications of the ACM 29(8), 711- 721 (1986)
-
(1986)
Communications of the ACM
, vol.29
, Issue.8
, pp. 711-721
-
-
Bentley, J.1
-
15
-
-
84947255563
-
DSL implementation in MetaOCaml, Template Haskell, and C++
-
Lengauer, C, Batory, D, Consel, C, Odersky, M, eds, Domain-Specific Program Generation, Springer, Heidelberg
-
Czarnecki, K., O'Donnell, J., Striegnitz, J., Taha, W.: DSL implementation in MetaOCaml, Template Haskell, and C++. In: Lengauer, C., Batory, D., Consel, C., Odersky, M. (eds.) Domain-Specific Program Generation. LNCS, vol. 3016, pp. 51-72. Springer, Heidelberg(2004)
-
(2004)
LNCS
, vol.3016
, pp. 51-72
-
-
Czarnecki, K.1
O'Donnell, J.2
Striegnitz, J.3
Taha, W.4
-
17
-
-
0002515795
-
Automatically Tuned Linear Algebra Software (ATLAS)
-
Whaley, R.C., Dongarra, J.: Automatically Tuned Linear Algebra Software (ATLAS). In: Proc. Supercomputing (1998), math-atlas.sourceforge.net
-
(1998)
Proc. Supercomputing
-
-
Whaley, R.C.1
Dongarra, J.2
-
18
-
-
1542501019
-
Sparsity: Optimization framework for sparse matrix kernels
-
Im, E.J., Yelick, K., Vuduc, R.: Sparsity: Optimization framework for sparse matrix kernels. Int'l. J. High Performance Computing Applications 18(1) (2004)
-
(2004)
Int'l. J. High Performance Computing Applications
, vol.18
, Issue.1
-
-
Im, E.J.1
Yelick, K.2
Vuduc, R.3
-
19
-
-
20744449792
-
The design and implementation of FFTW3
-
Frigo, M., Johnson, S.G.: The design and implementation of FFTW3. Proc. of the IEEE, special issue on Program Generation, Optimization, and Adaptation 93(2), 216-231 (2005)
-
(2005)
Proc. of the IEEE, special issue on Program Generation, Optimization, and Adaptation
, vol.93
, Issue.2
, pp. 216-231
-
-
Frigo, M.1
Johnson, S.G.2
-
21
-
-
20744453223
-
Synthesis of highperformance parallel programs for a class of ab initio quantum chemistry models
-
Baumgartner, G., Auer, A., Bernholdt, D.E., Bibireata, A., Choppella, V., Cociorva, D., Gao, X., Harrison, R.J., Hirata, S., Krishanmoorthy, S., Krishnan, S., Lam, C.C., Lu, Q., Nooijen, M., Pitzer, R.M., Ramanujam, J., Sadayappan, P., Sibiryakov, A.: Synthesis of highperformance parallel programs for a class of ab initio quantum chemistry models. Proc. of the IEEE, special issue on Program Generation, Optimization, and Adaptation 93(2) (2005)
-
(2005)
Proc. of the IEEE, special issue on Program Generation, Optimization, and Adaptation
, vol.93
, Issue.2
-
-
Baumgartner, G.1
Auer, A.2
Bernholdt, D.E.3
Bibireata, A.4
Choppella, V.5
Cociorva, D.6
Gao, X.7
Harrison, R.J.8
Hirata, S.9
Krishanmoorthy, S.10
Krishnan, S.11
Lam, C.C.12
Lu, Q.13
Nooijen, M.14
Pitzer, R.M.15
Ramanujam, J.16
Sadayappan, P.17
Sibiryakov, A.18
-
22
-
-
17644412337
-
The scienceof deriving dense linear algebra algorithms
-
Bientinesi, P., Gunnels, J.A., Myers, M.E., Quintana-Orti, E., van de Geijn, R.: The scienceof deriving dense linear algebra algorithms. TOMS 31(1), 1-26 (2005)
-
(2005)
TOMS
, vol.31
, Issue.1
, pp. 1-26
-
-
Bientinesi, P.1
Gunnels, J.A.2
Myers, M.E.3
Quintana-Orti, E.4
van de Geijn, R.5
-
23
-
-
0000459334
-
Rewriting
-
Robinson, A, Voronkov, A, eds, Elsevier, Amsterdam
-
Dershowitz, N., Plaisted, D.A.: Rewriting. In: Robinson, A., Voronkov, A. (eds.) Handbook of Automated Reasoning, vol. 1, pp. 535-610. Elsevier, Amsterdam (2001)
-
(2001)
Handbook of Automated Reasoning
, vol.1
, pp. 535-610
-
-
Dershowitz, N.1
Plaisted, D.A.2
-
24
-
-
69049091660
-
-
Nilsson, U., Maluszynski, J.: Logic, Programming and Prolog, 2nd edn. John Wiley & Sons Inc., Chichester (1995)
-
Nilsson, U., Maluszynski, J.: Logic, Programming and Prolog, 2nd edn. John Wiley & Sons Inc., Chichester (1995)
-
-
-
-
27
-
-
0343462141
-
Automated empirical optimization of software and the ATLAS project
-
Whaley, R.C., Petitet, A., Dongarra, J.J.: Automated empirical optimization of software and the ATLAS project. Parallel Computing 27(1-2), 3-35 (2001)
-
(2001)
Parallel Computing
, vol.27
, Issue.1-2
, pp. 3-35
-
-
Whaley, R.C.1
Petitet, A.2
Dongarra, J.J.3
-
28
-
-
20744459570
-
A comparison of empirical and model-driven optimization
-
Yotov, K., Li, X., Ren, G., Garzaran, M., Padua, D., Pingali, K., Stodghill, P.: A comparison of empirical and model-driven optimization. Proc. of the IEEE, special issue on Program Generation, Optimization, and Adaptation 93(2) (2005)
-
(2005)
Proc. of the IEEE, special issue on Program Generation, Optimization, and Adaptation
, vol.93
, Issue.2
-
-
Yotov, K.1
Li, X.2
Ren, G.3
Garzaran, M.4
Padua, D.5
Pingali, K.6
Stodghill, P.7
-
29
-
-
0025558548
-
Multilinear algebra and parallel programming
-
IEEE Computer Society Press, Los Alamitos
-
Johnson, R.W., Huang, C.H., Johnson, J.R.: Multilinear algebra and parallel programming. In: Supercomputing 1990: Proceedings of the 1990 conference on Supercomputing, pp. 20-31 IEEE Computer Society Press, Los Alamitos (1990)
-
(1990)
Supercomputing 1990: Proceedings of the 1990 conference on Supercomputing
, pp. 20-31
-
-
Johnson, R.W.1
Huang, C.H.2
Johnson, J.R.3
-
30
-
-
51049115051
-
-
PhD thesis, Electrical and Computer Engineering, Carnegie Mellon University
-
Voronenko, Y.: Library Generation for Linear Transforms. PhD thesis, Electrical and Computer Engineering, Carnegie Mellon University (2008)
-
(2008)
Library Generation for Linear Transforms
-
-
Voronenko, Y.1
-
31
-
-
67650568215
-
Computer generation of general size linear transform libraries
-
Voronenko, Y., deMesmay, F., Püschel,M.: Computer generation of general size linear transform libraries. In: Intl. Symposium on Code Generation and Optimization, CGO (2009)
-
(2009)
Intl. Symposium on Code Generation and Optimization, CGO
-
-
Voronenko, Y.1
deMesmay, F.2
Püschel, M.3
-
32
-
-
85154002090
-
Sorting networks and their applications
-
Batcher, K.: Sorting networks and their applications. In: Proc. AFIPS Spring Joint Comput. Conf., vol. 32, pp. 307-314 (1968)
-
(1968)
Proc. AFIPS Spring Joint Comput. Conf
, vol.32
, pp. 307-314
-
-
Batcher, K.1
-
33
-
-
84935113569
-
Error bounds for convolutional codes and an asymptotically optimum decoding algorithm
-
Viterbi, A.: Error bounds for convolutional codes and an asymptotically optimum decoding algorithm. IEEE Transactions on Information Theory 13(2), 260-269 (1967)
-
(1967)
IEEE Transactions on Information Theory
, vol.13
, Issue.2
, pp. 260-269
-
-
Viterbi, A.1
-
34
-
-
69049107250
-
-
submitted for publication
-
de Mesmay, F., Chellappa, S., Franchetti, F., Püschel, M.: Computer generation of efficient software Viterbi decoders: submitted for publication
-
Computer generation of efficient software Viterbi decoders
-
-
de Mesmay, F.1
Chellappa, S.2
Franchetti, F.3
Püschel, M.4
-
36
-
-
69049106226
-
High performance synthetic aperture radar image formation on commodity architectures
-
McFarlin, D., Franchetti, F., Moura, J.M.F., Püschel, M.: High performance synthetic aperture radar image formation on commodity architectures. In: SPIE Conference on Defense, Security, and Sensing (2009)
-
(2009)
SPIE Conference on Defense, Security, and Sensing
-
-
McFarlin, D.1
Franchetti, F.2
Moura, J.M.F.3
Püschel, M.4
-
37
-
-
51049115769
-
Domain-specific library generation for parallel software and hardware platforms
-
Franchetti, F.,Voronenko, Y.,Milder, P.A., Chellappa, S., Telgarsky,M., Shen, H., D'Alberto, P., deMesmay, F., Hoe, J.C., Moura, J.M.F., Püschel, M.: Domain-specific library generation for parallel software and hardware platforms. In: NSF Next Generation Software Program workshop, NSFNGS (2008)
-
(2008)
NSF Next Generation Software Program workshop, NSFNGS
-
-
Franchetti, F.1
Voronenko, Y.2
Milder, P.A.3
Chellappa, S.4
Telgarsky, M.5
Shen, H.6
D'Alberto, P.7
deMesmay, F.8
Hoe, J.C.9
Moura, J.M.F.10
Püschel, M.11
-
39
-
-
31844432305
-
Loop merging for signal transforms
-
Franchetti, F., Voronenko, Y., Püschel, M.: Loop merging for signal transforms. In: Proc. Programming Language Design and Implementation (PLDI), pp. 315-326 (2005)
-
(2005)
Proc. Programming Language Design and Implementation (PLDI)
, pp. 315-326
-
-
Franchetti, F.1
Voronenko, Y.2
Püschel, M.3
-
41
-
-
69049110689
-
-
Intel: Math Kernel Library 10.0, Reference Manual
-
Intel: Math Kernel Library 10.0, Reference Manual
-
-
-
-
42
-
-
69049096836
-
-
Goto, K.: GotoBLAS 1.26 (2008), http://www.tacc.utexas.edu/resources/ software/#blas
-
(2008)
GotoBLAS 1.26
-
-
Goto, K.1
-
44
-
-
69049101756
-
Implementation of polar format SAR image formation on the IBM Cell Broadband Engine
-
Rudin, J.A.: Implementation of polar format SAR image formation on the IBM Cell Broadband Engine. In: Proc. High Performance Embedded Computing (HPEC) (2007)
-
(2007)
Proc. High Performance Embedded Computing (HPEC)
-
-
Rudin, J.A.1
-
45
-
-
69049107251
-
-
Karn, P, FEC library version 3.0.1 August 2007
-
Karn, P.: FEC library version 3.0.1 (August 2007), http://www.ka9q.net/ code/fec/
-
-
-
|