-
1
-
-
84891393757
-
-
Secaucus, NJ, USA: Springer-Verlag New York, Inc.
-
K. Naono, K. Teranishi, J. Cavazos, and R. Suda, Software Automatic Tuning (From Concepts to State-of-the-Art Results). Secaucus, NJ, USA: Springer-Verlag New York, Inc., 2010.
-
(2010)
Software Automatic Tuning (From Concepts to State-of-the-Art Results)
-
-
Naono, K.1
Teranishi, K.2
Cavazos, J.3
Suda, R.4
-
3
-
-
24344485098
-
Oski: A library of automatically tuned sparse matrix kernels
-
IOP Publishing
-
R. Vuduc, J. Demmel, and K. Yelick, "Oski: A library of automatically tuned sparse matrix kernels," in Journal of Physics: Conference Series, vol. 16. IOP Publishing, 2005, p. 521.
-
(2005)
Journal of Physics: Conference Series
, vol.16
, pp. 521
-
-
Vuduc, R.1
Demmel, J.2
Yelick, K.3
-
4
-
-
0036679993
-
Adaptive optimizing compilers for the 21st century
-
K. Cooper, D. Subramanian, and L. Torczon, "Adaptive optimizing compilers for the 21st century," The Journal of Supercomputing, vol. 23, no. 1, pp. 7-22, 2001.
-
(2001)
The Journal of Supercomputing
, vol.23
, Issue.1
, pp. 7-22
-
-
Cooper, K.1
Subramanian, D.2
Torczon, L.3
-
5
-
-
79956151596
-
Milepost gcc: Machine learning enabled self-tuning compiler
-
G. Fursin, Y. Kashnikov, A. Memon, Z. Chamski, O. Temam, M. Namolaru, E. Yom-Tov, B. Mendelson, A. Zaks, E. Courtois et al., "Milepost gcc: machine learning enabled self-tuning compiler," International Journal of Parallel Programming, vol. 39, no. 3, pp. 296-327, 2011.
-
(2011)
International Journal of Parallel Programming
, vol.39
, Issue.3
, pp. 296-327
-
-
Fursin, G.1
Kashnikov, Y.2
Memon, A.3
Chamski, Z.4
Temam, O.5
Namolaru, M.6
Yom-Tov, E.7
Mendelson, B.8
Zaks, A.9
Courtois, E.10
-
6
-
-
84859140308
-
Analytical bounds for optimal tile size selection
-
Tallinn, Estonia: Springer Verlag, Mar.
-
J. Shirako, K. Sharma, N. Fauzia, L.-N. Pouchet, J. Ramanujam, P. Sadayappan, and V. Sarkar, "Analytical bounds for optimal tile size selection," in ETAPS International Conference on Compiler Construction (CC'12). Tallinn, Estonia: Springer Verlag, Mar. 2012.
-
(2012)
ETAPS International Conference on Compiler Construction (CC'12)
-
-
Shirako, J.1
Sharma, K.2
Fauzia, N.3
Pouchet, L.-N.4
Ramanujam, J.5
Sadayappan, P.6
Sarkar, V.7
-
7
-
-
70449844310
-
A scalable auto-tuning framework for compiler optimization
-
Proceedings of the 2009 IEEE International Symposium on Parallel & Distributed Processing, ser. Washington, DC, USA: IEEE Computer Society
-
A. Tiwari, C. Chen, J. Chame, M. Hall, and J. K. Hollingsworth, "A scalable auto-tuning framework for compiler optimization," in Proceedings of the 2009 IEEE International Symposium on Parallel & Distributed Processing, ser. IPDPS '09. Washington, DC, USA: IEEE Computer Society, 2009, pp. 1-12.
-
(2009)
IPDPS '09
, pp. 1-12
-
-
Tiwari, A.1
Chen, C.2
Chame, J.3
Hall, M.4
Hollingsworth, J.K.5
-
8
-
-
34547683700
-
Iterative optimization in the polyhedral model: Part i, one-dimensional time
-
IEEE
-
L. Pouchet, C. Bastoul, A. Cohen, and N. Vasilache, "Iterative optimization in the polyhedral model: Part i, one-dimensional time," in Code Generation and Optimization, 2007. CGO'07. International Symposium on. IEEE, 2007, pp. 144-156.
-
(2007)
Code Generation and Optimization, 2007. CGO'07. International Symposium on
, pp. 144-156
-
-
Pouchet, L.1
Bastoul, C.2
Cohen, A.3
Vasilache, N.4
-
9
-
-
77954003716
-
Parameterized tiling revisited
-
ACM
-
M. Baskaran, A. Hartono, S. Tavarageri, T. Henretty, J. Ramanujam, and P. Sadayappan, "Parameterized tiling revisited," in Proceedings of the 8th annual IEEE/ACM international symposium on Code generation and optimization. ACM, 2010, pp. 200-209.
-
(2010)
Proceedings of the 8th Annual IEEE/ACM International Symposium on Code Generation and Optimization
, pp. 200-209
-
-
Baskaran, M.1
Hartono, A.2
Tavarageri, S.3
Henretty, T.4
Ramanujam, J.5
Sadayappan, P.6
-
10
-
-
0034512401
-
Combined selection of tile sizes and unroll factors using iterative compilation
-
Proceedings of the 2000 International Conference on Parallel Architectures and Compilation Techniques, ser. Washington, DC, USA: IEEE Computer Society
-
T. Kisuki, P. M. W. Knijnenburg, and M. F. P. O'Boyle, "Combined selection of tile sizes and unroll factors using iterative compilation," in Proceedings of the 2000 International Conference on Parallel Architectures and Compilation Techniques, ser. PACT '00. Washington, DC, USA: IEEE Computer Society, 2000, pp. 237-.
-
(2000)
PACT '00
, pp. 237
-
-
Kisuki, T.1
Knijnenburg, P.M.W.2
O'Boyle, M.F.P.3
-
12
-
-
47249109476
-
-
Secaucus, NJ, USA: Springer-Verlag New York, Inc.
-
C. A. C. Coello, G. B. Lamont, and D. A. V. Veldhuizen, Evolutionary Algorithms for Solving Multi-Objective Problems (Genetic and Evolutionary Computation). Secaucus, NJ, USA: Springer-Verlag New York, Inc., 2006.
-
(2006)
Evolutionary Algorithms for Solving Multi-Objective Problems (Genetic and Evolutionary Computation)
-
-
Coello, C.A.C.1
Lamont, G.B.2
Veldhuizen, D.A.V.3
-
13
-
-
0142000477
-
Differential Evolution - A Simple and Efficient Heuristic for Global Optimization over Continuous Spaces
-
R. Storn and K. Price, "Differential evolution: A simple and efficient heuristic for global optimization over continuous spaces," Journal of Global Optimization, vol. 11, no. 4, pp. 341-359, 1997.
-
(1997)
Journal of Global Optimization
, vol.11
, Issue.4
, pp. 341-359
-
-
Storn, R.1
Price, K.2
-
14
-
-
27744565978
-
Rough sets
-
Z. Pawlak, "Rough sets," International Journal of Parallel Programming, vol. 11, no. 5, pp. 341-356, 1982.
-
(1982)
International Journal of Parallel Programming
, vol.11
, Issue.5
, pp. 341-356
-
-
Pawlak, Z.1
-
15
-
-
84867637691
-
-
Distributed and Parallel Systems Group, University of Innsbruck. [Online]. Available
-
"Insieme compiler and runtime infrastructure." Distributed and Parallel Systems Group, University of Innsbruck. [Online]. Available: http://insieme-compiler.org
-
Insieme Compiler and Runtime Infrastructure
-
-
-
16
-
-
0000238336
-
A Simplex Method for Function Minimization
-
Jan.
-
J. A. Nelder and R. Mead, "A Simplex Method for Function Minimization," The Computer Journal, vol. 7, no. 4, pp. 308-313, Jan. 1965.
-
(1965)
The Computer Journal
, vol.7
, Issue.4
, pp. 308-313
-
-
Nelder, J.A.1
Mead, R.2
-
17
-
-
27144451406
-
Gde3: The third evolution step of generalized differential evolution
-
IEEE
-
S. Kukkonen and J. Lampinen, "Gde3: the third evolution step of generalized differential evolution," in IEEE Congress on Evolutionary Computation. IEEE, 2005, pp. 443-450.
-
(2005)
IEEE Congress on Evolutionary Computation
, pp. 443-450
-
-
Kukkonen, S.1
Lampinen, J.2
-
18
-
-
78649512137
-
Convergence speed in multi-objective metaheuristics: Efficiency criteria and empirical study
-
December
-
J. J. Durillo, A. J. Nebro, F. Luna, C. A. Coello Coello, and E. Alba, "Convergence speed in multi-objective metaheuristics: Efficiency criteria and empirical study," International Journal for Numerical Methods in Engineering, vol. 84, no. 11, pp. 1344-1375, December 2010.
-
(2010)
International Journal for Numerical Methods in Engineering
, vol.84
, Issue.11
, pp. 1344-1375
-
-
Durillo, J.J.1
Nebro, A.J.2
Luna, F.3
Coello Coello, C.A.4
Alba, E.5
-
19
-
-
0012055597
-
A micro-genetic algorithm for multiobjective optimization
-
[Online]. Available: citeseer.ist.psu.edu/444668.html
-
C. A. C. Coello and G. T. Pulido, "A micro-genetic algorithm for multiobjective optimization," Optimization, vol. 7, no. 5, pp. 126-140, 2001. [Online]. Available: citeseer.ist.psu.edu/444668.html
-
(2001)
Optimization
, vol.7
, Issue.5
, pp. 126-140
-
-
Coello, C.A.C.1
Pulido, G.T.2
-
20
-
-
70349269365
-
Demors: A hybrid multi-objective optimization algorithm using differential evolution and rough set theory for constrained problems
-
L. V. Santana-Quintero, A. G. Hernández-Díaz, J. M. Luque, C. A. C. Coello, and R. Caballero, "Demors: A hybrid multi-objective optimization algorithm using differential evolution and rough set theory for constrained problems," Computers & OR, vol. 37, no. 3, pp. 470-480, 2010.
-
(2010)
Computers & OR
, vol.37
, Issue.3
, pp. 470-480
-
-
Santana-Quintero, L.V.1
Hernández-Díaz, A.G.2
Luque, J.M.3
Coello, C.A.C.4
Caballero, R.5
-
21
-
-
10444289646
-
Code generation in the polyhedral model is easier than you think
-
C. Bastoul, "Code generation in the polyhedral model is easier than you think," in PACT'13 IEEE International Conference on Parallel Architecture and Compilation Techniques, Juan-les-Pins, France, September 2004, pp. 7-16.
-
PACT'13 IEEE International Conference on Parallel Architecture and Compilation Techniques, Juan-les-Pins, France, September 2004
, pp. 7-16
-
-
Bastoul, C.1
-
22
-
-
0033318858
-
Multiobjective evolutionary algorithms: a comparative case study and the strength pareto approach
-
DOI 10.1109/4235.797969
-
E. Zitzler and L. Thiele, "Multiobjective evolutionary algorithms: a comparative case study and the strength pareto approach," IEEE Transactions on Evolutionary Computation, vol. 3, no. 4, pp. 257-271, 1999.
-
(1999)
IEEE Transactions on Evolutionary Computation
, vol.3
, Issue.4
, pp. 257-271
-
-
Zitzler, E.1
Thiele, L.2
-
23
-
-
19344368072
-
Spiral: Code generation for dsp transforms
-
M. Puschel, J. Moura, J. Johnson, D. Padua, M. Veloso, B. Singer, J. Xiong, F. Franchetti, A. Gacic, Y. Voronenko et al., "Spiral: Code generation for dsp transforms," Proceedings of the IEEE, vol. 93, no. 2, pp. 232-275, 2005.
-
(2005)
Proceedings of the IEEE
, vol.93
, Issue.2
, pp. 232-275
-
-
Puschel, M.1
Moura, J.2
Johnson, J.3
Padua, D.4
Veloso, M.5
Singer, B.6
Xiong, J.7
Franchetti, F.8
Gacic, A.9
Voronenko, Y.10
-
24
-
-
0348209599
-
A fast fourier transform compiler
-
ACM
-
M. Frigo, "A fast fourier transform compiler," in Acm Sigplan Notices, vol. 34, no. 5. ACM, 1999, pp. 169-180.
-
(1999)
Acm Sigplan Notices
, vol.34
, Issue.5
, pp. 169-180
-
-
Frigo, M.1
-
25
-
-
85117245869
-
Active harmony: Towards automated performance tuning
-
IEEE
-
C. Tapus, I. Chung, and J. Hollingsworth, "Active harmony: Towards automated performance tuning," in Supercomputing, ACM/IEEE 2002 Conference. IEEE, 2002, pp. 44-44.
-
(2002)
Supercomputing, ACM/IEEE 2002 Conference
, pp. 44-44
-
-
Tapus, C.1
Chung, I.2
Hollingsworth, J.3
-
26
-
-
34548207355
-
Sequoia: Programming the memory hierarchy
-
IEEE
-
K. Fatahalian, T. Knight, M. Houston, M. Erez, D. Horn, L. Leem, J. Park, M. Ren, A. Aiken, W. Dally et al., "Sequoia: programming the memory hierarchy," in SC 2006 Conference, Proceedings of the ACM/IEEE. IEEE, 2006, pp. 4-4.
-
(2006)
SC 2006 Conference, Proceedings of the ACM/IEEE
, pp. 4-4
-
-
Fatahalian, K.1
Knight, T.2
Houston, M.3
Erez, M.4
Horn, D.5
Leem, L.6
Park, J.7
Ren, M.8
Aiken, A.9
Dally, W.10
-
27
-
-
67650786281
-
-
ACM
-
J. Ansel, C. Chan, Y. Wong, M. Olszewski, Q. Zhao, A. Edelman, and S. Amarasinghe, PetaBricks: a language and compiler for algorithmic choice. ACM, 2009, vol. 44, no. 6.
-
(2009)
PetaBricks: A Language and Compiler for Algorithmic Choice
, vol.44
, Issue.6
-
-
Ansel, J.1
Chan, C.2
Wong, Y.3
Olszewski, M.4
Zhao, Q.5
Edelman, A.6
Amarasinghe, S.7
-
28
-
-
84877710210
-
Petabricks: A language and compiler based on autotuning
-
M. Katevenis, M. Martonosi, C. Kozyrakis, and O. Temam, Eds. ACM
-
S. P. Amarasinghe, "Petabricks: a language and compiler based on autotuning," in HiPEAC, M. Katevenis, M. Martonosi, C. Kozyrakis, and O. Temam, Eds. ACM, 2011, p. 3.
-
(2011)
HiPEAC
, pp. 3
-
-
Amarasinghe, S.P.1
-
29
-
-
80053238973
-
Patus: A code generation and autotuning framework for parallel iterative stencil computations on modern microarchitectures
-
IEEE Computer Society
-
M. Christen, O. Schenk, and H. Burkhart, "Patus: A code generation and autotuning framework for parallel iterative stencil computations on modern microarchitectures," in Proceedings of the 2011 IEEE International Symposium on Parallel & Distributed Processing. IEEE Computer Society, 2011, pp. 676-687.
-
(2011)
Proceedings of the 2011 IEEE International Symposium on Parallel & Distributed Processing
, pp. 676-687
-
-
Christen, M.1
Schenk, O.2
Burkhart, H.3
-
30
-
-
77954022347
-
An auto-tuning framework for parallel multicore stencil computations
-
S. Kamil, C. Chan, L. Oliker, J. Shalf, and S. Williams, "An auto-tuning framework for parallel multicore stencil computations," in IPDPS, 2010, pp. 1-12.
-
(2010)
IPDPS
, pp. 1-12
-
-
Kamil, S.1
Chan, C.2
Oliker, L.3
Shalf, J.4
Williams, S.5
-
31
-
-
57349167317
-
Iterative optimization in the polyhedral model: Part ii, multidimensional time
-
ACM
-
L. Pouchet, C. Bastoul, A. Cohen, and J. Cavazos, "Iterative optimization in the polyhedral model: Part ii, multidimensional time," in ACM SIGPLAN Notices, vol. 43, no. 6. ACM, 2008, pp. 90-100.
-
(2008)
ACM SIGPLAN Notices
, vol.43
, Issue.6
, pp. 90-100
-
-
Pouchet, L.1
Bastoul, C.2
Cohen, A.3
Cavazos, J.4
-
32
-
-
70449648906
-
-
U. of Southern California, Tech. Rep
-
C. Chen, J. Chame, and M. Hall, "Chill: A framework for composing high-level loop transformations," U. of Southern California, Tech. Rep. 08-897, 2008.
-
(2008)
Chill: A Framework for Composing High-level Loop Transformations
, pp. 08-897
-
-
Chen, C.1
Chame, J.2
Hall, M.3
-
33
-
-
77954017994
-
Automated just-in-time compiler tuning
-
IEEE
-
K. Hoste, A. Georges, and L. Eeckhout, "Automated just-in-time compiler tuning," in CGO. IEEE, 2010, pp. 62-72.
-
(2010)
CGO
, pp. 62-72
-
-
Hoste, K.1
Georges, A.2
Eeckhout, L.3
-
34
-
-
65649128093
-
Peri auto-tuning
-
IOP Publishing
-
D. Bailey, J. Chame, C. Chen, J. Dongarra, M. Hall, J. Hollingsworth, P. Hovland, S. Moore, K. Seymour, J. Shin et al., "Peri auto-tuning," in Journal of Physics: Conference Series, vol. 125. IOP Publishing, 2008, p. 012089.
-
(2008)
Journal of Physics: Conference Series
, vol.125
, pp. 012089
-
-
Bailey, D.1
Chame, J.2
Chen, C.3
Dongarra, J.4
Hall, M.5
Hollingsworth, J.6
Hovland, P.7
Moore, S.8
Seymour, K.9
Shin, J.10
-
36
-
-
77954786323
-
Multiobjective exploration of compiler optimizations for real-time systems
-
P. Lokuciejewski, S. Plazar, H. Falk, P. Marwedel, and L. Thiele, "Multiobjective exploration of compiler optimizations for real-time systems," in ISORC, 2010, pp. 115-122.
-
(2010)
ISORC
, pp. 115-122
-
-
Lokuciejewski, P.1
Plazar, S.2
Falk, H.3
Marwedel, P.4
Thiele, L.5
-
38
-
-
84886006847
-
Using machine learning to focus iterative optimization
-
IEEE Computer Society
-
F. Agakov, E. Bonilla, J. Cavazos, B. Franke, G. Fursin, M. O'Boyle, J. Thomson, M. Toussaint, and C. Williams, "Using machine learning to focus iterative optimization," in Proceedings of the International Symposium on Code Generation and Optimization. IEEE Computer Society, 2006, pp. 295-305.
-
(2006)
Proceedings of the International Symposium on Code Generation and Optimization
, pp. 295-305
-
-
Agakov, F.1
Bonilla, E.2
Cavazos, J.3
Franke, B.4
Fursin, G.5
O'Boyle, M.6
Thomson, J.7
Toussaint, M.8
Williams, C.9
-
39
-
-
79953870899
-
Neural network assisted tile size selection
-
Berkeley, CA: Springer Verlag, Jun.
-
M. Rahman, L.-N. Pouchet, and P. Sadayappan, "Neural network assisted tile size selection," in International Workshop on Automatic Performance Tuning (IWAPT'2010). Berkeley, CA: Springer Verlag, Jun. 2010.
-
(2010)
International Workshop on Automatic Performance Tuning (IWAPT'2010)
-
-
Rahman, M.1
Pouchet, L.-N.2
Sadayappan, P.3
-
40
-
-
70449702074
-
Parametric multi-level tiling of imperfectly nested loops
-
M. Gschwind, A. Nicolau, V. Salapura, and J. E. Moreira, Eds. ACM
-
A. Hartono, M. M. Baskaran, C. Bastoul, A. Cohen, S. Krishnamoorthy, B. Norris, J. Ramanujam, and P. Sadayappan, "Parametric multi-level tiling of imperfectly nested loops," in ICS, M. Gschwind, A. Nicolau, V. Salapura, and J. E. Moreira, Eds. ACM, 2009, pp. 147-157.
-
(2009)
ICS
, pp. 147-157
-
-
Hartono, A.1
Baskaran, M.M.2
Bastoul, C.3
Cohen, A.4
Krishnamoorthy, S.5
Norris, B.6
Ramanujam, J.7
Sadayappan, P.8
-
41
-
-
77953999544
-
Dyntile: Parametric tiled loop generation for parallel execution on multicore processors
-
IEEE
-
A. Hartono, M. M. Baskaran, J. Ramanujam, and P. Sadayappan, "Dyntile: Parametric tiled loop generation for parallel execution on multicore processors," in IPDPS. IEEE, 2010, pp. 1-12.
-
(2010)
IPDPS
, pp. 1-12
-
-
Hartono, A.1
Baskaran, M.M.2
Ramanujam, J.3
Sadayappan, P.4
-
42
-
-
35448985754
-
Parameterized tiled loops for free
-
J. Ferrante and K. S. McKinley, Eds. ACM
-
L. Renganarayanan, D. Kim, S. V. Rajopadhye, and M. M. Strout, "Parameterized tiled loops for free," in PLDI, J. Ferrante and K. S. McKinley, Eds. ACM, 2007, pp. 405-414.
-
(2007)
PLDI
, pp. 405-414
-
-
Renganarayanan, L.1
Kim, D.2
Rajopadhye, S.V.3
Strout, M.M.4
-
44
-
-
77949633388
-
Adaptive multi-versioning for openmp parallelization via machine learning
-
Proceedings of the 2009 15th International Conference on Parallel and Distributed Systems, ser. Washington, DC, USA: IEEE Computer Society, [Online]. Available
-
X. Chen and S. Long, "Adaptive multi-versioning for openmp parallelization via machine learning," in Proceedings of the 2009 15th International Conference on Parallel and Distributed Systems, ser. ICPADS '09. Washington, DC, USA: IEEE Computer Society, 2009, pp. 907-912. [Online]. Available: http://dx.doi.org/10.1109/ICPADS.2009.77
-
(2009)
ICPADS '09
, pp. 907-912
-
-
Chen, X.1
Long, S.2
-
45
-
-
84877695633
-
-
IEEE
-
24th IEEE International Symposium on Parallel and Distributed Processing, IPDPS 2010, Atlanta, Georgia, USA, 19-23 April 2010 - Conference Proceedings. IEEE, 2010.
-
(2010)
24th IEEE International Symposium on Parallel and Distributed Processing, IPDPS 2010, Atlanta, Georgia, USA, 19-23 April 2010 - Conference Proceedings
-
-