-
3
-
-
0034512401
-
Combined selection of tile sizes and unroll factors using iterative compilation
-
Kisuki, T., Knijnenburg, P.M., O'Boyle, M.F.: Combined selection of tile sizes and unroll factors using iterative compilation. In: Proceedings of the International Conference on Parallel Architectures and Compilation Techniques, Philadelphia, PA (October 2000)
-
Proceedings of the International Conference on Parallel Architectures and Compilation Techniques, Philadelphia, PA (October 2000)
-
-
Kisuki, T.1
Knijnenburg, P.M.2
O'Boyle, M.F.3
-
4
-
-
33745158666
-
Tuning high performance kernels through empirical compilation
-
IEEE Computer Society, Los Alamitos
-
Whalley, D.B.: Tuning high performance kernels through empirical compilation. In: ICPP 2005: Proceedings of the 2005 International Conference on Parallel Processing, Washington, DC, USA, pp. 89-98. IEEE Computer Society, Los Alamitos (2005)
-
(2005)
ICPP 2005: Proceedings of the 2005 International Conference on Parallel Processing, Washington, DC, USA
, pp. 89-98
-
-
Whalley, D.B.1
-
5
-
-
26444499564
-
A code isolator: Isolating code fragments from large programs
-
Eigenmann, R., Li, Z., Midkiff, S.P. (eds.) LCPC 2004. Springer, Heidelberg
-
Lee, Y.J., Hall, M.W.: A code isolator: Isolating code fragments from large programs. In: Eigenmann, R., Li, Z., Midkiff, S.P. (eds.) LCPC 2004. LNCS, vol. 3602, pp. 164-178. Springer, Heidelberg (2005)
-
(2005)
LNCS
, vol.3602
, pp. 164-178
-
-
Lee, Y.J.1
Hall, M.W.2
-
6
-
-
33646676076
-
Automatic tuning of whole applications using direct search and a performance-based transformation system
-
Qasem, A., Kennedy, K., Mellor-Crummey, J.: Automatic tuning of whole applications using direct search and a performance-based transformation system. J. Supercomput. 36(2), 183-196 (2006)
-
(2006)
J. Supercomput.
, vol.36
, Issue.2
, pp. 183-196
-
-
Qasem, A.1
Kennedy, K.2
Mellor-Crummey, J.3
-
7
-
-
44249112350
-
PEAK - A fast and effective performance tuning system via compiler optimization orchestration
-
Pan, Z., Eigenmann, R.: PEAK - a fast and effective performance tuning system via compiler optimization orchestration. ACM Trans. Program. Lang. Syst. 30(3), 1-43 (2008)
-
(2008)
ACM Trans. Program. Lang. Syst.
, vol.30
, Issue.3
, pp. 1-43
-
-
Pan, Z.1
Eigenmann, R.2
-
8
-
-
65649128093
-
PERI auto-tuning
-
Bailey, D., Chame, J., Chen, C., Dongarra, J., Hall, M., Hollingsworth, J.K., Hovland, P., Moore, S., Seymour, K., Shin, J., Tiwari, A., Williams, S., You, H.: PERI auto-tuning. Journal of Physics: Conference Series (2008)
-
(2008)
Journal of Physics: Conference Series
-
-
Bailey, D.1
Chame, J.2
Chen, C.3
Dongarra, J.4
Hall, M.5
Hollingsworth, J.K.6
Hovland, P.7
Moore, S.8
Seymour, K.9
Shin, J.10
Tiwari, A.11
Williams, S.12
You, H.13
-
9
-
-
34247133517
-
Ablego: A function outlining and partial inlining framework: Research articles
-
Zhao, P., Amaral, J.N.: Ablego: a function outlining and partial inlining framework: Research articles. Softw. Pract. Exper. 37(5), 465-491 (2007)
-
(2007)
Softw. Pract. Exper.
, vol.37
, Issue.5
, pp. 465-491
-
-
Zhao, P.1
Amaral, J.N.2
-
13
-
-
34547491662
-
-
Technical report, University of Tennessee
-
You, H., Seymour, K., Dongarra, J.: An effective empirical search method for autmatic software tuning. Technical report, University of Tennessee (2005)
-
(2005)
An Effective Empirical Search Method for Autmatic Software Tuning
-
-
You, H.1
Seymour, K.2
Dongarra, J.3
-
14
-
-
23944476799
-
Using information from prior runs to improve automated tuning systems
-
Chung, I.H., Hollingsworth, J.K.: Using information from prior runs to improve automated tuning systems. In: SC 2004: Proceedings of the 2004 ACM/IEEE conference on Supercomputing, Washington, DC, USA, p. 30 (2004)
-
(2004)
SC 2004: Proceedings of the 2004 ACM/IEEE Conference on Supercomputing, Washington, DC, USA
, pp. 30
-
-
Chung, I.H.1
Hollingsworth, J.K.2
-
15
-
-
26444528418
-
Applying loop optimizations to object-oriented abstractions through general classification of array semantics
-
Eigenmann, R., Li, Z., Midkiff, S.P. (eds.) LCPC 2004. Springer, Heidelberg
-
Yi, Q., Quinlan, D.: Applying loop optimizations to object-oriented abstractions through general classification of array semantics. In: Eigenmann, R., Li, Z., Midkiff, S.P. (eds.) LCPC 2004. LNCS, vol. 3602, pp. 253-267. Springer, Heidelberg (2005)
-
(2005)
LNCS
, vol.3602
, pp. 253-267
-
-
Yi, Q.1
Quinlan, D.2
-
16
-
-
77952410868
-
POET: Parameterized optimizations for empirical tuning
-
Yi, Q., Seymour, K., You, H., Vuduc, R., Quinlan, D.: POET: Parameterized optimizations for empirical tuning. In: Workshop on Performance Optimization of High-Level Languages and Libraries (POHLL) (March 2007)
-
Workshop on Performance Optimization of High-Level Languages and Libraries (POHLL) (March 2007)
-
-
Yi, Q.1
Seymour, K.2
You, H.3
Vuduc, R.4
Quinlan, D.5
-
17
-
-
70449959487
-
-
Technical report, USC Computer Science
-
Chen, C., Chame, J., Hall, M.: CHiLL: A framework for composing high-level loop transformations. Technical report, USC Computer Science (2008)
-
(2008)
CHiLL: A Framework for Composing High-level Loop Transformations
-
-
Chen, C.1
Chame, J.2
Hall, M.3
-
18
-
-
0033708935
-
Semicoarsening multigrid on distributed memory machines
-
Brown, P.N., Falgout, R.D., Jones, J.E.: Semicoarsening multigrid on distributed memory machines. SIAM J. Sci. Comput. 21(5), 1823-1834 (2000)
-
(2000)
SIAM J. Sci. Comput.
, vol.21
, Issue.5
, pp. 1823-1834
-
-
Brown, P.N.1
Falgout, R.D.2
Jones, J.E.3
-
19
-
-
77952006041
-
Extending automatic parallelization to optimize high-level abstractions for multicore
-
Müller, M.S., de Supinski, B.R., Chapman, B.M. (eds.) IWOMP 2009. Springer, Heidelberg
-
Liao, C., Quinlan, D.J., Willcock, J.J., Panas, T.: Extending automatic parallelization to optimize high-level abstractions for multicore. In: Müller, M.S., de Supinski, B.R., Chapman, B.M. (eds.) IWOMP 2009. LNCS, vol. 5568, pp. 28-41. Springer, Heidelberg (2009)
-
(2009)
LNCS
, vol.5568
, pp. 28-41
-
-
Liao, C.1
Quinlan, D.J.2
Willcock, J.J.3
Panas, T.4
-
20
-
-
0032290942
-
Restructuring programs by tucking statements into functions
-
Harman, M., Gallagher, K. (eds.)
-
Lakhotia, A., Deprez, J.C.: Restructuring programs by tucking statements into functions. In: Harman, M., Gallagher, K. (eds.) Special Issue on Program Slicing. Information and Software Technology, vol. 40, pp. 677-689 (1998)
-
(1998)
Special Issue on Program Slicing. Information and Software Technology
, vol.40
, pp. 677-689
-
-
Lakhotia, A.1
Deprez, J.C.2
-
21
-
-
84978946107
-
Effective, automatic procedure extraction
-
IEEE Computer Society, Los Alamitos
-
Komondoor, R., Horwitz, S.: Effective, automatic procedure extraction. In: IWPC 2003: Proceedings of the 11th IEEE International Workshop on Program Comprehension, Washington, DC, USA, p. 33. IEEE Computer Society, Los Alamitos (2003)
-
(2003)
IWPC 2003: Proceedings of the 11th IEEE International Workshop on Program Comprehension, Washington, DC, USA
, pp. 33
-
-
Komondoor, R.1
Horwitz, S.2
-
22
-
-
0036375922
-
Experiences tuning SMG98 - A semicoarsening multigrid benchmark based on the hypre library
-
Jin, G., Mellor-Crummey, J.: Experiences tuning SMG98: a semicoarsening multigrid benchmark based on the hypre library. In: ICS 2002: Proceedings of the 16th international conference on Supercomputing, pp. 305-314. ACM, New York (2002) (Pubitemid 35040009)
-
(2002)
Proceedings of the International Conference on Supercomputing
, pp. 305-314
-
-
Jin, G.1
Mellor-Crummey, J.2
|