-
1
-
-
70449893057
-
-
F. Agakov, E. Bonilla, J. Cavazos, B. Franke, G. Fursin, M. F. P. O'Boyle, J. Thomson, M. Toussaint, and C. K. I. Williams. Using machine learning to focus iterative optimization. In Proceedings of the International Symposium on Code Generation and Optimization, Mar. 2004.
-
F. Agakov, E. Bonilla, J. Cavazos, B. Franke, G. Fursin, M. F. P. O'Boyle, J. Thomson, M. Toussaint, and C. K. I. Williams. Using machine learning to focus iterative optimization. In Proceedings of the International Symposium on Code Generation and Optimization, Mar. 2004.
-
-
-
-
2
-
-
0032201716
-
An optimal algorithm for approximate nearest neighbor searching fixed dimensions
-
S. Arya, D. M. Mount, N. S. Netanyahu, R. Silverman, and A. Y. Wu. An optimal algorithm for approximate nearest neighbor searching fixed dimensions. J. ACM, 45(6):891-923, 1998.
-
(1998)
J. ACM
, vol.45
, Issue.6
, pp. 891-923
-
-
Arya, S.1
Mount, D.M.2
Netanyahu, N.S.3
Silverman, R.4
Wu, A.Y.5
-
3
-
-
0030661485
-
Optimizing matrix multiply using PHiPAC: A portable, high-performance, ANSI C coding methodology
-
June
-
J. Bilmes, K. Asanović, C.-W. Chin, and J. Demmel. Optimizing matrix multiply using PHiPAC: a portable, high-performance, ANSI C coding methodology. In Proceedings of the 1997 ACM International Conference on Supercomputing, June 1997.
-
(1997)
Proceedings of the 1997 ACM International Conference on Supercomputing
-
-
Bilmes, J.1
Asanović, K.2
Chin, C.-W.3
Demmel, J.4
-
6
-
-
70449959487
-
CHiLL: A framework for composing high-level loop transformations
-
Technical report, University of Southern California
-
C. Chen, J. Chame, and M. Hall. CHiLL: A framework for composing high-level loop transformations. Technical report, University of Southern California, 2008.
-
(2008)
-
-
Chen, C.1
Chame, J.2
Hall, M.3
-
8
-
-
84934324812
-
Using information from prior runs to improve automated tuning systems
-
Washington, DC, USA, IEEE Computer Society
-
I.-H. Chung and J. K. Hollingsworth. Using information from prior runs to improve automated tuning systems. In SC '04: Proceedings of the 2004 ACM/IEEE conference on Supercomputing, page 30, Washington, DC, USA, 2004. IEEE Computer Society.
-
(2004)
SC '04: Proceedings of the 2004 ACM/IEEE conference on Supercomputing
, pp. 30
-
-
Chung, I.-H.1
Hollingsworth, J.K.2
-
11
-
-
33746593747
-
Semi-automatic composition of loop transformations for deep parallelism and memory hierarchies
-
June
-
S. Girbal, N. Vasilache, C. Bastoul, A. Cohen, D. Parello, M. Sigler, and O. Temam. Semi-automatic composition of loop transformations for deep parallelism and memory hierarchies. International Journal of Parallel Programming, 34(3):261-317, June 2006.
-
(2006)
International Journal of Parallel Programming
, vol.34
, Issue.3
, pp. 261-317
-
-
Girbal, S.1
Vasilache, N.2
Bastoul, C.3
Cohen, A.4
Parello, D.5
Sigler, M.6
Temam, O.7
-
12
-
-
70450018329
-
-
W. Kelly, V. Maslov, W. Pugh, E. Rosser, T. Shpeisman, and D. Wonnacott. The Omega Library interface guide. Technical Report CS-TR-3445, University of Maryland at College Park, Mar. 1995.
-
W. Kelly, V. Maslov, W. Pugh, E. Rosser, T. Shpeisman, and D. Wonnacott. The Omega Library interface guide. Technical Report CS-TR-3445, University of Maryland at College Park, Mar. 1995.
-
-
-
-
13
-
-
56749175334
-
Multi-level tiling: M for the price of one
-
New York, NY, USA, ACM
-
D. Kim, L. Renganarayanan, D. Rostron, S. Rajopadhye, and M. M. Strout. Multi-level tiling: M for the price of one. In SC '07: Proceedings of the 2007 ACM/IEEE conference on Supercomputing, pages 1-12, New York, NY, USA, 2007. ACM.
-
(2007)
SC '07: Proceedings of the 2007 ACM/IEEE conference on Supercomputing
, pp. 1-12
-
-
Kim, D.1
Renganarayanan, L.2
Rostron, D.3
Rajopadhye, S.4
Strout, M.M.5
-
14
-
-
0034512401
-
-
T. Kisuki, P. M. W. Knijnenburg, and M. F. P. O'Boyle. Combined selection of tile sizes and unroll factors using iterative compilation. In PACT '00: Proceedings of the 2000 International Conference on Parallel Architectures and Compilation Techniques, page 237, Washington, DC, USA, 2000. IEEE Computer Society.
-
T. Kisuki, P. M. W. Knijnenburg, and M. F. P. O'Boyle. Combined selection of tile sizes and unroll factors using iterative compilation. In PACT '00: Proceedings of the 2000 International Conference on Parallel Architectures and Compilation Techniques, page 237, Washington, DC, USA, 2000. IEEE Computer Society.
-
-
-
-
15
-
-
70449897623
-
-
last accessed: Feb 09, 2009
-
D. M. Mount. http://www.cs.umd.edu/̃mount/ANN/. [last accessed: Feb 09, 2009].
-
-
-
Mount, D.M.1
-
16
-
-
51049092126
-
Model-guided performance tuning of parameter values: A case study with molecular dynamics visualization
-
April
-
Y. Nelson, B. Bansal, M. Hall, A. Nakano, and K. Lerman. Model-guided performance tuning of parameter values: A case study with molecular dynamics visualization. Parallel and Distributed Processing, 2008. IPDPS 2008. IEEE International Symposium on, pages 1-8, April 2008.
-
(2008)
Parallel and Distributed Processing, 2008. IPDPS 2008. IEEE International Symposium on
, pp. 1-8
-
-
Nelson, Y.1
Bansal, B.2
Hall, M.3
Nakano, A.4
Lerman, K.5
-
17
-
-
57349167317
-
Iterative optimization in the polyhedral model: Part II, multidimensional time
-
Tucson, Arizona, June, ACM Press
-
L.-N. Pouchet, C. Bastoul, A. Cohen, and J. Cavazos. Iterative optimization in the polyhedral model: Part II, multidimensional time. In ACM SIGPLAN Conference on Programming Language Design and Implementation (PLDI'08), pages 90-100, Tucson, Arizona, June 2008. ACM Press.
-
(2008)
ACM SIGPLAN Conference on Programming Language Design and Implementation (PLDI'08)
, pp. 90-100
-
-
Pouchet, L.-N.1
Bastoul, C.2
Cohen, A.3
Cavazos, J.4
-
18
-
-
33646676076
-
Automatic tuning of whole applications using direct search and a performance-based transformation system
-
A. Qasem, K. Kennedy, and J. Mellor-Crummey. Automatic tuning of whole applications using direct search and a performance-based transformation system. J. Supercomput., 36(2):183-196, 2006.
-
(2006)
J. Supercomput
, vol.36
, Issue.2
, pp. 183-196
-
-
Qasem, A.1
Kennedy, K.2
Mellor-Crummey, J.3
-
19
-
-
33845443250
-
Parallel parameter tuning for applications with performance variability
-
Washington, DC, USA, IEEE Computer Society
-
V. Tabatabaee, A. Tiwari, and J. K. Hollingsworth. Parallel parameter tuning for applications with performance variability. In SC '05: Proceedings of the 2005 ACM/IEEE conference on Supercomputing, page 57, Washington, DC, USA, 2005. IEEE Computer Society.
-
(2005)
SC '05: Proceedings of the 2005 ACM/IEEE conference on Supercomputing
, pp. 57
-
-
Tabatabaee, V.1
Tiwari, A.2
Hollingsworth, J.K.3
-
20
-
-
24344485098
-
-
R. Vuduc, J. W. Demmel, and K. A. Yelick. Oski: A library of automatically tuned sparse matrix kernels. Journal of Physics: Conference Series, 16:521-530, June 2005.
-
R. Vuduc, J. W. Demmel, and K. A. Yelick. Oski: A library of automatically tuned sparse matrix kernels. Journal of Physics: Conference Series, 16:521-530, June 2005.
-
-
-
-
23
-
-
34548765138
-
Poet: Parameterized optimizations for empirical tuning
-
March
-
Q. Yi, K. Seymour, H. You, R. Vuduc, and D. Quinlan. Poet: Parameterized optimizations for empirical tuning. Parallel and Distributed Processing Symposium, 2007. IPDPS 2007. IEEE International, pages 1-8, March 2007.
-
(2007)
Parallel and Distributed Processing Symposium, 2007. IPDPS 2007. IEEE International
, pp. 1-8
-
-
Yi, Q.1
Seymour, K.2
You, H.3
Vuduc, R.4
Quinlan, D.5
-
24
-
-
20744459570
-
Is search really necessary to generate high-performance BLAS?
-
Feb
-
K. Yotov, X. Li, G. Ren, M. Garzaran, D. Padua, K. Pingali, and P. Stodghill. Is search really necessary to generate high-performance BLAS? Proceedings of the IEEE: Special Issue on Program Generation, Optimization, and Platform Adaptation, 93(2):358-386, Feb. 2005.
-
(2005)
Proceedings of the IEEE: Special Issue on Program Generation, Optimization, and Platform Adaptation
, vol.93
, Issue.2
, pp. 358-386
-
-
Yotov, K.1
Li, X.2
Ren, G.3
Garzaran, M.4
Padua, D.5
Pingali, K.6
Stodghill, P.7
|