



Volume , Issue , 2008, Pages

Positivity, posynomials and tile size selection

Author keywords

[No Author keywords available]

Indexed keywords

COST MODELS; DATA LOCALITY; FUNDAMENTAL PROPERTIES; LOOP TRANSFORMATION; MULTIPLE LEVELS; TILE SIZE;

EID: 70350786558     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/SC.2008.5213293     Document Type: Conference Paper
Times cited : (21)

References (57)
  • 2
    • 70350767611
    • R. Schreiber and J. Dongarra, "Automatic blocking of nested loops," RIACS, NASA Ames Research Center, Tech. Rep. 90.38, Aug 1990.
  • 3
    • 35449000510
    • A data locality optimizing algorithm (with retrospective)
    • M. S. Lam and M. E. Wolf, "A data locality optimizing algorithm (with retrospective)," in Best of PLDI, 1991, pp. 442-459.
    • (1991) Best of PLDI , pp. 442-459
    • Lam, M.S.1    Wolf, M.E.2
  • 9
    • 0029373981
    • Automatic partitioning of parallel loops and data arrays for distributed shared-memory multiprocessors
    • A. Agarwal, D. A. Kranz, and V. Natarajan, "Automatic partitioning of parallel loops and data arrays for distributed shared-memory multiprocessors," IEEE Trans. Parallel Distrib. Syst., vol. 6, no. 9, pp. 943-962, 1995.
    • (1995) IEEE Trans. Parallel Distrib. Syst , vol.6 , Issue.9 , pp. 943-962
    • Agarwal, A.1    Kranz, D.A.2    Natarajan, V.3
  • 12
    • 0032069399
    • On supernode transformation with minimized total running time
    • E. Hodzic and W. Shang, "On supernode transformation with minimized total running time," IEEE Trans. Parallel Distrib. Syst., vol. 9, no. 5, pp. 417-428, 1998.
    • (1998) IEEE Trans. Parallel Distrib. Syst , vol.9 , Issue.5 , pp. 417-428
    • Hodzic, E.1    Shang, W.2
  • 14
    • 38249009019
    • Tiling multidimensional iteration spaces for multicomputers
    • J. Ramanujam and P. Sadayappan, "Tiling multidimensional iteration spaces for multicomputers," J. Parallel Distrib. Comput., vol. 16, no. 2, pp. 108-120, 1992.
    • (1992) J. Parallel Distrib. Comput , vol.16 , Issue.2 , pp. 108-120
    • Ramanujam, J.1    Sadayappan, P.2
  • 16
    • 0003125942
    • Communication-minimal tiling of uniform dependence loops
    • J. Xue, "Communication-minimal tiling of uniform dependence loops," J. Parallel Distrib. Comput., vol. 42, no. 1, pp. 42-59, 1997.
    • (1997) J. Parallel Distrib. Comput , vol.42 , Issue.1 , pp. 42-59
    • Xue, J.1
  • 17
    • 0036601528
    • Time-minimal tiling when rise is larger than zero
    • J. Xue and W. Cai, "Time-minimal tiling when rise is larger than zero," Parallel Comput., vol. 28, no. 6, pp. 915-939, 2002.
    • (2002) Parallel Comput , vol.28 , Issue.6 , pp. 915-939
    • Xue, J.1    Cai, W.2
  • 22
    • 0036565622
    • Automatic partitioning of parallel loops with parallelepiped-shaped tiles
    • F. Rastello and Y. Robert, "Automatic partitioning of parallel loops with parallelepiped-shaped tiles," IEEE Trans. Parallel Distrib. Syst., vol. 13, no. 5, pp. 460-470, 2002.
    • (2002) IEEE Trans. Parallel Distrib. Syst , vol.13 , Issue.5 , pp. 460-470
    • Rastello, F.1    Robert, Y.2
  • 23
    • 70449690852
    • Optimal tile size selection guided by analytical models
    • B. B. Fraguela, M. G. Carmueja, and D. Andrade, "Optimal tile size selection guided by analytical models," in PARCO, 2005, pp. 565-572.
    • (2005) PARCO , pp. 565-572
    • Fraguela, B.B.1    Carmueja, M.G.2    Andrade, D.3
  • 27
    • 33748307622
    • An analytical model for loop tiling and its solution
    • V. Sarkar and N. Megiddo, "An analytical model for loop tiling and its solution," in Proceedings of ISPASS, 2000.
    • (2000) Proceedings of ISPASS
    • Sarkar, V.1    Megiddo, N.2
  • 31
    • 0034512401
    • T. Kisuki, P. M. W. Knijnenburg, and M. F. P. O'Boyle, "Combined selection of tile sizes and unroll factors using iterative compilation," in PACT '00: Proceedings of the 2000 International Conference on Parallel Architectures and Compilation Techniques. Washington, DC, USA: IEEE Computer Society, 2000, p. 237.
  • 32
    • 33646828918
    • Combining models and guided empirical search to optimize for multiple levels of the memory hierarchy
    • Washington, DC, USA: IEEE Computer Society
    • C. Chen, J. Chame, and M. Hall, "Combining models and guided empirical search to optimize for multiple levels of the memory hierarchy," in CGO '05: Proceedings of the international symposium on Code generation and optimization. Washington, DC, USA: IEEE Computer Society, 2005, pp. 111-122.
    • (2005) CGO '05: Proceedings of the international symposium on Code generation and optimization , pp. 111-122
    • Chen, C.1    Chame, J.2    Hall, M.3
  • 34
    • 0442295621
    • The effect of cache models on iterative compilation for combined tiling and unrolling: Research articles
    • P. M.W. Knijnenburg, T. Kisuki, K. Gallivan, and M. F. P. O'Boyle, "The effect of cache models on iterative compilation for combined tiling and unrolling: Research articles," Concurr. Comput. : Pract. Exper., vol. 16, no. 2-3, pp. 247-270, 2004.
    • (2004) Concurr. Comput. : Pract. Exper , vol.16 , Issue.2-3 , pp. 247-270
    • Knijnenburg, P.M.W.1    Kisuki, T.2    Gallivan, K.3    O'Boyle, M.F.P.4
  • 36
    • 0043048462
    • An infeasible interior-point algorithm for solving primal and dual geometric programs
    • K. O. Kortanek, X. Xu, and Y. Ye, "An infeasible interior-point algorithm for solving primal and dual geometric programs," Math. Program., vol. 76, no. 1, pp. 155-181, 1997.
    • (1997) Math. Program , vol.76 , Issue.1 , pp. 155-181
    • Kortanek, K.O.1    Xu, X.2    Ye, Y.3
  • 37
    • 0004055894
    • S. Boyd and L. Vandenberghe, Convex Optimization. Cambridge University Press. (Online version available at: http://www.stanford.edu/~boyd/cvxbook.html), 2004.
    • Convex Optimization
    • Boyd, S.1    Vandenberghe, L.2
  • 40
    • 1242352552
    • A quantitative analysis of tile size selection algorithms
    • C.-H. Hsu and U. Kremer, "A quantitative analysis of tile size selection algorithms," J. Supercomput., vol. 27, no. 3, pp. 279-294, 2004.
    • (2004) J. Supercomput , vol.27 , Issue.3 , pp. 279-294
    • Hsu, C.-H.1    Kremer, U.2
  • 41
    • 0003455775
    • Improving data locality for caches
    • Master's thesis, Rice University, September
    • K. Esseghir, "Improving data locality for caches," Master's thesis, Rice University, September 1993.
    • (1993)
    • Esseghir, K.1
  • 45
    • 0031140581
    • Automatic selection of high-order transformations in the IBM XL FORTRAN compilers
    • V. Sarkar, "Automatic selection of high-order transformations in the IBM XL FORTRAN compilers," IBM J. Res. Dev., vol. 41, no. 3, pp. 233-264, 1997.
    • (1997) IBM J. Res. Dev , vol.41 , Issue.3 , pp. 233-264
    • Sarkar, V.1
  • 50
    • 85015240805
    • On estimating and enhancing cache effectiveness
    • Fourth International Workshop on Languages and Compilers for Parallel Computing, U. Banerjee, D. Gelernter, A. Nicolau, and D. Padua, Eds, Springer Verlag, August
    • J. Ferrante, V. Sarkar, and W. Thrash, "On estimating and enhancing cache effectiveness," in Fourth International Workshop on Languages and Compilers for Parallel Computing, U. Banerjee, D. Gelernter, A. Nicolau, and D. Padua, Eds. Lecture Notes in Computer Science 589, Springer Verlag, August 1991, pp. 328-343.
    • (1991) Lecture Notes in Computer Science , vol.589 , pp. 328-343
    • Ferrante, J.1    Sarkar, V.2    Thrash, W.3
  • 51
    • 19044386208
    • An updated set of basic linear algebra subprograms (BLAS)
    • "An updated set of basic linear algebra subprograms (BLAS)," ACM Trans. Math. Softw., vol. 28, no. 2, pp. 135-151, 2002.
    • (2002) ACM Trans. Math. Softw , vol.28 , Issue.2 , pp. 135-151
  • 52
    • 20344396845
    • YALMIP : A toolbox for modeling and optimization in MATLAB
    • Taipei, Taiwan, available from
    • J. Löfberg, "YALMIP : A toolbox for modeling and optimization in MATLAB," in Proceedings of the CACSD Conference, Taipei, Taiwan, 2004, available from http://control.ee.ethz.ch/~joloef/yalmip.php.
    • (2004) Proceedings of the CACSD Conference
    • Löfberg, J.1
  • 54
    • 0029717349
    • Counting solutions to linear and nonlinear constraints through Ehrhart polynomials: Applications to analyze and transform scientific programs
    • ACM Press
    • P. Clauss, "Counting solutions to linear and nonlinear constraints through Ehrhart polynomials: applications to analyze and transform scientific programs," in Proceedings of the 10th international conference on Supercomputing. ACM Press, 1996, pp. 278-285.
    • (1996) Proceedings of the 10th international conference on Supercomputing , pp. 278-285
    • Clauss, P.1
  • 55
    • 0001714824
    • Cache miss equations: A compiler framework for analyzing and tuning memory behavior
    • S. Ghosh, M. Martonosi, and S. Malik, "Cache miss equations: a compiler framework for analyzing and tuning memory behavior," ACM Trans. Program. Lang. Syst., vol. 21, no. 4, pp. 703-746, 1999.
    • (1999) ACM Trans. Program. Lang. Syst , vol.21 , Issue.4 , pp. 703-746
    • Ghosh, S.1    Martonosi, M.2    Malik, S.3


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.