2007, Pages 170-182

Loop optimization using hierarchical compilation and kernel decomposition

Author keywords

[No Author keywords available]

Indexed keywords

HIERARCHICAL SYSTEMS; NETWORK ARCHITECTURE; OPTIMIZATION; PROGRAM COMPILERS

EID: 34547678265     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/CGO.2007.22     Document Type: Conference Paper
Times cited: 7

References (23)
  • 1
    • 34547664405
    • Tiny C compiler
    • Tiny C compiler, http://www.tinycc.org.
  • 3
    • 33646828918
    • Combining models and guided empirical search to optimize for multiple levels of the memory hierarchy
    • C. Chen, J. Chame, and M. W. Hall. Combining models and guided empirical search to optimize for multiple levels of the memory hierarchy. In CGO '05, pages 111-122, 2005.
    • (2005) CGO '05 , pp. 111-122
    • Chen, C.1    Chame, J.2    Hall, M.W.3
  • 4
    • 0029717349
    • Counting solutions to linear and nonlinear constraints through Ehrhart polynomials: Applications to analyze and transform scientific programs
    • P. Clauss. Counting solutions to linear and nonlinear constraints through Ehrhart polynomials: Applications to analyze and transform scientific programs. In ICS '96, pages 278-295, 1996.
    • (1996) ICS '96 , pp. 278-295
    • Clauss, P.1
  • 5
    • 84976745804
    • Tile size selection using cache organization and data layout
    • S. Coleman and K. S. McKinley. Tile size selection using cache organization and data layout. In PLDI '95, pages 279-290, 1995.
    • (1995) PLDI '95 , pp. 279-290
    • Coleman, S.1    McKinley, K.S.2
  • 9
    • 34547709623
    • Engineering and scientific subroutine library. Guide and Reference. IBM
    • Engineering and scientific subroutine library. Guide and Reference. IBM.
  • 10
    • 0026109335
    • Dataflow analysis of scalar and array references
    • P. Feautrier. Dataflow analysis of scalar and array references. Int. J. of Parallel Programming, 20(1):23-53, Feb. 1991.
    • (1991) Int. J. of Parallel Programming , vol.20 , Issue.1 , pp. 23-53
    • Feautrier, P.1
  • 11
    • 0033358624
    • Automatic analytical modeling for the estimation of cache misses
    • B. B. Fraguela, R. Doallo, and E. L. Zapata. Automatic analytical modeling for the estimation of cache misses. In PACT '99, page 221, 1999.
    • (1999) PACT '99 , pp. 221
    • Fraguela, B.B.1    Doallo, R.2    Zapata, E.L.3
  • 12
    • 1542392269
    • On reducing TLB misses in matrix multiplication
    • Technical report, The University of Texas at Austin, Department of Computer Sciences
    • K. Goto and R. van de Geijn. On reducing TLB misses in matrix multiplication. Technical report, The University of Texas at Austin, Department of Computer Sciences, 2002.
    • (2002)
    • Goto, K.1    van de Geijn, R.2
  • 13
    • 2942538624
    • WBTK: A new set of microbenchmarks to explore memory system performance for scientific computing
    • W. Jalby, C. Lemuet, and X. L. Pasteur. WBTK: a new set of microbenchmarks to explore memory system performance for scientific computing. Int. J. High Perform. Comput. Appl., 18(2):211-224, 2004.
    • (2004) Int. J. High Perform. Comput. Appl , vol.18 , Issue.2 , pp. 211-224
    • Jalby, W.1    Lemuet, C.2    Pasteur, X.L.3
  • 14
    • 0030685988
    • Data-centric multi-level blocking
    • I. Kodukula, N. Ahmed, and K. Pingali. Data-centric multi-level blocking. In PLDI '97, pages 346-357, 1997.
    • (1997) PLDI '97 , pp. 346-357
    • Kodukula, I.1    Ahmed, N.2    Pingali, K.3
  • 15
    • 84855816103
    • Transformations for imperfectly nested loops
    • I. Kodukula and K. Pingali. Transformations for imperfectly nested loops. In Supercomputing '96, page 12, 1996.
    • (1996) Supercomputing '96 , pp. 12
    • Kodukula, I.1    Pingali, K.2
  • 16
    • 0032068586
    • Automatic storage management for parallel programs
    • V. Lefebvre and P. Feautrier. Automatic storage management for parallel programs. Parallel Computing, 24(3-4):649-671, 1998.
    • (1998) Parallel Computing , vol.24 , Issue.3-4 , pp. 649-671
    • Lefebvre, V.1    Feautrier, P.2
  • 17
    • 34547679236
    • Intel math kernel library (Intel MKL). Intel
    • Intel math kernel library (Intel MKL). Intel.
  • 18
    • 18844383699
    • A unified framework for schedule and storage optimization
    • W. Thies, F. Vivien, J. Sheldon, and S. P. Amarasinghe. A unified framework for schedule and storage optimization. In PLDI '01, pages 232-242, 2001.
    • (2001) PLDI '01 , pp. 232-242
    • Thies, W.1    Vivien, F.2    Sheldon, J.3    Amarasinghe, S.P.4
  • 19
    • 24144471069
    • Compiler Optimization-Space Exploration
    • S. Triantafyllis, M. Vachharajani, and D. I. August. Compiler Optimization-Space Exploration. The Journal of Instruction-level Parallelism (JILP), 2005.
    • (2005) The Journal of Instruction-level Parallelism (JILP)
  • 22


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.