메뉴 건너뛰기




Volumn 24, Issue 1, 2003, Pages 43-67

Combined selection of tile sizes and unroll factors using iterative compilation

Author keywords

Adaptive compilation; Instruction level parallelism; Locality optimization; Program optimization; Program transformation

Indexed keywords

BENCHMARKING; COMPUTER HARDWARE DESCRIPTION LANGUAGES; COMPUTER SYSTEMS PROGRAMMING; GENETIC ALGORITHMS; PROGRAM COMPILERS; RANDOM PROCESSES; SAMPLING; SIMULATED ANNEALING;

EID: 0037266298     PISSN: 09208542     EISSN: None     Source Type: Journal    
DOI: 10.1023/A:1020989410030     Document Type: Article
Times cited : (33)

References (32)
  • 1
    • 0001775038 scopus 로고
    • A catalogue of optimizing transformations
    • Prentice-Hall, Englewood Cliffs
    • F. E. Allen and J. Cocke, A catalogue of optimizing transformations. In Design and Optimization of Compilers, pp. 1-30, Prentice-Hall, Englewood Cliffs, 1972.
    • (1972) Design and Optimization of Compilers , pp. 1-30
    • Allen, F.E.1    Cocke, J.2
  • 3
    • 0442286501 scopus 로고    scopus 로고
    • Transformation mechanisms in MTI
    • Technical Report 2000-21, LIACS, Leiden University, Leiden
    • A. J. C. Bik, P. J. Brinkhaus, P. M. W. Knijnenburg, and H. A. G. Wijshoff. Transformation mechanisms in MTI. Technical Report 2000-21, LIACS, Leiden University, Leiden, 2000.
    • (2000)
    • Bik, A.J.C.1    Brinkhaus, P.J.2    Knijnenburg, P.M.W.3    Wijshoff, H.A.G.4
  • 4
    • 84882625796 scopus 로고
    • MTI: A prototype restructuring compiler
    • Technical Report 93-32 Department of Computer Science, Leiden University, Leiden
    • A. J. C. Bik and H. A. G. Wijshoff, MTI: A prototype restructuring compiler. Technical Report 93-32 Department of Computer Science, Leiden University, Leiden 1993.
    • (1993)
    • Bik, A.J.C.1    Wijshoff, H.A.G.2
  • 5
    • 0030661485 scopus 로고    scopus 로고
    • Optimizing matrix multiply using PHiPAC: A portable, high-performance, ANSI C coding methodology
    • ACM Press, New York
    • J. Bilmes, K. Asanović, C. W. Chin, and J. Demmel. Optimizing matrix multiply using PHiPAC: A portable, high-performance, ANSI C coding methodology. In Proc. International Conference on Supercomputing, pp. 340-347, ACM Press, New York, 1997.
    • (1997) Proc. International Conference on Supercomputing , pp. 340-347
    • Bilmes, J.1    Asanović, K.2    Chin, C.W.3    Demmel, J.4
  • 7
    • 0029749714 scopus 로고    scopus 로고
    • Combining optimization for cache and instruction level parallelism
    • IEEE Computer Society Press, Los Alamitos, Calif.
    • S. Carr. Combining optimization for cache and instruction level parallelism. In Proc. Conference on Parallel Architectures and Compilation Techniques, pp. 238-247. IEEE Computer Society Press, Los Alamitos, Calif., 1996.
    • (1996) Proc. Conference on Parallel Architectures and Compilation Techniques , pp. 238-247
    • Carr, S.1
  • 8
    • 0028549474 scopus 로고
    • Improving the ratio of memory operations to floating-point operations in loops
    • S. Carr and K. Kennedy. Improving the ratio of memory operations to floating-point operations in loops. ACM Transactions on Programming Languages and Systems, 16(6):1768-1810, 1994.
    • (1994) ACM Transactions on Programming Languages and Systems , vol.16 , Issue.6 , pp. 1768-1810
    • Carr, S.1    Kennedy, K.2
  • 9
    • 16244396196 scopus 로고    scopus 로고
    • Feedback-directed selection and characterization of compiler optimizations
    • Organized in conjunction with MICRO32
    • K. Chow and Y. Wu. Feedback-directed selection and characterization of compiler optimizations. In Proc. 2nd Workshop on Feedback Directed Optimization, Haifa, 1999. Organized in conjunction with MICRO32.
    • Proc. 2nd Workshop on Feedback Directed Optimization, Haifa, 1999
    • Chow, K.1    Wu, Y.2
  • 14
    • 0001366267 scopus 로고
    • Strategies for cache and local memory management by global program transformations
    • D. Gannon, W. Jalby and K. Gallivan. Strategies for cache and local memory management by global program transformations. J. Parallel and Distributed Computing, 5:587-616, 1988.
    • (1988) J. Parallel and Distributed Computing , vol.5 , pp. 587-616
    • Gannon, D.1    Jalby, W.2    Gallivan, K.3
  • 15
    • 0001714824 scopus 로고    scopus 로고
    • Cache miss equations: A compiler framework for analyzing and tunig memory behavior
    • S. Gosh, M. Martonosi, and S. Malik. Cache miss equations: A compiler framework for analyzing and tunig memory behavior. ACM Trans. on Programming Languages and Systems, 21(4):703-746, 1999.
    • (1999) ACM Trans. on Programming Languages and Systems , vol.21 , Issue.4 , pp. 703-746
    • Gosh, S.1    Martonosi, M.2    Malik, S.3
  • 16
  • 18
    • 0013103242 scopus 로고    scopus 로고
    • Iterative compilation for tile sizes and unroll factors: Implementation, performance, search strategies
    • Technical Report TR2000-06, LIACS, Leiden University, Leiden
    • T. Kisuki, P. M. W. Knijnenburg, and M. F. P. O'Boyle. Iterative compilation for tile sizes and unroll factors: Implementation, performance, search strategies. Technical Report TR2000-06, LIACS, Leiden University, Leiden, 2000.
    • (2000)
    • Kisuki, T.1    Knijnenburg, P.M.W.2    O'Boyle, M.F.P.3
  • 28
    • 26744439790 scopus 로고    scopus 로고
    • Evaluating iterative compilation in massive optimization spaces
    • Preprint, University of Edinburgh
    • M. F. P. O'Boyle, P. M. W. Knijnenburg, T. Kisuki, and G. Fursin, Evaluating iterative compilation in massive optimization spaces. Preprint, University of Edinburgh, 2001.
    • (2001)
    • O'Boyle, M.F.P.1    Knijnenburg, P.M.W.2    Kisuki, T.3    Fursin, G.4
  • 31
    • 0003418094 scopus 로고    scopus 로고
    • Automatically tuned linear algebra software
    • Technical Report UT-CS-97-366, University of Tennessee, TN
    • R. C. Whaley and J. J. Dongarra. Automatically tuned linear algebra software. Technical Report UT-CS-97-366, University of Tennessee, TN, 1997.
    • (1997)
    • Whaley, R.C.1    Dongarra, J.J.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.