SCOPUS Information Retrieval Platform

Volume 24, Issue 1, 2003, Pages 43-67

Combined selection of tile sizes and unroll factors using iterative compilation

Authors (3): Knijnenburg, P.M.W. (a); Kisuki, T. (a); O'Boyle, M.F.P. (b)

b UNIVERSITY OF EDINBURGH (United Kingdom)

Author keywords

Adaptive compilation; Instruction level parallelism; Locality optimization; Program optimization; Program transformation

Indexed keywords

BENCHMARKING; COMPUTER HARDWARE DESCRIPTION LANGUAGES; COMPUTER SYSTEMS PROGRAMMING; GENETIC ALGORITHMS; PROGRAM COMPILERS; RANDOM PROCESSES; SAMPLING; SIMULATED ANNEALING;

ADAPTIVE COMPILATION; INSTRUCTION LEVEL PARALLELISM; ITERATIVE COMPILATION; LOCALITY OPTIMIZATION; LOOP TILING; PROGRAM OPTIMIZATION; PROGRAM TRANSFORMATIONS; UNROLLING;

PARALLEL PROCESSING SYSTEMS;

EID: 0037266298 PISSN: 09208542 EISSN: None Source Type: Journal
DOI: 10.1023/A:1020989410030 Document Type: Article

Times cited: 33

References (32)

1
- 0001775038
- A catalogue of optimizing transformations
- Prentice-Hall, Englewood Cliffs
- F. E. Allen and J. Cocke, A catalogue of optimizing transformations. In Design and Optimization of Compilers, pp. 1-30, Prentice-Hall, Englewood Cliffs, 1972.
- (1972) Design and Optimization of Compilers , pp. 1-30
- Allen, F.E.¹ Cocke, J.²

2
- 84862468357
- OCEANS: Optimizing compilers for embedded applications
- In P. Amestoy et al., ed.; Springer Verlag, Berlin
- M. Barreteau, F. Bodin, Z. Chamski, H.-P. Charles, C. Eisenbeis, J. Gurd, J. Hoogerbrugge, P. Hu, W. Jalby, T. Kisuki, P. M. W. Knijnenburg, P. van der Mark, A. Nisbet, M. F. P. O'Boyle, E. Rohou, A. Seznec, E. A. Stöhr, M. Treffers, and H. A. G. Wijshoff, OCEANS: Optimizing compilers for embedded applications. In P. Amestoy et al., ed., Proc. Euro-Par 99, volume 1685 of Lecture Notes in Computer Science, pp. 1171-1175, Springer Verlag, Berlin, 1999.
- (1999) Proc. Euro-Par 99, Volume 1685 of Lecture Notes in Computer Science , pp. 1171-1175
- Barreteau, M.¹ Bodin, F.² Chamski, Z.³ Charles, H.-P.⁴ Eisenbeis, C.⁵ Gurd, J.⁶ Hoogerbrugge, J.⁷ Hu, P.⁸ Jalby, W.⁹ Kisuki, T.¹⁰ Knijnenburg, P.M.W.¹¹ Van Der Mark, P.¹² Nisbet, A.¹³ O'Boyle, M.F.P.¹⁴ Rohou, E.¹⁵ Seznec, A.¹⁶ Stöhr, E.A.¹⁷ Treffers, M.¹⁸ Wijshoff, H.A.G.¹⁹

3
- 0442286501
- Transformation mechanisms in MTI
- Technical Report 2000-21, LIACS, Leiden University, Leiden
- A. J. C. Bik, P. J. Brinkhaus, P. M. W. Knijnenburg, and H. A. G. Wijshoff. Transformation mechanisms in MTI. Technical Report 2000-21, LIACS, Leiden University, Leiden, 2000.
- (2000)
- Bik, A.J.C.¹ Brinkhaus, P.J.² Knijnenburg, P.M.W.³ Wijshoff, H.A.G.⁴

4
- 84882625796
- MTI: A prototype restructuring compiler
- Technical Report 93-32 Department of Computer Science, Leiden University, Leiden
- A. J. C. Bik and H. A. G. Wijshoff, MTI: A prototype restructuring compiler. Technical Report 93-32 Department of Computer Science, Leiden University, Leiden 1993.
- (1993)
- Bik, A.J.C.¹ Wijshoff, H.A.G.²

5
- 0030661485
- Optimizing matrix multiply using PHiPAC: A portable, high-performance, ANSI C coding methodology
- ACM Press, New York
- J. Bilmes, K. Asanović, C. W. Chin, and J. Demmel. Optimizing matrix multiply using PHiPAC: A portable, high-performance, ANSI C coding methodology. In Proc. International Conference on Supercomputing, pp. 340-347, ACM Press, New York, 1997.
- (1997) Proc. International Conference on Supercomputing , pp. 340-347
- Bilmes, J.¹ Asanović, K.² Chin, C.W.³ Demmel, J.⁴

6
- 31844449895
- Iterative compilation in a non-linear optimization space
- Organized in conjunction with PACT98, Paris, France
- F. Bodin, T. Kisuki, P. M. W. Knijnenburg, M. F. P. O'Boyle, and E. Rohou. Iterative compilation in a non-linear optimization space. In Proc. ACM Workshop on Profile and Feedback Directed Compilation, 1998. Organized in conjunction with PACT98, Paris, France.
- Proc. ACM Workshop on Profile and Feedback Directed Compilation, 1998
- Bodin, F.¹ Kisuki, T.² Knijnenburg, P.M.W.³ O'Boyle, M.F.P.⁴ Rohou, E.⁵

7
- 0029749714
- Combining optimization for cache and instruction level parallelism
- IEEE Computer Society Press, Los Alamitos, Calif.
- S. Carr. Combining optimization for cache and instruction level parallelism. In Proc. Conference on Parallel Architectures and Compilation Techniques, pp. 238-247. IEEE Computer Society Press, Los Alamitos, Calif., 1996.
- (1996) Proc. Conference on Parallel Architectures and Compilation Techniques , pp. 238-247
- Carr, S.¹

8
- 0028549474
- Improving the ratio of memory operations to floating-point operations in loops
- S. Carr and K. Kennedy. Improving the ratio of memory operations to floating-point operations in loops. ACM Transactions on Programming Languages and Systems, 16(6):1768-1810, 1994.
- (1994) ACM Transactions on Programming Languages and Systems , vol.16 , Issue.6 , pp. 1768-1810
- Carr, S.¹ Kennedy, K.²

9
- 16244396196
- Feedback-directed selection and characterization of compiler optimizations
- Organized in conjunction with MICRO32
- K. Chow and Y. Wu. Feedback-directed selection and characterization of compiler optimizations. In Proc. 2nd Workshop on Feedback Directed Optimization, Haifa, 1999. Organized in conjunction with MICRO32.
- Proc. 2nd Workshop on Feedback Directed Optimization, Haifa, 1999
- Chow, K.¹ Wu, Y.²

10
- 17244367371
- Feedback directed optimization in Compaq's compilation tools for Alpha
- Organized in conjunction with MICRO32
- R. Cohn and P. G. Lowney. Feedback directed optimization in Compaq's compilation tools for Alpha. In Proc. 2nd Workshop on Feedback Directed Optimization, Haifa, 1999. Organized in conjunction with MICRO32.
- Proc. 2nd Workshop on Feedback Directed Optimization, Haifa, 1999
- Cohn, R.¹ Lowney, P.G.²

11
- 84976745804
- Tile size selection using cache organization and data layout
- ACM Press, New York
- S. Coleman and K. S. McKinley. Tile size selection using cache organization and data layout. In Proc. ACM SIGPLAN Conference on Programming Language Design and Implementation, pp. 279-290, ACM Press, New York, 1995.
- (1995) Proc. ACM SIGPLAN Conference on Programming Language Design and Implementation , pp. 279-290
- Coleman, S.¹ McKinley, K.S.²

12
- 0004148399
- John Wiley, New York
- H. Corporaal, Microprocessor Architectures: From VLIW to TTA. John Wiley, New York, 1997.
- (1997) Microprocessor Architectures: From VLIW to TTA
- Corporaal, H.¹

13
- 0004077620
- McGraw-Hill, New York
- G. de Micheli. Synthesis and Optimization of Digital Circuits. McGraw-Hill, New York, 1994.
- (1994) Synthesis and Optimization of Digital Circuits
- De Micheli, G.¹

14
- 0001366267
- Strategies for cache and local memory management by global program transformations
- D. Gannon, W. Jalby and K. Gallivan. Strategies for cache and local memory management by global program transformations. J. Parallel and Distributed Computing, 5:587-616, 1988.
- (1988) J. Parallel and Distributed Computing , vol.5 , pp. 587-616
- Gannon, D.¹ Jalby, W.² Gallivan, K.³

15
- 0001714824
- Cache miss equations: A compiler framework for analyzing and tuning memory behavior
- S. Ghosh, M. Martonosi, and S. Malik. Cache miss equations: A compiler framework for analyzing and tuning memory behavior. ACM Trans. on Programming Languages and Systems, 21(4):703-746, 1999.
- (1999) ACM Trans. on Programming Languages and Systems , vol.21 , Issue.4 , pp. 703-746
- Ghosh, S.¹ Martonosi, M.² Malik, S.³

16
- 0013107503
- Software support for improving locality in scientific codes
- Aussois
- H. Han, G. Rivera and C.-W. Tseng. Software support for improving locality in scientific codes. In Proc. Compilers for Parallel Computers, pp. 213-228, Aussois, 2000.
- (2000) Proc. Compilers for Parallel Computers , pp. 213-228
- Han, H.¹ Rivera, G.² Tseng, C.-W.³

17
- 0027595384
- The superblock: An effective technique for VLIW and superscalar compilation
- W.-M. W. Hwu, S. A. Mahlke, W. Y. Chen, P. P. Chang, N. J. Warter, R. A. Bringmann, R. G. Ouellette, R. E. Hank, T. Kiyohara, G. E. Haab, J. G. Holm, and D. M. Lavery. The superblock: An effective technique for VLIW and superscalar compilation. The Journal of Supercomputing, 7(1/2):229-248, 1993.
- (1993) The Journal of Supercomputing , vol.7 , Issue.1-2 , pp. 229-248
- Hwu, W.-M.W.¹ Mahlke, S.A.² Chen, W.Y.³ Chang, P.P.⁴ Warter, N.J.⁵ Bringmann, R.A.⁶ Ouellette, R.G.⁷ Hank, R.E.⁸ Kiyohara, T.⁹ Haab, G.E.¹⁰ Holm, J.G.¹¹ Lavery, D.M.¹²

18
- 0013103242
- Iterative compilation for tile sizes and unroll factors: Implementation, performance, search strategies
- Technical Report TR2000-06, LIACS, Leiden University, Leiden
- T. Kisuki, P. M. W. Knijnenburg, and M. F. P. O'Boyle. Iterative compilation for tile sizes and unroll factors: Implementation, performance, search strategies. Technical Report TR2000-06, LIACS, Leiden University, Leiden, 2000.
- (2000)
- Kisuki, T.¹ Knijnenburg, P.M.W.² O'Boyle, M.F.P.³

19
- 84958060114
- A feasibility study in iterative compilation
- Springer Verlag, Berlin
- T. Kisuki, P. M. W. Knijnenburg, M. F. P. O'Boyle, F. Bodin, and H. A. G. Wijshoff. A feasibility study in iterative compilation. In Proc. International Symposium on High Performance Computing, volume 1615 of Lecture Notes in Computer Science, pp. 121-132. Springer Verlag, Berlin, 1999.
- (1999) Proc. International Symposium on High Performance Computing, Volume 1615 of Lecture Notes in Computer Science , pp. 121-132
- Kisuki, T.¹ Knijnenburg, P.M.W.² O'Boyle, M.F.P.³ Bodin, F.⁴ Wijshoff, H.A.G.⁵

20
- 0002363292
- Iterative compilation in program optimization
- Aussois
- T. Kisuki, P. M. W. Knijnenburg, M. F. P. O'Boyle, and H. A. G. Wijshoff. Iterative compilation in program optimization. In Proc. Compilers for Parallel Computers, pp. 35-44, Aussois, 2000.
- (2000) Proc. Compilers for Parallel Computers , pp. 35-44
- Kisuki, T.¹ Knijnenburg, P.M.W.² O'Boyle, M.F.P.³ Wijshoff, H.A.G.⁴

21
- 0013103243
- The effect of cache models on iterative compilation for combined tiling and unrolling
- Monterey; Organized in conjunction with MICRO-33
- P. M. W. Knijnenburg, T. Kisuki, K. Gallivan, and M. F. P. O'Boyle. The effect of cache models on iterative compilation for combined tiling and unrolling. In Proc. 3rd ACM Workshop on Profile Directed and Dynamic Optimization, pp. 31-40, Monterey, 2000. Organized in conjunction with MICRO-33.
- (2000) Proc. 3rd ACM Workshop on Profile Directed and Dynamic Optimization , pp. 31-40
- Knijnenburg, P.M.W.¹ Kisuki, T.² Gallivan, K.³ O'Boyle, M.F.P.⁴

22
- 0026137116
- The cache performance and optimizations of blocked algorithms
- ACM Press, New York
- M. S. Lam, E. E. Rothberg, and M. E. Wolf. The cache performance and optimizations of blocked algorithms. In Proc. International Conference on Architectural Support for Programming Languages and Operating Systems, pp. 63-74. ACM Press, New York, 1991.
- (1991) Proc. International Conference on Architectural Support for Programming Languages and Operating Systems , pp. 63-74
- Lam, M.S.¹ Rothberg, E.E.² Wolf, M.E.³

23
- 0026980852
- Effective compiler support for predicated execution using the hyperblock
- IEEE Computer Society Press, Los Alamitos, Calif.
- S. A. Mahlke, D. C. Lin, W. Y. Chen, R. E. Hank and R. A. Bringmann. Effective compiler support for predicated execution using the hyperblock. In Proc. 25th International Symposium on Microarchitecture, pp. 45-54. IEEE Computer Society Press, Los Alamitos, Calif., 1992.
- (1992) Proc. 25th International Symposium on Microarchitecture , pp. 45-54
- Mahlke, S.A.¹ Lin, D.C.² Chen, W.Y.³ Hank, R.E.⁴ Bringmann, R.A.⁵

24
- 0038255639
- Calpa: A tool for automating dynamic compilation
- Organized in conjunction with MICRO32, Paris, France
- M. Mock, M. Berryman, C. Chambers, and S. J. Eggers, Calpa: A tool for automating dynamic compilation. In Proc. 2nd Workshop on Feedback Directed Optimization, 1999. Organized in conjunction with MICRO32, Paris, France.
- Proc. 2nd Workshop on Feedback Directed Optimization, 1999
- Mock, M.¹ Berryman, M.² Chambers, C.³ Eggers, S.J.⁴

25
- 0003502903
- Morgan Kaufmann, San Francisco
- S. S. Muchnick. Advanced Compiler Design and Implementation. Morgan Kaufmann, San Francisco, 1997.
- (1997) Advanced Compiler Design and Implementation
- Muchnick, S.S.¹

26
- 0442317946
- GAPS: Genetic algorithm optimised parallelization
- Organized in conjunction with PACT98
- A. Nisbet. GAPS: Genetic algorithm optimised parallelization. In Proc. Workshop on Profile and Feedback Directed Compilation, Paris, 1998. Organized in conjunction with PACT98.
- Proc. Workshop on Profile and Feedback Directed Compilation, Paris, 1998
- Nisbet, A.¹

27
- 0033359030
- Efficient parallelization using combined loop and data transformations
- IEEE Computer Society Press, Los Alamitos, Calif.
- M. F. P. O'Boyle and P. M. W. Knijnenburg. Efficient parallelization using combined loop and data transformations. In Proc. IEEE International Conference on Parallel Architectures and Compilation Techniques, pp. 283-291. IEEE Computer Society Press, Los Alamitos, Calif., 1999.
- (1999) Proc. IEEE International Conference on Parallel Architectures and Compilation Techniques , pp. 283-291
- O'Boyle, M.F.P.¹ Knijnenburg, P.M.W.²

28
- 26744439790
- Evaluating iterative compilation in massive optimization spaces
- Preprint, University of Edinburgh
- M. F. P. O'Boyle, P. M. W. Knijnenburg, T. Kisuki, and G. Fursin, Evaluating iterative compilation in massive optimization spaces. Preprint, University of Edinburgh, 2001.
- (2001)
- O'Boyle, M.F.P.¹ Knijnenburg, P.M.W.² Kisuki, T.³ Fursin, G.⁴

29
- 84949210195
- A comparison of compiler tiling algorithms
- Springer Verlag, Berlin
- G. Rivera and C.-W. Tseng. A comparison of compiler tiling algorithms. In Proc. 8th International Conference on Compiler Construction, Lecture Notes in Computer Science. Springer Verlag, Berlin, 1999.
- (1999) Proc. 8th International Conference on Compiler Construction, Lecture Notes in Computer Science
- Rivera, G.¹ Tseng, C.-W.²

30
- 0013149209
- Using iterative compilation for managing software pipeline-unrolling tradeoffs
- P. van der Mark, E. Rohou, F. Bodin, Z. Chamski, and C. Eisenbeis. Using iterative compilation for managing software pipeline-unrolling tradeoffs. In Proc. 4th International Workshop on Software and Compilers for Embedded Systems (SCOPES99), 1999.
- Proc. 4th International Workshop on Software and Compilers for Embedded Systems (SCOPES99), 1999
- Van Der Mark, P.¹ Rohou, E.² Bodin, F.³ Chamski, Z.⁴ Eisenbeis, C.⁵

31
- 0003418094
- Automatically tuned linear algebra software
- Technical Report UT-CS-97-366, University of Tennessee, TN
- R. C. Whaley and J. J. Dongarra. Automatically tuned linear algebra software. Technical Report UT-CS-97-366, University of Tennessee, TN, 1997.
- (1997)
- Whaley, R.C.¹ Dongarra, J.J.²

32
- 0032141341
- Combining loop transformations considering caches and scheduling
- M. E. Wolf, D. E. Maydan, and D.-K. Chen. Combining loop transformations considering caches and scheduling. International Journal of Parallel Programming, 26(4):479-503, 1998.
- (1998) International Journal of Parallel Programming , vol.26 , Issue.4 , pp. 479-503
- Wolf, M.E.¹ Maydan, D.E.² Chen, D.-K.³

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.